|
A VECTOR SPACE MODEL FOR INFORMATION RETRIEVAL: A MATLAB APPROACH
Keywords: tf, idf, vector space model, cosine similarities, term-document, term-query matrices, dot products.
Abstract: Three classic models have been widely used for information retrieval: Boolean, vector space, and probabilistic. The Boolean model is a lightweight model that matches documents against a query with precise semantics. Because of its Boolean nature, its results may be tied (unranked) and partial matches are missed. The vector space model, by contrast, weights terms by term frequency and inverse document frequency, which allows partial matching and lets retrieved documents be ranked by relevance. This paper implements an information retrieval system based on the vector space model in MATLAB, using the Cranfield collection of aerodynamics documents, and discusses the issues that arise.
|
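To make the ranking idea in the abstract concrete, the following is a minimal sketch of tf-idf weighting and cosine-similarity ranking. It is written in Python for illustration only and is not the paper's MATLAB implementation. The weighting variant (raw term frequency times log(N/df)), the function names `tfidf_vectors`, `cosine`, and `rank`, and the toy documents are all assumptions introduced here, not taken from the paper or from the Cranfield collection.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vectors, idf) for a list of tokenized documents.

    Each vector is a dict mapping term -> tf * idf, where tf is the raw
    count in the document and idf = log(N / df). This is one common
    variant, assumed here for illustration.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    idf = {term: math.log(n_docs / count) for term, count in df.items()}
    vectors = [
        {term: tf * idf[term] for term, tf in Counter(doc).items()}
        for doc in docs
    ]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector matches nothing
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return (doc_index, score) pairs sorted by descending similarity."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms that never occur in the collection are ignored.
    q_vec = {t: tf * idf[t] for t, tf in Counter(query).items() if t in idf}
    scores = [(i, cosine(q_vec, vec)) for i, vec in enumerate(vectors)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy collection (not Cranfield data).
toy_docs = [
    ["boundary", "layer", "flow"],
    ["supersonic", "flow", "wing"],
    ["heat", "transfer", "boundary", "layer"],
]
ranking = rank(["boundary", "layer"], toy_docs)
```

Unlike a Boolean match, the third toy document still receives a nonzero score here even though it contains extra terms, and the documents come back in a ranked order rather than as an unordered set.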