|
EXTRACTING IMAGES FROM THE WEB USING DATA MINING TECHNIQUEKeywords: Image retrieval , object recognition , computer vision , weakly supervised. Abstract: The objective of this work is to generate a large number of images for specified object class. The approach is to em-ploy text, metadata and visual features and to use to gather many high quality images from the web. Candidates images are obtained by text based web search. The web page and the images are downloaded. The task is to remove irrelevant images and to re-rank. First, the images query page is down-loaded. Second, it extracts images URL from downloaded page and place it in the database then ranking is done based on text surrounding and metadata features. SVM and Naive bayes classifier algorithm are compared for ranking. The top ranked images are used as training data and an SVM visual classifier is learned to improve re-ranking. The principal idea of the overall method is in combining text or metadata or visual features in order to achieve a completely automatic ranking of images
|