All Title Author
Keywords Abstract


CADWeb: categoriza??o automática de documentos digitais

DOI: 10.1590/S0100-19652011000100005

Keywords: information technology, categorization, digital libraries, text mining, digital documents.

Full-Text   Cite this paper   Add to My Lib

Abstract:

the evolution of information technology and dissemination of digital documents on the web calls for a mechanism for the organization of such documents in order to facilitate the search and recall processes. in digital libraries or repositories of electronic works, for example, there is a need for tools that will automatically classify documents, since the classification process (categorizations) is done manually. such a tool will represent an important resource and support for cataloging. this article presents the development of a tool whose chief objective is to categorize digital documents automatically, using pre-established categories, where each document will belong to one or more categories according to its content, thus making the classification of such documents more efficient and also quicker. techniques and algorithms of text mining were used to develop and validate the tool; also, some categories were defined in the case study, as well as related terms such as: information technology, law and physics.

Full-Text

comments powered by Disqus