%0 Journal Article %T Análisis de los descriptores de diferentes áreas del conocimiento indizadas en bases de datos del CSIC. Aplicación a la indización automática %A Gil Leiva, Isidoro %A Rodríguez Muñoz, José V. %J Revista Española de Documentación Científica %D 1997 %I Consejo Superior de Investigaciones Científicas %X The value of the titles and abstracts of scientific articles as sources of terms for document indexing is studied across six knowledge areas: Library and Information Science, Medicine, Chemistry, Biology, Psychology and Physics, as indexed in the ISOC, IME and ICYT databases of the CSIC. The syntagmatic structure of the indexing terms found in the Descriptors field is also examined, as well as the relationship between the length of the documents and the number of descriptors. To this end, six searches were run in the databases for the six knowledge areas, and 450 bibliographic references were selected (75 per knowledge area), yielding 2,077 descriptors; of these, 38.1% appear in the title, in the abstract, or in both. Regarding syntactic structure, 41.9% of the descriptors were nouns, 32.3% were noun+adjective groups, and 11.8% were noun+noun groups, with the remaining 14% taking other structures. 
Lastly, regarding the relationship between document length and number of descriptors, all possible combinations were found: short articles with few descriptors, long articles with few descriptors, short articles with many descriptors, and documents with a high number of both pages and descriptors. The following conclusions can be drawn from the data obtained: first, if abstracts are not well written and titles are not precise, they are not definitive sources for the extraction of concepts; second, the most common syntactic structure is the noun phrase, followed by noun+adjective and noun+noun; third, no significant relationship is found between the length of a document and the number of descriptors assigned to it. The value of the titles and abstracts of scientific articles as sources of terms for document indexing is studied in six knowledge areas indexed in the ISOC, IME and ICYT databases of the CSIC. The syntagmatic structure of the indexing terms found in the Descriptors field is also examined, along with the possible relationship between the number of descriptors of a document and its number of pages. For these purposes, the knowledge areas of Library and Information Science, Medicine, Chemistry, Biology, Psychology and Physics were selected, and six searches were run in these databases, from which we selected 450 bibliographic references (75 per area), yielding a total of 2,077 descriptors. 
Of the descriptors assigned to these records, 38.1% appear in the title, in the abstract, or in the title and the abstract at the same ... %K Descriptors analysis %K linguistic analysis %K statistical analysis %K automatic indexing %K CSIC databases %K Análisis de descriptores %K análisis lingüístico %K análisis estadístico %K indización automática %K bases de datos del CSIC %U http://redc.revistas.csic.es/index.php/redc/article/view/589/664