全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

GLIMIR: Manifestation and Content Clustering within WorldCat

Full-Text   Cite this paper   Add to My Lib

Abstract:

The GLIMIR project at OCLC clusters and assigns an identifier to WorldCat records representing the same manifestation. These include parallel records in different languages (e.g., a record with English descriptive notes and subject headings and one for the same book with French equivalents). It also clusters records that probably represent the same manifestation, but which could not be safely merged by OCLC's Duplicate Detection and Resolution (DDR) program for various reasons. As the project progressed, it became clear that it would also be useful to create content-based clusters for groups of manifestations that are generally equivalent from the end user perspective (e.g., the original print text with its microform, ebook and reprint versions, but not new editions). Lessons from the GLIMIR project have improved OCLC's duplicate detection program through the introduction of new matching techniques. GLIMIR has also had unexpected benefits for OCLC's FRBR algorithm by providing new methods for identifying outliers thus enabling more records to be included in the correct work cluster.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133