2016

Text topic mining of archives research based on SVD

DOI: 10.6040/j.issn.1671-9352.1.2015.C03

Keywords: term-document matrix, singular value decomposition, topic mining, weight design, archives project


Abstract:

Projects in the archival science field funded by the National Social Science Fund of China from 2010 to 2014 were collected. The project titles were preprocessed (word segmentation and related steps) to obtain a term-document matrix. Local and global weights were designed according to term importance and then combined to give the weighted entries of the term-document matrix. Singular value decomposition (SVD) was used for feature dimension reduction, and the research topics of the funded archival science projects over these five years were examined at different dimensions. Visual analysis yielded seven major research topics: intangible cultural heritage protection, electronic records management, digital resource construction and systems, value and mining of archival information resources, archival protection mechanisms, research on archive institutions, and archival information security.
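The pipeline described in the abstract (term-document matrix, combined local and global weights, SVD reduction) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which weights the authors designed, so log-entropy weighting (a common choice in latent semantic analysis) is swapped in here, and the tokenized titles, the function names, and the choice k = 2 are all hypothetical.

```python
import numpy as np

def log_entropy_weights(counts):
    """Weight a term-document count matrix (rows: terms, columns: documents).

    Assumed scheme, not taken from the paper:
      local weight  = log2(1 + count)
      global weight = 1 + sum_j p_ij * log2(p_ij) / log2(n_docs)
    Requires at least two documents so that log2(n_docs) is nonzero.
    """
    counts = np.asarray(counts, dtype=float)
    n_docs = counts.shape[1]
    local = np.log2(1.0 + counts)
    row_sums = counts.sum(axis=1, keepdims=True)
    # p[i, j]: share of term i's occurrences that fall in document j
    p = np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)
    plogp = np.zeros_like(p)
    mask = p > 0
    plogp[mask] = p[mask] * np.log2(p[mask])
    global_w = 1.0 + plogp.sum(axis=1) / np.log2(n_docs)
    return local * global_w[:, None]

def reduce_with_svd(weighted, k):
    """Keep the k largest singular triplets of the weighted matrix."""
    u, s, vt = np.linalg.svd(weighted, full_matrices=False)
    return u[:, :k], s[:k], vt[:k, :]

# Hypothetical, already-segmented project titles (illustrative data only).
docs = [
    ["electronic", "records", "management"],
    ["archival", "information", "security"],
    ["electronic", "records", "security"],
    ["digital", "resource", "construction"],
]

vocab = sorted({term for doc in docs for term in doc})
index = {term: i for i, term in enumerate(vocab)}

# Build the raw term-document count matrix.
counts = np.zeros((len(vocab), len(docs)))
for j, doc in enumerate(docs):
    for term in doc:
        counts[index[term], j] += 1

weighted = log_entropy_weights(counts)
u_k, s_k, vt_k = reduce_with_svd(weighted, k=2)

# Each row gives one document's coordinates in the reduced k-dimensional space.
doc_coords = (s_k[:, None] * vt_k).T
```

Under this scheme a term concentrated in a single title keeps its full local weight, while a term spread evenly over all titles is weighted down to zero. Documents can then be compared or plotted using the rows of `doc_coords`, and rerunning with a different `k` corresponds to examining topics at a different dimension.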

