A Web-Based Method for Large-Scale Extraction of Social Relations between People

pp. 740-744

Keywords: social relations between people, description pattern, relation extraction, simulated annealing

Abstract:

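Social relations between people are an important class of information on the Web. This paper proposes a lightweight method for extracting such relations at large scale. It uses simulated annealing to iteratively discover the minimal set of description patterns that express social relations between people in Web pages, and it exploits the redundancy of Web information to extract relation data efficiently and accurately. To validate the method, six types of social relation are defined and extracted using a large-scale list of person names collected from the Web. Experimental results show that the method achieves an average precision of 84.79% and an average recall of 81.69%.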
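The abstract does not give the objective function, neighbor move, or cooling schedule used in the paper, so the following is only a minimal sketch of how simulated annealing could search for a small pattern set that still covers the known relation instances. Every name here (`cost`, `anneal_pattern_set`, `coverage`, `size_weight`) and every parameter value is a hypothetical choice made for illustration, not the authors' implementation.

```python
import math
import random


def cost(selected, coverage, universe, size_weight=0.1):
    """Assumed objective: one unit per uncovered relation instance,
    plus a small charge per pattern kept, so smaller sets are preferred."""
    covered = set()
    for pattern in selected:
        covered |= coverage[pattern]
    return len(universe - covered) + size_weight * len(selected)


def anneal_pattern_set(coverage, t_start=1.0, t_end=1e-3, cooling=0.95,
                       steps_per_temp=50, seed=0):
    """Search for a low-cost subset of patterns.

    coverage maps each candidate pattern to the set of relation-instance
    ids it matches (hypothetical input format).
    """
    rng = random.Random(seed)
    patterns = sorted(coverage)
    universe = set().union(*coverage.values())

    current = set(patterns)  # start from all candidate patterns
    current_cost = cost(current, coverage, universe)
    best, best_cost = set(current), current_cost

    t = t_start
    while t > t_end:
        for _ in range(steps_per_temp):
            # Neighbor move: toggle one randomly chosen pattern in or out.
            candidate = current ^ {rng.choice(patterns)}
            candidate_cost = cost(candidate, coverage, universe)
            delta = candidate_cost - current_cost
            # Always accept improvements; accept a worse state with
            # probability exp(-delta / t) (Metropolis criterion).
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                current, current_cost = candidate, candidate_cost
                if current_cost < best_cost:
                    best, best_cost = set(current), current_cost
        t *= cooling  # geometric cooling schedule
    return best, best_cost
```

Called with a toy dictionary such as `{"X's wife Y": {1, 2, 3}, "X married Y": {3, 4}}`, the function returns the best subset it visited together with that subset's cost. The size penalty is what pushes the search toward a minimal set; a real system would also need a way to score pattern precision, which this sketch leaves out.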
