全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

网络文本蕴涵地理信息抽取:研究进展与展望

DOI: 10.3724/SP.J.1047.2015.00127, PP. 127-134

Keywords: 网络文本,信息抽取,地理定位,自然语言处理,地理信息

Full-Text   Cite this paper   Add to My Lib

Abstract:

互联网的普及产生了大量蕴含着丰富地理语义的文本,为地理信息的深度挖掘和知识发现带来了巨大机遇。同时,蕴含地理语义文本的异构性和动态性,使得地理实体的属性数量和种类激增、地理语义关系复杂,对地理信息检索、空间分析和推理、智能化位置服务等提出了严峻的挑战。本文阐述了网络文本蕴含地理信息抽取的技术流程,从地理实体识别、地理实体定位、地理实体属性抽取、地理实体关系构建、地理事件抽取5个方面总结了网络文本蕴含地理信息抽取的进展和关键技术瓶颈,分析了可用于网络文本蕴含地理信息抽取的开放资源,并展望了未来的发展方向。

References

[1]  Shi G, Barker K. Extraction of geospatial information on the Web for GIS applications[C]. 2011 10th IEEE International Conference on Cognitive Informatics & Cognitive Computing, 2011:41-48.
[2]  姜吉发.一种事件信息抽取模式获取方法[J].计算机工程,2005,31(15):96-98.
[3]  吴家皋,周凡坤,张雪英.HMM模型和句法分析相结合的事件属性信息抽取[J].南京师大学报(自然科学版),2014,37(1):30-34.
[4]  Jannik S, Michael G, Pavel P. Extraction and exploration of spatio-temporal information[C]. Proceedings of the 6th Workshop on Geographic Information Retrieval, 2010.
[5]  Qiu P Y, Lu F, Zhang H C. Extracting traffic information from Web texts with a D-S evidence theory based approach[C]. 2013 21st International Conference on Geoinformatics, 2013:1-5.
[6]  金澎,吴云芳,俞士汶.词义标注语料库建设综述[J].中文信息学报,2008,22(3):16-23.
[7]  Mihalcea R. The SENSEVAL 3 english lexical sample task[C]. Proceedings of ACL-SIGLEX SENSEVAL 3 worshop, 2004:25-28.
[8]  王跃龙,姬东鸿.汉语树库综述[J].当代语言学,2009,11(1):47-55.
[9]  Ballatore A, Wilson D C, Bertolotto M. A survey of volunteered open geo-knowledge bases in the semantic Web[J]. Quality Issues in the Management of Web Information Intelligent Systems Reference Library, 2013:93-120.
[10]  Jones C B, Purves R S, Clough P D, et al. Modelling vague places with knowledge from the Web[J]. International Journal of Geographical Information Science, 2008,22(10):1045-1065.
[11]  Durme B V, Qian Ting, Schubert L. Class-driven attribute extraction[C]. Proceedings of the 22nd International Conference on Computational Linguistics, 2008:921-928.
[12]  Putthividhya D P, Hu J L. Bootstrapped named entity recognition for product attribute extraction[C]. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011:1557-1567.
[13]  Raju S, Pingali P, Varma V. An unsupervised approach to product attribute extraction[C]. 31st European Conference on IR Research, 2009:6-9.
[14]  Wong T L, Lam W, Wong T S. An unsupervised framework for extracting and normalizing product attributes from multiple Web sites[C]. Proceedings of the 31st Annual International ACM SIGIR Conferenc on Research and Development in Information Retrieval, 2008:35-42.
[15]  贾真,杨宇飞,何大可,等.面向中文网络百科的属性和属性值抽取[J].北京大学学报(自然科学版),2014,50(01):41-47.
[16]  Pa?ca M, Durme B V, Garera N. The role of documents vs. queries in extracting class attributes from text[C]. Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007:485-494.
[17]  Ballatore A, Wilson D C, Bertolotto M. Computing the semantic similarity of geographic terms using volunteered lexical definitions[J]. International Journal of Geographical Information Science, 2013,27(10):2099-2118.
[18]  Li W W, Raskin R, Goodchild M F. Semantic similarity measurement based on knowledge mining: An artificial neural net approach[J]. International Journal of Geographical Information Science, 2012,26(8):1415-1435.
[19]  Matsuo Y, Sakaki T, Uchiyama K, et al. Graph-based word clustering using a Web search engine[C]. Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, 2006:542-550.
[20]  Abreu S C, Bonamigo T L, Vieira R. A review on relation extraction with an eye on Portuguese[J]. Journal of the Brazilian Computer Society, 2013,19(4):553-571.
[21]  张苇如,孙乐,韩先培.基于维基百科和模式聚类的实体关系抽取方法[J].中文信息学报,2012,26(2):75-81.
[22]  Pa?ca M. Organizing and searching the World Wide Web of facts-step two: Harnessing the wisdom of the crowds[C]. Proceedings of the 16th International Conference on World Wide Web, 2007:101-110.
[23]  张雪英,闾国年.自然语言空间关系及其在GIS中的应用研究[J].地球信息科学,2007,9(6): 77-81.
[24]  乐小虬,杨崇俊,于文洋.基于空间语义角色的自然语言空间概念提取[J].武汉大学学报(信息科学版),2005,30(12):1100-1103.
[25]  朱少楠,张雪英,张春菊.地理空间关系描述的句法模式识别[C].Proceedings of 2010 International Conference on Broadcast Technology and Multimedia Communication,2010:354-357.
[26]  赵妍妍,秦兵,车万翔,等.中文事件抽取技术研究[J].中文信息学报,2008,22(1):3-8.
[27]  许红磊,陈锦秀,周昌乐,等.自动识别事件类别的中文事件抽取技术研究[J].心智与计算,2010,4(1):33-44.
[28]  Sanderson M, Kohler J. Analyzing geographic queries[C]. SIGIR Workshop on Geographic Information Retrieval, 2004.
[29]  Piskorski J, Yangarber R. Information extraction: Past, present and future[C]. Multi-source, Multilingual Information Extraction and Summarization. Berlin Heidelberg: Springer-Verlag, 2013:23-49.
[30]  赵军,刘康,周光有,等.开放式文本信息抽取[J].中文信息学报,2011,25(6):98-110.
[31]  刘振, 张智雄.开放信息抽取技术的现状分析[J].情报杂志,2013,32(11):145-149.
[32]  Oren E, Michael C, Doug D, et al. Unsupervised named-entity extraction from the Web: An experimental study[J]. Artificial Intelligence, 2005,165(1):91-134.
[33]  Joanna B, Erdal K, Fabian M S. Inside YAGO2s: A transparent information extraction architecture[C]. Proceedings of the 22nd International Conference on World Wide Web Companion, 2013:325-328.
[34]  Daniel S W, Raphael H, Fei Wu. Using Wikipedia to bootstrap open information extraction[C]. ACM SIGMOD Record, 2008,37(4):62-68.
[35]  Michele B, Michael J C, Stephen S, et al. Open information extraction from the Web[C]. Proceedings of the 20th International Joint Conference on Artificial Intelligence, 2007:2670-2676.
[36]  Oren E, Anthony F, Janara C, et al. Open information extraction: The second generation[C]. Proceedings of the 22nd International Joint Conference on Artificial Intelligence, 2011:3-10.
[37]  Fei Wu, Daniel S W. Open information extraction using Wikipedia[C]. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010:118-127.
[38]  Alan A, Alexander L. KrakeN: N-ary facts in open information extraction[C]. Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, 2012:52-56.
[39]  Shi G, Barker K. Extraction of geospatial information on the Web for GIS applications[C]. Proceedings of the 10th IEEE International Conference on Congitive Informatics and Cognitive Computing, 2011:18-20.
[40]  Jones C B, Purves R S. Geographical information retrieval[J]. International Journal of Geographical Information Science, 2008,22(3):219-228.
[41]  Hess B, Gasimov A, Sutanto J. A universal approach that makes legacy online content location-based[C]. Proceedings of the 10th International Conference on Mobile and Ubiquitous Multimedia, 2011:127-133.
[42]  Sundheim B M. Overview of results of the MUC-6 evaluation[C]. Proceedings of the 6th Conference on Message Understanding, 1995:13-31.
[43]  黄德根,岳广玲,杨元生.基于统计的中文地名识别[J].中文信息学报,2002,17(2):36-41.
[44]  Florian A T, Philip D S, Christopher B J. Mining the Web to detect place names[C]. Proceedings of the 2nd International Workshop on Geographic Information Retrieval, 2008:43-44.
[45]  唐旭日,陈小荷,许超,等.基于篇章的中文地名识别研究[J].中文信息学报,2010,24(2):24-32.
[46]  Clare D. Reading geography between the lines: Extracting local place knowledge from text[J]. Spatial Information Theory, 2013,8116:320-337.
[47]  Mónica M, Julián U, Sonia S C, et al. Named entity recognition: Fallacies, challenges and opportunities[J]. Computer Standards & Interfaces, 2013,35(5): 482-489.
[48]  张雪英,闾国年,李伯秋,等.基于规则的中文地址要素解析方法[J].地球信息科学学报,2010,12(1):9-16.
[49]  乐小虬,杨崇俊,刘冬林.空间命名实体的识别[J].计算机工程,2005,31(20):49-53.
[50]  唐旭日,陈小荷,张雪英.中文文本的地名解析方法研究[J].武汉大学学报(信息科学版),2010,35(8): 930-935.
[51]  周俊生,戴新宇,尹存燕,等.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809.
[52]  胡万亭,杨燕,尹红风,等.一种基于词频统计的组织机构名识别方法[J].计算机应用研究,2013,30(7):2014-2016.
[53]  李玉森,张雪英,袁正午.面向GIS的地理命名实体识别研究[J].重庆邮电大学学报(自然科学版),2008,20(6):719-724.
[54]  Liu X H, Wei F R, Zhang S D, et al. Named entity recogintion for Tweets[C]. ACM Transactions on Intelligent Systems and Technology, 2013:1-15.
[55]  朱少楠,张雪英,李明,等.基于行政隶属关系树状图的地名消歧方法[J].地理与地理信息科学,2013,29(3):39-42.
[56]  Buscaldi D, Rosso P. A conceptual density-based approach for the disambiguation of toponyms[J]. International Journal of Geographical Information Science, 2008,22(3):301-313.
[57]  Lee L H, Yu Y T, Huang C R. Chinese WordNet domains: Bootstrapping Chinese WordNet with semantic domain labels[C]. Proceedings of PACLIC, 2009:288-296.
[58]  Lieberman M D, Samet H. Multifaceted toponym recognition for streaming news[C]. Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011:843-852.
[59]  王瑞琴,孔繁胜.无监督词义消歧研究[J].软件学报,2009,20(8):2138-2152.
[60]  刘瑜,袁一泓,张毅.基于认知的模糊地理要素建模——以中关村为例[J].遥感学报,2008,12(2): 370-377.
[61]  V?gele T, Schlieder C, Visser U. Intuitive modelling of place name regions for spatial information retrieval[C]. Spatial Information Theory: Foundations of Geographic Information Science, Berlin Heidelberg: Springer-Verlag, 2003:239-252.
[62]  Steven S, Philip D S, Alia I A, et al. Mining topological relations from the Web[C]. Proceedings of the 19th International Workshop on Database and Expert Systems Application, 2008:652-656.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133