All Title Author
Keywords Abstract


Semantic and Time-Dependent Expertise Profiling Models in Community-Driven Knowledge Curation Platforms

DOI: 10.3390/fi5040490

Keywords: knowledge acquisition, knowledge representation, semantic Web, text processing, expertise profiling, expertise visualization

Full-Text   Cite this paper   Add to My Lib

Abstract:

Online collaboration and web-based knowledge sharing have gained momentum as major components of the Web 2.0 movement. Consequently, knowledge embedded in such platforms is no longer static and continuously evolves through experts’ micro-contributions. Traditional Information Retrieval and Social Network Analysis techniques take a document-centric approach to expertise modeling by creating a macro-perspective of knowledge embedded in large corpus of static documents. However, as knowledge in collaboration platforms changes dynamically, the traditional macro-perspective is insufficient for tracking the evolution of knowledge and expertise. Hence, Expertise Profiling is presented with major challenges in the context of dynamic and evolving knowledge. In our previous study, we proposed a comprehensive, domain-independent model for expertise profiling in the context of evolving knowledge. In this paper, we incorporate Language Modeling into our methodology to enhance the accuracy of resulting profiles. Evaluation results indicate a significant improvement in the accuracy of profiles generated by this approach. In addition, we present our profile visualization tool, Profile Explorer, which serves as a paradigm for exploring and analyzing time-dependent expertise profiles in knowledge-bases where content evolves overtime. Profile Explorer facilitates comparative analysis of evolving expertise, independent of the domain and the methodology used in creating profiles.

References

[1]  Sampson, M. Expertise Profiles—How Links to Contributions Changed the Dynamics at IBM. Available online: http://currents.michaelsampson.net/2011/07/expertise-profiles.html (accessed on 30 September 2013).
[2]  O’Reilly, T.; Musser, J. Web 2.0: Principles and Best Practices; O’Reilly Media: Sebastopol, CA, USA, 2006.
[3]  Berners-Lee, T.; Hendler, J.; Lassila, O. The semantic web. Sci. Am. 2001, 284, 34–43, doi:10.1038/scientificamerican0501-34.
[4]  Clark, T.; Kinoshita, J. AlzForum and SWAN: The present and future of scientific Web communities. Brief. Bioinforma. 2007, 8, 163–171, doi:10.1093/bib/bbm012.
[5]  Gene Wiki. Available online: http://en.wikipedia.org/wiki/Gene_Wiki (accessed on 30 September 2013).
[6]  Zhang, J.; Tang, J.; Li, J. Expert finding in a social network. Adv. Databases 2007, 4443, 1066–1069.
[7]  Ziaimatin, H.; Groza, T.; Bordea, G.; Buitelaar, P.; Hunter, J. Expertise profiling in evolving knowledge-curation platforms. Glob. Sci. Technol. Forum J. Comput. 2012, 2, 118–127.
[8]  Jonquet, C.; Shah, N.; Musen, M. The Open Biomedical Annotator. In Proceedings of the Summit of Translational Bioinformatics, San Francisco, CA, USA, 15–17 March 2009; pp. 56–60.
[9]  Thiagarajan, R.; Manjunath, G.; Stumptner, M. Finding Experts by Semantic Matching of User Profiles. Technical Report HPL-2008-172; HP Laboratories: Karlruhe, Germany, 2008.
[10]  Ziaimatin, H. DC Proposal: Capturing Knowledge Evolution and Expertise in Community-Driven Knowledge Curation Platforms. In Proceedings of the International Semantic Web Conference, Bonn, Germany, 23–27 October 2011.
[11]  Mons, B.; Velterop, J. Nano-Publication in the E-Science Era. In Proceedings of the Workshop on Semantic Web Applications in Scientific Discourse, Washington, DC, USA, 25–29 October 2009.
[12]  Casati, F.; Giunchiglia, F.; Marchese, M. Liquid Publications, Scientific Publications Meet the Web. Technical Rep. DIT-07-073, Informatica e Telecomunicazioni; University of Trento: Trento, Italy, 2007.
[13]  Wikipedia:WikiProject Molecular and Cellular Biology. Available online: http://en.wikipedia.org/wiki/Wikipedia:MCB (accessed on 30 September 2013).
[14]  Wikipedia:WikiProject Genetics. Available online: http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Genetics (accessed on 30 September 2013).
[15]  Hoffmann, R. A Wiki for the Life Sciences where Authorship Matters. Available online: http://www.nature.com/ng/journal/v40/n9/full/ng.f.217.html (accessed on 30 September 2013).
[16]  OMIM Online Mendelian Inheritance in Man. Available online: http://omim.org (accessed on 30 September 2013).
[17]  Ziaimatin, H.; Groza, T.; Hunter, J. Expertise Modelling in Community-driven Knowledge Curation Platforms. In Proceedings of the 7th Australasian Ontology Workshop, Co-Located with AI 2011, Perth, Australia, 4 December 2011.
[18]  Ziaimatin, H. Profile Explorer (tested only on Firefox). Available online: http://skeletome.metadata.net/dpro/handler/profile/explorer (accessed on 30 September 2013).
[19]  Jonquet, C.; Musen, M.; Shah, N. Building a biomedical ontology recommender web service. J. Biomed. Semant. 2010, 1 (Suppl 1), S1:1–S1:18.
[20]  Stemming and Lemmatization. Available online: http://nlp.stanford.edu/IR-book/html/htmledition/stemming-and-lemmatization-1.html (accessed on 30 September 2013).
[21]  Lemmatisation. Available online: http://en.wikipedia.org/wiki/Lemmatisation (accessed on 30 September 2013).
[22]  Liu, H.; Christiansen, T.; Baumgartner, W.A.; Verspoor, K. BioLemmatizer: A lemmatization tool for morphological processing of biomedical text. J. Biomed. Semant. 2012, 3, 3:1–3:29.
[23]  Language Model. Available online: http://en.wikipedia.org/wiki/Language_model (accessed on 30 September 2013).
[24]  Blei, D.M. Topic Modeling. Available online: http://www.cs.princeton.edu/~blei/topicmodeling.html (accessed on 30 September 2013).
[25]  De Kok, D.; Brouwer, H. Natural Language Processing for the Working Programmer. Available online: http://nlpwp.org/book/index.xhtml (accessed on 30 September 2013).
[26]  Blei, D.M. Probabilistic topic models. Commun. ACM 2011, 55, 77–84, doi:10.1145/2133806.2133826.
[27]  Blei, D.M.; Ng, A.; Jordan, M. Latent Dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022.
[28]  Groza, T.; Zankl, A.; Li, Y.-F.; Hunter, J. Using Semantic Web Technologies to Build a Community-Driven Knowledge Curation Platform for the Skeletal Dysplasia Domain. In Proceedings of the 10th International Semantic Web Conference, Bonn, Germany, 23–27 October 2011; pp. 81–96.
[29]  N-gram. Available online: http://en.wikipedia.org/wiki/N-gram (accessed on 1 October 2013).
[30]  Timeline JS. Available online: http://timeline.verite.co (accessed on 1 October 2013).
[31]  Data-Driven Documents. Available online: http://d3js.org (accessed on 1 October 2013).
[32]  SciVal Experts. Available online: http://info.scival.com/experts (accessed on 1 October 2013).
[33]  BiomedExperts. Available online: http://www.biomedexperts.com/ (accessed on 1 October 2013).
[34]  Text REtrieval Conference (TREC). Available online: http://trec.nist.gov/ (accessed on 1 October 2013).
[35]  Zhu, J.; Song, D.; Rueger, S. Integrating multiple windows and document features for expert finding. J. Am. Soc. Inf. Sci. Technol. 2009, 60, 694–715.
[36]  Yang, L.; Zhang, W. A Study of the Dependencies in Expert Finding. In Proceedings of the 2010 Third International Conference on Knowledge Discovery and Data Mining, Phuket, Thailand, 9–10 January 2010.
[37]  Demartini, G. Finding Experts Using Wikipedia. In Proceedings of the ExpertFinder Workshop, Co-Located with ISWC 2007, Busan, Korea, 11–15 November 2007.
[38]  SemEval-2007. Available online: http://nlp.cs.swarthmore.edu/semeval/ (accessed on 1 October 2013).
[39]  Fuhr, N.; Govert, N.; Kazai, G.; Lalmas, M. INEX: INitiative for the Evaluation of XML Retrieval. In Proceedings of the SIGIR 2002 Workshop on XML and Information Retrieval, Tampere, Finland, 11–15 August 2002.
[40]  Balog, K.; de Rijke, M. Determining Expert Profiles (with an Application to Expert Finding). In Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India, 6–12 January 2007; pp. 2657–2662.
[41]  Balog, K. EARS. Available online: http://code.google.com/p/ears/ (accessed on 1 October 2013).
[42]  Price, S.; Flach, P.A.; Spiegler, S.; Bailey, C.; Rogers, N. SubSift Web Services and Workflows for Profiling and Comparing Scientists and Their Published Works. In Proceedings of the 2010 IEEE 6th International Conference on e-Science, Brisbane, Australia, 7–10 December 2010.
[43]  Aleman-Meza, B.; Bojars, U.; Boley, H.; Breslin, J.; Mochol, M.; Nixon, L.; Polleres, A.; Zhdanova, A. Combining RDF Vocabularies for Expert Finding. In Proceedings of the 4th European Semantic Web Conference, Innsbruck, Austria, 3–7 June 2007; pp. 235–250.
[44]  Hoffmann, R. A wiki for the life sciences where authorship matters. Nat. Genet. 2008, 40, 1047–1051, doi:10.1038/ng.f.217.
[45]  Michelson, M.; Macskassy, S. Discovering Users’ Topics of Interest on Twitter: A First Look. In Proceedings of the 4th Workshop on Analytics for Noisy Unstructuredco-located with the 19th ACM CIKM Conference, Toronto, Canada, 26–30 October 2010; pp. 73–80.
[46]  Abel, F.; Gao, Q.; Houben, G.; Tao, K. Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web. In Proceedings of the 8th Extended Semantic Web Conference, Heraklion, Greece, 29 May–2 June 2011; pp. 375–389.
[47]  Monaghan, F.; Bordea, G.; Samp, K.; Buitelaar, P. Exploring Your Research: Sprinkling some Saffron on Semantic Web Dog Food. In Proceedings of the Semantic Web Challenge at the International Semantic Web Conference, Shanghai, China, 7–11 November 2010.
[48]  Moeller, K.; Heath, T.; Handschuh, S.; Domingue, J. Recipes for Semantic Web Dog Food—The ESWC and ISWC Metadata Projects. In Proceedings of the 6th International Semantic Web Conference, Busan, Korea, 11–15 November 2007; pp. 802–815.
[49]  Bizer, C.; Heath, T.; Berners-Lee, T. Linked data—The story so far. Int. J. Semant. Web Inf. Syst. 2009, 5, 1–22.
[50]  PubMed. Available online: http://www.ncbi.nlm.nih.gov/pubmed/ (accessed on 1 October 2013).

Full-Text

comments powered by Disqus