Text and Structural Data Mining of Influenza Mentions in Web and Social Media

DOI: 10.3390/ijerph7020596

Keywords: disease surveillance, public health epidemiology, health informatics, graph-based data mining, web and social media, social network analysis


Abstract:

Text and structural data mining of web and social media (WSM) provides a novel disease surveillance resource and can identify online communities for targeted public health communications (PHC) to ensure wide dissemination of pertinent information. WSM items that mention influenza are harvested over a 24-week period, from 5 October 2008 to 21 March 2009. Link analysis reveals communities for targeted PHC. Text mining is shown to identify trends in flu posts that correlate with real-world influenza-like illness patient report data. We also apply a graph-based data mining technique to detect anomalies among flu blogs connected by publisher type, links, and user tags.
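
The abstract reports that trends in flu posts correlate with influenza-like illness (ILI) patient report data but does not say how the correlation is computed. As a minimal sketch only, assuming weekly post counts are compared with weekly ILI report counts using a Pearson coefficient, the comparison might look like the following. The function name `pearson` and both series are hypothetical placeholders, not the paper's method or data.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Made-up placeholder values for illustration, NOT data from the paper.
weekly_flu_posts = [120, 135, 160, 210, 280, 350]
weekly_ili_reports = [30, 34, 41, 55, 70, 90]

r = pearson(weekly_flu_posts, weekly_ili_reports)
print(f"Pearson r = {r:.3f}")
```

A value of r near 1 would indicate that post volume rises and falls with reported ILI cases. On its own it does not establish that posts lead or predict the clinical data.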

