全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

Distribution Estimation of Invasive Species Based on Crowdsourcing Reports

DOI: 10.4236/oalib.1109474, PP. 1-11

Subject Areas: Bioinformatics, Big Data Search and Mining, Information Management

Keywords: Data Mining, Public Health, Biotechnology, Cluster Analysis, Crowdsourcing Data

Full-Text   Cite this paper   Add to My Lib

Abstract

Species invasion will cause certain harm to the local ecosystem. Vespa mandarinia, discovered on Vancouver Island, is harmful to agriculture and predators of European honeybees. The government tried to use a crowdsourcing system to collect information and formulate policies to eliminate Vespa mandarinia. However, the information provided by the local population about Vespa mandarinia is not entirely accurate. For this problem, we build a method to mine trusted information in massive crowdsourcing Vespa mandarinia reports. We consider providing the date and location of the report, and establishing a credibility calculation model for further analysis. For the report date, we calculate the normal distribution parameters based on the frequency of the report in each season to measure the reliability of a single report. For report location, we use K-means cluster analysis to find the location of the center point, which is regarded as a hive, count the report points in each hive radiation range, and use these points to generate two-dimensional normal distribution parameters to normalize the data and eliminate statistical errors. We take the probability density of the report at its location as the reliability of the reports. Through credibility, we can screen out reports that are more likely to be positive for prioritizing investigation. In order to better analyze the newly discovered reports in the future and ensure the timeliness of the model, we set up distributed incremental adjustment model to modify normal distribution parameters, and update the existing model.

Cite this paper

Shi, Y. , Liu, S. and Liu, T. (2022). Distribution Estimation of Invasive Species Based on Crowdsourcing Reports. Open Access Library Journal, 9, e9474. doi: http://dx.doi.org/10.4236/oalib.1109474.

References

[1]  Keating, M., Rhodes, B. and Richards, A. (2013) Crowdsourcing: A Flexible Method for Innovation, Data Collection, and Analysis in Social Science Research. In: Hill, C.A., Dean, E. and Murphy, J., Eds., Social Media, Sociality, and Survey Research, Wiley, New York, 179-201. https://doi.org/10.1002/9781118751534.ch8
[2]  Yuen, M.-C., King, I. and Leung, K.-S. (2011) A Survey of Crowdsourcing Systems. 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, Boston, MA, 9-11 October 2011, 766-773. https://doi.org/10.1109/PASSAT/SocialCom.2011.203
[3]  Willett, W., Ginosar, S., Steinitz, A., Hartmann, B. and Agrawala, M. (2013) Identifying Redundancy and Exposing Provenance in Crowdsourced Data Analysis. IEEE Transactions on Visualization and Computer Graphics, 19, 2198-2206. https://doi.org/10.1109/TVCG.2013.164
[4]  Koswatte, S., McDougall, K. and Liu, X.Y. (2017) VGI and Crowdsourced Data Credibility Analysis Using Spam Email Detection Techniques. International Journal of Digital Earth, 11, 520-532. https://doi.org/10.1080/17538947.2017.1341558
[5]  Loganathan, V., Subramani, G. and Bhaskar, N. (2020) Crowdsourcing Data Analysis for Crowd Systems. In: Ranganathan, G., Chen, J. and Rocha, á., Eds., Inventive Communication and Computational Technologies, Springer, Singapore. https://doi.org/10.1007/978-981-15-0146-3_117
[6]  Shamir, L., Diamond, D. and Wallin, J. (2015) Leveraging Pattern Recognition Consistency Estimation for Crowdsourcing Data Analysis. IEEE Transactions on Human-Machine Systems, 46, 474-480. https://doi.org/10.1109/THMS.2015.2463082
[7]  Silverman, M.P. (2019) Extraction of Information from Crowdsourcing: Experimental Test Employing Bayesian, Maximum Likelihood, and Maximum Entropy Methods. Open Journal of Statistics, 9, 571-600. https://doi.org/10.4236/ojs.2019.95038
[8]  Shi, Y.X. (2022) Vespa Mandarinia Crowdsourcing Reports. Figshare. Dataset. https://doi.org/10.6084/m9.figshare.21333966.v1
[9]  Wang, Z., Liu, Q. and Chen, E. (2009) A K-Means Algorithm for Optimizing the Initial Center Point. Pattern Recognition and Artificial Intelligence, 22, 299-304.
[10]  Xia, X.-F., Liu, X. and Li, X.-M. (2010) User-Item Missing Ratings Complement Based on Two-Dimensional Normal Distribution. 2010 2nd International Workshop on Database Technology and Applications, Wuhan, 27-28 November 2010, 1-6. https://doi.org/10.1109/DBTA.2010.5658988
[11]  Huang, S.K. (2001) The Preliminary Report on Vespa mandarinia and Other Arthropods in Its Cave. Journal of Fujian Agricultural University (Natural Science), 30, 99-102.
[12]  Wu, G.J., Zhang, J.L. and Yuan, D. (2019) Automatically Obtaining K Value Based on K-Means Elbow Method. Computer Engineering & Software, 40, 167-170.

Full-Text


comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413