全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
PLOS ONE  2013 

Spatio-Temporal Variation of Conversational Utterances on Twitter

DOI: 10.1371/journal.pone.0077793

Full-Text   Cite this paper   Add to My Lib

Abstract:

Conversations reflect the existing norms of a language. Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years. In this work, we show that this shortening occurs even for a brief period of 3 years (September 2009–December 2012) using 229 million utterances from Twitter. Furthermore, the subset of geographically-tagged tweets from the United States show an inverse proportion between utterance lengths and the state-level percentage of the Black population. We argue that shortening of utterances can be explained by the increasing usage of jargon including coined words.

References

[1]  Alis CM, Lim MT (2012) Adaptation of fictional and online conversations to communication media. The European Physical Journal B 85: 1–7.
[2]  Ritter A, Cherry C, Dolan B (2010) Unsupervised modeling of Twitter conversations. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Los Angeles, California: Association for Computational Linguistics. pp. 172–180.
[3]  Kumar R, Mahdian M, McGlohon M (2010) Dynamics of conversations. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. New York, NY, USA: ACM, KDD ‘10, p. 553–562. doi:10.1145/1835804.1835875.
[4]  Huberman B, Romero DM, Wu F (2008) Social networks that matter: Twitter under the microscope. First Monday 14. Available: http://journals.uic.edu/ojs/index.php/fm?/article/view/2317. Accessed 20 December 2008.
[5]  Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on World Wide Web. New York, NY, USA: ACM, WWW ‘10, p. 591–600. doi:10.1145/1772690.1772751.
[6]  Gon?alves B, Perra N, Vespignani A (2011) Modeling users’ activity on twitter networks: Validation of Dunbar’s number. PLoS ONE 6: e22656.
[7]  Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. Journal of Computational Science 2: 1–8.
[8]  Golder SA, Macy MW (2011) Diurnal and seasonal mood vary with work, sleep, and daylength across diverse cultures. Science 333: 1878–1881.
[9]  Kloumann IM, Danforth CM, Harris KD, Bliss CA, Dodds PS (2012) Positivity of the English language. PLoS ONE 7: e29484.
[10]  Semiocast (2012). Twitter reaches half a billion accounts - more than 140 millions in the U.S. Available: http://semiocast.com/publications/2012_0?7_30_Twitter_reaches_half_a_billion_acco?unts_140m_in_the_US.Accessed 30 July 2012.
[11]  O’Donovan J, Kang B, Meyer G, Hollerer T, Adalii S (2012) Credibility in context: An analysis of feature distributions in Twitter. In: Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Conference on Social Computing (SocialCom). pp. 293–301. doi:10.1109/SocialCom-PASSAT.2012.128.
[12]  Eisenstein J, O’Connor B, Smith NA, Xing EP (2010) A latent variable model for geographic lexical variation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. p. 1277–1287.
[13]  Eisenstein J, O’Connor B, Smith NA, Xing EP (2012) Mapping the geographical diffusion of new words. In: Proceedings of Social Network and Social Media Analysis: Methods, Models and Applications. Lake Tahoe, Nevada: NIPS.
[14]  Twitter (2012). What are @replies and mentions? Available: https://support.twitter.com/articles/140?23-what-are-replies-and-mentions. Accessed 25 September 2012.
[15]  Williams E (2008). How @replies work on twitter (and how they might). Available: http://blog.twitter.com/2008/05/how-repl?ies-work-on-twitter-and-how.html. Accessed 25 September 2012.
[16]  Wooffitt R (2005) Conversation analysis and discourse analysis: a comparative and critical introduction. London: Sage Publications Ltd.
[17]  Yule GU (1939) On sentence-length as a statistical characteristic of style in prose: with application to two cases of disputed authorship. Biometrika 30: 363–390.
[18]  Sigurd B, Eeg-Olofsson M, van de Weijer J (2004) Word length, sentence length and frequency - Zipf revisited. Studia Linguistica 58: 37–52 (16)..
[19]  Klee T, Fitzgerald MD (1985) The relation between grammatical development and mean length of utterance in morphemes. Journal of Child Language 12: 251–269.
[20]  Dollaghan CA, Campbell TF, Paradise JL, Feldman HM, Janosky JE, et al. (1999) Maternal education and measures of early speech and language. J Speech Lang Hear Res 42: 1432–1443.
[21]  Strauss U, Grzybek P, Altmann G (2006) Word Length and Word Frequency. In: Grzybek P,editor. Contributions to the Science of Text and Language. Berlin/Heidelberg: Springer-Verlag, Vol. 31: 277–294.
[22]  Twitter (2012). The t.co URL wrapper. Available: https://dev.twitter.com/docs/tco-url-wra?pper. Accessed 14 January 2013.
[23]  Kruskal WH, Wallis WA (1952) Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association 47: 583.
[24]  Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18: 50–60.
[25]  United States Census Bureau (2013) State and County QuickFacts. Washington: Government Printing Office.
[26]  Collins J (1999) The Ebonics controversy in context: literacies, subjectivities, and language idelogies in the united states. In: Blommaert J, editor, Language Ideological Debates, Walter de Gruyter.
[27]  Cancho RFi, Solé RV (2003) Least effort and the origins of scaling in human language. Proceedings of the National Academy of Sciences of the United States of America 100: 788–791.
[28]  Mocanu D, Baronchelli A, Perra N, Gon?alves B, Zhang Q, et al. (2013) The Twitter of Babel: Mapping world languages through microblogging platforms. PLoS ONE 8: e61981.
[29]  Smith A, Brenner J (2012) Twitter use. Technical report, Pew Internet & American Life Project. Available http://pewinternet.org/Reports/2012/Twit?ter-Use-2012/Findings.aspx. Accessed 31 May 2012.
[30]  Kalucki J (2010). Streaming API documentation. Available: http://apiwiki.twitter.com/w/page/225546?73/Streaming-API-Documentation?rev=12683?51420. Accessed 15 April 2011.
[31]  Lui M, Baldwin T (2012) langid.py: An off-the-shelf language identification tool. In: Proceedings of the ACL 2012 System Demonstrations. Jeju Island, Korea: Association for Computational Linguistics, 25|30.
[32]  Nakatani S (2012). Short text language detection with infinity-gram. Available: http://shuyo.wordpress.com/2012/05/17/sh?ort-text-language-detection-with-infinit?y-gram/. 30 December 2012.
[33]  United States Census Bureau (2012). 2012 TIGER/Line shapefiles [machine-readable data files].
[34]  Twitter (2013). FAQs about tweet location. Available: https://support.twitter.com/articles/785?25-about-the-tweet-location-feature. Accessed: 24 January 2013.
[35]  Cafaro M, Tempesta P (2011) Finding frequent items in parallel. Concurrency and Computation: Practice and Experience 23: 1774–1788.
[36]  Metwally A, Agrawal D, Abbadi AE (2005) Efficient computation of frequent and top-k elements in data streams. In: Eiter T, Libkin L, editors. Database Theory - ICDT 2005. Lecture Notes in Computer Science. Springer Berlin Heidelberg. pp. 398–412.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133