全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Clicking through the Clickstream: A Novel Statistical Modeling Approach to Improve Information Usage of Clickstream Data by E-Commerce Entities

DOI: 10.4236/iim.2023.153010, PP. 180-215

Keywords: Business Intelligence, Intelligent Information Management, Web Analytics, Web Technology Management, Exit Rate, Bounce Rate, Online Consumer Model, Discrete Choice Model

Full-Text   Cite this paper   Add to My Lib

Abstract:

Success or failure of an E-commerce platform is often reduced to its ability to maximize the conversion rate of its visitors. This is commonly regarded as the capacity to induce a purchase from a visitor. Visitors possess individual characteristics, histories, and objectives which complicate the choice of what platform features that maximize the conversion rate. Modern web technology has made clickstream data accessible allowing a complete record of a visitor’s actions on a website to be analyzed. What remains poorly constrained is what parts of the clickstream data are meaningful information and what parts are accidental for the problem of platform design. In this research, clickstream data from an online retailer was examined to demonstrate how statistical modeling can improve clickstream information usage. A conceptual model was developed that conjectured relationships between visitor and platform variables, visitors’ platform exit rate, boune rate, and decision to purchase. Several hypotheses on the nature of the clickstream relationships were posited and tested with the models. A discrete choice logit model showed that the content of a website, the history of website use, and the exit rate of pages visited had marginal effects on derived utility for the visitor. Exit rate and bounce rate were modeled as beta distributed random variables. It was found that exit rate and its variability for pages visited were associated with site content, site quality, prior visitor history on the site, and technological preferences of the visitor. Bounce rate was also found to be influenced by the same factors but was in a direction opposite to the registered hypotheses. Most findings supported that clickstream data is amenable to statistical modeling with interpretable and comprehensible models.

References

[1]  Ramanathan, V., Yellayi, V.S., Karim, S., Funari, P., Wilk, J. and Schneier, C.R. (2018) E-Commerce Trends—A Service Enterprise Engineering Perspective. Penn State College of Engineering. SEE360Initiative.
https://see360.psu.edu/files/2018/07/E-Commerce-White-Paper-rf0fxz.pdf
[2]  Chaffey, D. and Patron, M. (2012) From Web Analytics to Digital Marketing Optimization: Increasing the Commercial Value of Digital Analytics. Journal of Direct, Data, and Digital Marketing Practice, 14, 30-45.
https://doi.org/10.1057/dddmp.2012.20
[3]  Thelwall, M. (2009) Introduction to Webometrics: Quantitative Web Research for the Social Sciences. Morgan and Claypool, San Rafael.
https://doi.org/10.1007/978-3-031-02261-6
[4]  Penniman, W.D. (1975) A Stochastic Process Analysis of Online User Behavior. Annual Meeting of the American Society for Information Science, Washington DC, 30 March-3 April 1975, 147-148.
[5]  Penniman, W.D. (2008) Historic Perspective of Log Analysis. In: Jansen, B.J., Spink, A. and Taksa, I., Eds., Handbook of Research on Web Log Analysis, IGI, Hershey, 18-38.
https://doi.org/10.4018/978-1-59904-974-8.ch002
[6]  Peters, T. (1993) The History and Development of Transaction Log Analysis. Library Hi Tech, 42, 41-66.
https://doi.org/10.1108/eb047884
[7]  Peterson, E. (2004) Web Analytics Demystified: A Marketer’s Guide to Understanding How Your Web Site Affects Your Business. Celilo Group Media, New York.
[8]  Booth, D.L. and Jansen, B.J. (2008) A Review of Methodologies for Analyzing Websites. In: Jansen, B.J., Spink, A. and Taksa, I., Eds., Handbook of Research on Web Log Analysis, IGI, Hershey, 143-164.
https://doi.org/10.4018/978-1-59904-974-8.ch008
[9]  Özmutlu, S., Özmutlu, H.C. and Spink, A. (2008) Topic Analysis and Identification of Queries. In: Jansen, B.J., Spink, A. and Taksa, I., Eds., Handbook of Research on Web Log Analysis, IGI, Hershey, 345-358.
https://doi.org/10.4018/978-1-59904-974-8.ch017
[10]  Jansen, B.J. (2009) Understanding User-Web Interactions via Web-Analytics. Synthesis Lectures on Information Concepts, Retrieval, and Services. Springer, Berlin.
https://doi.org/10.1007/978-3-031-02264-7
[11]  Chen, H.-M. and Cooper, M.D. (2002) Stochastic Modeling of Usage Patterns in a Web-Based Information System. Journal of the American Society for Information Science and Technology, 53, 536-548.
https://doi.org/10.1002/asi.10076
[12]  Chen, H.-M. and Cooper, M.D. (2001) Using Clustering Techniques to Detect Usage Patterns in a Web-Based Information System. Journal of the American Society for Information Science and Technology, 52, 888-904.
https://doi.org/10.1002/asi.1159
[13]  Burby, J. and Atchison, S. (2007) Actionable Web Analytics: Using Data to Make Smart Business Decisions. Wiley, Indianapolis.
[14]  Becher, J.D. (2005) Why Metrics-Centric Performance Management Solutions Fall Short. Information Management Magazine, March.
[15]  Sapir, D. (2004) Online Analytics and Business Performance Management. BI Report.
[16]  Ansari, S., Kohavi, R., Mason, L. and Zheng, Z. (2001) Integrating E-Commerce and Data Mining: Architecture and Challenges. IEEE International Conference on Data Mining, San Jose, 29 November-2 December 2001, 27-34.
[17]  Moore, W.W. and Fader, P.S. (2004) Capturing Evolving Visit Behavior in Clickstream Data. Journal of Interactive Marketing, 18, 5-19.
https://doi.org/10.1002/dir.10074
[18]  Chatterjee, P., Hoffman, D.L. and Novak, T.P. (1998) Modeling the Clickstream: Implications for Web-Based Advertising Efforts. Marketing Science, 22, 520-541.
https://doi.org/10.1287/mksc.22.4.520.24906
[19]  Wang, G., Zhang, X., Tang, S., Zheng, H. and Zhao, B. (2016) Unsupervised Clustering for User Behavior Analysis. CHI‘16: Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, San Jose, 7-12 May 2016, 225-236.
https://doi.org/10.1145/2858036.2858107
[20]  Senecal, S., kalczynski, P.J. and Nantel, J. (2005) Consumers’ Decision-Making Process and Their Online Shopping Behavior: A Clickstream Analysis. Journal of Business Research, 58, 1599-1608.
https://doi.org/10.1016/j.jbusres.2004.06.003
[21]  Howard, J.A. and Sheth, J.N. (1969) The Theory of Buyer Behavior. John Wiley & Sons, Inc., New York.
[22]  Cialdini, R.B. (2001) Harnessing the Science of Persuasion. Harvard Business Review, 10, 72-79.
[23]  Baumeister, R.F. (2002) Yielding to Temptation: Self-Control Failure, Impulsive Purchasing, and Consumer Behavior. Journal of Consumer Research, 28, 670-676.
https://doi.org/10.1086/338209
[24]  Li, D. and Wang, M. (2015) 60% Purchase Is Impulsive.
http://www.bbtnews.com.cn/2015/1208/131385.shtml
[25]  Vohs, K.D. and Faber, R.J. (2007) Spent Resources: Self-Regulatory Resource Availability Affects Impulse Buying. Journal of Consumer Research, 33, 537-547.
https://doi.org/10.1086/510228
[26]  Cho, C.-H., Kang, J. and Cheon, H.J. (2006) Online Shopping Hesitation. Cyberpsychology and Behavior, 9, 261-274.
https://doi.org/10.1089/cpb.2006.9.261
[27]  Park, C.-H. and Kim, Y.-G. (2010) Identifying Key Factors Affecting Consumer Purchase Behavior in an Online Shopping Context. International Journal of Retail & Distribution Management, 31, 16-29.
https://doi.org/10.1108/09590550310457818
[28]  Hasan, B. (2016) Perceived Irritation in Online Shopping: The Impact of Website Design Characteristics. Computers in Human Behavior, 54, 224-230.
https://doi.org/10.1016/j.chb.2015.07.056
[29]  Swinyard, W.R. and Smith, S.M. (2003) Why People (Don’t) Shop Online: A Lifestyle Study of the Internet Consumer. Psychology & Marketing, 20, 567-597.
https://doi.org/10.1002/mar.10087
[30]  Sakar, C.O., Polat, S.O., Katircioglu, M. and Kastro, Y. (2019) Real-Time Prediction of Online Shopper’s Purchasing Intention Using Multilayer Pereptron and LSTM Recurrent Neural Networks. Neural Computing and Applications, 31, 6893-6908.
https://doi.org/10.1007/s00521-018-3523-0
[31]  Moshe, B., Mcfadden, D., Abe, M., Bockenholt, U., Bolduc, D., Gopinath, D., Morikawa, T., Ramaswamy, V., Rao, V., Revelt, D. and Steinberg, D. (1997) Modeling Methods for Discrete Choice Analysis. Marketing Letters, 8, 273-286.
https://doi.org/10.1023/A:1007956429024
[32]  Horowitz, J.L. (1994) Advances in Random Utility Models: Report of the Workshop on Advances in Random Utility Models. Marketing Letters, 5, 311-322.
https://doi.org/10.1007/BF00999207
[33]  Abe, M. (2012) A Generalized Additive Model for Discrete Choice Data. Journal of Business & Economic Statistics, 17, 271-284.
https://doi.org/10.1080/07350015.1999.10524817
[34]  Cover, T. and Thomas, J. (2006) Elements of Information Theory. 2nd Edition, Wiley & Sons, Hoboken.
[35]  Reshef, Y.A., Reshef, D.N., Finucane, H.K., Sabeti, P.C. and Mitzenmacher, M. (2016) Measuring Dependence Powerfully and Equitably. Journal of Machine Learning Research, 17, 1-63.
[36]  Reshef, D.N., Reshef, Y.A., Finucane, H.K., Grossman, S.R., McVean, G. and Turnbaul, P.T. (2011) Detecting Novel Associations in Large Data Sets. Science, 334, 1518-1524.
https://doi.org/10.1126/science.1205438
[37]  Szekely, G.J., Rizzo, M.L. and Bajirov, N.K. (2007) Measuring and Testing Dependence by Correlation of Distances. The Annals of Statistics, 25, 2769-2794.
https://doi.org/10.1214/009053607000000505
[38]  Ferrari, S. and Cribari-Neta, F. (2004) Beta Regression for Modelling Rates and Proportions. Journal of Applied Statistics, 31, 799-815.
https://doi.org/10.1080/0266476042000214501
[39]  Schmid, M., Wickler, F., Maloney, K.O., Mitchell, R., Fenske, N. and Mayr, A. (2013) Boosted Beta Regression. PLOS ONE, 8, e61623.
https://doi.org/10.1371/journal.pone.0061623
[40]  Smithson, M. and Verkuilen, J. (2006) A Better Lemon Squeezer? Maximum-Likelihood Regression with Beta-Distributed Dependent Variables. Psychological Methods, 11, 54-71.
https://doi.org/10.1037/1082-989X.11.1.54
[41]  Simas, A.B., Barreto-Souza, W. and Rocha, A.V. (2010) Improved Estimators for a General Class of Beta Regression Models. Computational Statistics & Data Analysis, 54, 348-366.
https://doi.org/10.1016/j.csda.2009.08.017
[42]  Cribari-Neto, F. and Zeileis, A. (2010) Beta Regression in R. Journal of Statistical Software, 34, 1-24.
http://www.jstatsoft.org/v34/i02
https://doi.org/10.18637/jss.v034.i02
[43]  Hatfield, L.A., Boye, M.E., Hackshaw, M.D. and Carlin, B.P. (2012) Models for Survival Times and Longitudinal Patient Reported Outcomes with Many Zeros. Journal of the American Statistical Association, 107, 875-885.
https://doi.org/10.1080/01621459.2012.664517
[44]  Ospina, R. and Ferrari, S.L. (2012) A General Class of Zero-or-One Inflated Beta Regression Models. Computational Statistics & Data Analysis, 56, 1609-1623.
https://doi.org/10.1016/j.csda.2011.10.005
[45]  Swearingen, C.J., Melguizo castro, M.S. and Bursac, Z. (2012) Inflated Beta Regression: Zero, One, and Everything in between. SAS Global Forum 2012: Statistics and Data Analysis, Cary, North Carolina, 2012, Paper 325.
[46]  Zeileis, A., Hothorn, T. and Hornik, K. (2008) Model-Based Recursive Partitioning. Journal of Computational and Graphical Statistics, 17, 492-514.
https://doi.org/10.1198/106186008X319331
[47]  Breiman, L., Friedman, J., Olshen, R.A. and Stone, C.J. (1984) Classification and Regression Trees. Chapman and Hall, New York.
[48]  Grün, B., Kosmidis, I. and Zeileis, A. (2020) Extended Beta Regression in R: Shaken, Stirred, Mixed, and Partitioned. Journal of Statistical Software, 48, 1-25.
https://doi.org/10.18637/jss.v048.i11
[49]  McLachlan, G. and Peel, D. (2000) Finite Mixture Models. Wiley, New York.
https://doi.org/10.1002/0471721182
[50]  Wedel, M. and DeSarbo, W.S. (1994) A Review of Recent Developments in Latent Class Regression Models. In: Bagozzi, R., Ed., Advanced Methods of Marketing Research, Blackwell Pub., Hoboken, 352-388.
https://ssrn.com/abstract=2789856
[51]  Magidson, J. and Vermunt, J.K. (2004) Latent Class Models. In: Kaplan, D., Ed., The Sage Handbook of Quantitative Methodology for the Social Sciences, Sage, Thousand Oaks, 175-198.
https://doi.org/10.4135/9781412986311.n10
[52]  Muthén, B.O. and Asparouhov, T. (2009) Multilevel Regression Mixture Analysis. Journal of the Royal Statistical Society, Series A, 172, 639-657.
https://doi.org/10.1111/j.1467-985X.2009.00589.x
[53]  Oberski, D.L. (2015) Beyond the Number of Classes: Separating Substantive from Non-Substantive Dependence in Latent Class Analysis. Advances in Data Analysis and Classification, 10, 171-182.
https://doi.org/10.1007/s11634-015-0211-0
[54]  Hastie, T., Tibshirani, R. and Friedman, J. (2017) The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd Edition, Springer, Berlin.
[55]  Tibshirani, R. (1996) Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society, Series B, 58, 267-288.
https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
[56]  Lemeshow, S. and Hosmer, D.W. (1982) A Review of Goodness of Fit Statistics for Use in the Development of Logistic Regression Models. American Journal of Epidemiology, 115, 92-106.
https://doi.org/10.1093/oxfordjournals.aje.a113284
[57]  Hosmer, D.W., Hosmer, T., le Cessie, S. and Lemeshow, S. (1997) A Comparison of Goodness-of-Fit Tests for the Logistic Regression Model. Statistics in Medicine, 16, 965-980.
https://doi.org/10.1002/(SICI)1097-0258(19970515)16:9<965::AID-SIM509>3.0.CO;2-O
[58]  McFadden, D. (1974) Conditional Logit Analysis of Qualitative Choice Behavior. In: Zarembka, P., Ed., Frontiers in Econometrics, Academic Press, Cambridge, 105-142.
[59]  Shneiderman, B., Plaisant, C. and Cohen, M. (2008) Designing the User Interface: Strategies for Effective Human-Computer Interaction. 5th Edition, Pearson, London.
[60]  Wickens, C.D., Lee, J.D., Liu, Y.L. and Gordon-Becker, S. (2003) An Introduction to Human Factors Engineering. 2nd Edition, Pearson Prentice Hall, London.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133