Theoretical Economics Letters 2025

Semantic Diversification in Equity Portfolios

DOI: 10.4236/tel.2025.151011, PP. 187-198

Crina Pungulescu

Keywords: Text Analysis, Portfolio Performance, Natural Language Processing, BERT, GPT, Semantic Fingerprinting

Abstract:

In the race to harness the power of Artificial Intelligence (AI) in virtually every field, researchers and practitioners face an ever-increasing supply of novel tools that have not undergone domain-specific tests. This paper informs the methodological choices of researchers in economics and finance by comparing the performance of three Natural Language Processing (NLP) methods on an important task: using text analysis for portfolio diversification. Portfolio management can benefit from analysing text data in the form of company descriptions, since the returns of companies with similar descriptions tend to be correlated and, consequently, portfolios of dissimilar companies should have lower risk. In this paper, three NLP methods are used to construct so-called minimum semantic concentration portfolios, which are designed to leverage the semantic diversity of the business descriptions of constituent companies to reduce portfolio volatility. Two widely used large language models (BERT and GPT) and an alternative AI solution inspired by neuroscience, called semantic fingerprinting, are tested on their ability to compare meaningfully the business descriptions of the S&P 500 and Europe 600 constituents, respectively, in order to derive actionable investment insights. The results show that all three NLP methods are able to extract relevant information from company descriptions: the minimum semantic concentration portfolios have significantly lower volatility than portfolios constructed with randomly chosen weights. While no NLP method can claim absolute superiority over its peers, semantic fingerprinting appears to be the most consistent and robust performer, whereas BERT and GPT demonstrate not only their potential but also a caveat: their performance is volatile even across very similar tasks.
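The idea of a minimum semantic concentration portfolio can be illustrated with a small sketch. The abstract does not give the paper's exact formulation, so the following rests on assumptions: semantic concentration is taken to be the quadratic form w'Sw, where S is the pairwise cosine similarity matrix of the company descriptions, and the portfolio is long-only and fully invested. The toy vectors stand in for whatever representation BERT, GPT, or semantic fingerprinting would produce, and all function names are hypothetical.

```python
import numpy as np

def cosine_similarity_matrix(embeddings):
    """Pairwise cosine similarity between the rows of `embeddings`."""
    X = np.asarray(embeddings, dtype=float)
    X = X / np.linalg.norm(X, axis=1, keepdims=True)
    return X @ X.T

def project_to_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = np.sort(v)[::-1]                      # sort in descending order
    css = np.cumsum(u) - 1.0
    idx = np.arange(1, v.size + 1)
    rho = np.nonzero(u - css / idx > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)

def semantic_concentration(w, S):
    """Assumed concentration measure: the quadratic form w' S w."""
    return float(w @ S @ w)

def min_semantic_concentration_weights(S, n_iter=2000):
    """Minimise w' S w over long-only, fully invested weights
    using projected gradient descent (an illustrative solver choice)."""
    n = S.shape[0]
    w = np.full(n, 1.0 / n)                   # start from equal weights
    # Step size 1/L, where L is the largest eigenvalue of the Hessian 2S.
    step = 1.0 / (2.0 * np.linalg.eigvalsh(S)[-1])
    for _ in range(n_iter):
        w = project_to_simplex(w - step * 2.0 * (S @ w))
    return w

# Toy description vectors (assumed): companies 0 and 1 are near-duplicates,
# companies 2 and 3 are each semantically distinct.
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
S = cosine_similarity_matrix(toy_embeddings)
weights = min_semantic_concentration_weights(S)
equal_weights = np.full(len(toy_embeddings), 0.25)

print("weights:", np.round(weights, 3))
print("concentration (optimised):", semantic_concentration(weights, S))
print("concentration (equal weights):", semantic_concentration(equal_weights, S))
```

In this toy setting the near-duplicate pair should together receive about as much weight as each distinct company, which is the intuition behind diversifying across dissimilar descriptions. Whether lower semantic concentration translates into lower realised volatility is the empirical question the paper tests.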

