全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

MLP, XGBoost, KAN, TDNN, and LSTM-GRU Hybrid RNN with Attention for SPX & NDX European Call Option Pricing

DOI: 10.4236/jmf.2025.152016, PP. 385-431

Keywords: European Call Option, Moneyness, Time-Delay Neural Network (TDNN), Multilayer Perceptron (MLP), Kolmogorov-Arnold Network (KAN)

Full-Text   Cite this paper   Add to My Lib

Abstract:

We explore the performance of various artificial neural network architectures, including a multilayer perceptron (MLP), Kolmogorov-Arnold network (KAN), LSTM-GRU hybrid recursive neural network (RNN) models, and a time-delay neural network (TDNN) for pricing European call options. In this study, we attempt to leverage the ability of supervised learning methods, such as ANNs, KANs, and gradient-boosted decision trees, to approximate complex multivariate functions in order to calibrate option prices based on past market data. The motivation for using ANNs and KANs is the Universal Approximation Theorem and Kolmogorov-Arnold Representation Theorem, respectively. Specifically, we use S&P 500 (SPX) and NASDAQ 100 (NDX) index options traded during 2015-2023 with times to maturity ranging from 15 days to over 4 years (OptionMetrics IvyDB US dataset). Black & Scholes’s (BS) PDE [1] model’s performance in pricing the same options compared to real data is used as a benchmark. This model relies on strong assumptions, and it has been observed and discussed in the literature that real data does not match its predictions. Supervised learning methods are widely used as an alternative for calibrating option prices due to some of the limitations of this model. In our experiments, the BS model underperforms compared to all of the others. Also, the best MLP model outperforms the best TDNN model on all error metrics. We implement a simple self-attention mechanism to enhance the RNN models, significantly improving their performance. The best-performing model overall is the LSTM-GRU hybrid RNN model with attention. Also, the KAN model outperforms the TDNN and MLP models. We analyze the performance of all models by ticker, moneyness category, and over/under/correctly-priced percentage.

References

[1]  Black, F. and Scholes, M. (1973) The Pricing of Options and Corporate Liabilities. Journal of Political Economy, 81, 637-654.
https://doi.org/10.1086/260062
[2]  Oosterlee, C.W. and Grzelak, L.A. (2020) Mathematical Modeling and Computation in Finance with Exercises and Python and MATLAB Computer Codes. World Scientific Publishing Europe Ltd.
[3]  Hull, J.C. (2014) Options, Futures, and Other Derivatives. 9th Edition, Prentice Hall.
[4]  Haug, E.G. (2007) The Complete Guide to Option Pricing Formulas. 2nd Edition, McGraw-Hill.
[5]  McMillan, L.G. (2012) Options as a Strategic Investment. 5th Edition, Prentice Hall Press.
[6]  Natenberg, S. (1994) Option Volatility and Pricing: Advanced Trading Strategies and Techniques. 2nd Edition, McGraw-Hill.
[7]  Jackwerth, J.C. and Rubinstein, M. (1996) Recovering Probability Distributions from Option Prices. The Journal of Finance, 51, 1611-1631.
https://doi.org/10.1111/j.1540-6261.1996.tb05219.x
[8]  Cont, R. (2001) Empirical Properties of Asset Returns: Stylized Facts and Statistical Issues. Quantitative Finance, 1, 223-236.
https://doi.org/10.1080/713665670
[9]  Gatheral, J. (2006) The Volatility Surface: A Practitioner’s Guide. John Wiley & Sons.
[10]  Rubinstein, M. (1985) Nonparametric Tests of Alternative Option Pricing Models Using All Reported Trades and Quotes on the 30 Most Active CBOE Option Classes from August 23, 1976 through August 31, 1978. The Journal of Finance, 40, 455-480.
https://doi.org/10.1111/j.1540-6261.1985.tb04967.x
[11]  Derman, E. and Kani, I. (1994) Riding on a Smile. Risk, 7, 32-39.
[12]  Aı̈t-Sahalia, Y. and Lo, A.W. (2000) Nonparametric Risk Management and Implied Risk Aversion. Journal of Econometrics, 94, 9-51.
https://doi.org/10.1016/s0304-4076(99)00016-0
[13]  Heston, S.L. (1993) A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options. Review of Financial Studies, 6, 327-343.
https://doi.org/10.1093/rfs/6.2.327
[14]  Hull, J. and White, A. (1987) The Pricing of Options on Assets with Stochastic Volatilities. The Journal of Finance, 42, 281-300.
https://doi.org/10.1111/j.1540-6261.1987.tb02568.x
[15]  Dupire, B. (1994) Pricing with a Smile. Risk, 7, 18-20.
[16]  Derman, E. and Kani, I. (1998) Stochastic Implied Trees: Arbitrage Pricing with Stochastic Term and Strike Structure of Volatility. International Journal of Theoretical and Applied Finance, 1, 61-110.
https://doi.org/10.1142/s0219024998000059
[17]  Coleman, T., Li, Y. and Verma, A. (1999) Reconstructing the Unknown Local Volatility Function. The Journal of Computational Finance, 2, 77-102.
https://doi.org/10.21314/jcf.1999.027
[18]  Merton, R.C. (1976) Option Pricing When Underlying Stock Returns Are Discontinuous. Journal of Financial Economics, 3, 125-144.
https://doi.org/10.1016/0304-405x(76)90022-2
[19]  Kou, S.G. (2002) A Jump-Diffusion Model for Option Pricing. Management Science, 48, 1086-1101.
https://doi.org/10.1287/mnsc.48.8.1086.166
[20]  Kou, S.G. and Wang, H. (2004) Option Pricing under a Double Exponential Jump Diffusion Model. Management Science, 50, 1178-1192.
https://doi.org/10.1287/mnsc.1030.0163
[21]  Barndorff-Nielsen, O.E. (1997) Processes of Normal Inverse Gaussian Type. Finance and Stochastics, 2, 41-68.
https://doi.org/10.1007/s007800050032
[22]  Carr, P., Geman, H., Madan, D.B. and Yor, M. (2002) The Fine Structure of Asset Returns: An Empirical Investigation. The Journal of Business, 75, 305-333.
https://doi.org/10.1086/338705
[23]  Eberlein, E. (2001) Application of Generalized Hyperbolic Lévy Motions to Finance. In: Lévy Processes: Theory and Applications, Birkhäuser, 319-336.
https://doi.org/10.1007/978-1-4612-0197-7_14
[24]  Matache, A.M., von Petersdorff, T. and Schwab, C. (2004) Fast Deterministic Pricing of Options on Lévy Driven Assets. Finance and Stochastics, 8, 369-393.
[25]  Raible, S. (2000) Lévy Processes in Finance: Theory, Numerics, and Empirical Facts. Ph.D. Thesis, Albert Ludwigs University of Freiburg.
[26]  Merton, R.C. (1973) Theory of Rational Option Pricing. The Bell Journal of Economics and Management Science, 4, 141-183.
https://doi.org/10.2307/3003143
[27]  Hull, J. and White, A. (1990) Pricing Interest-Rate-Derivative Securities. Review of Financial Studies, 3, 573-592.
https://doi.org/10.1093/rfs/3.4.573
[28]  Black, F., Derman, E. and Toy, W. (1990) A One-Factor Model of Interest Rates and Its Application to Treasury Bond Options. Financial Analysts Journal, 46, 33-39.
https://doi.org/10.2469/faj.v46.n1.33
[29]  Heath, D., Jarrow, R. and Morton, A. (1992) Bond Pricing and the Term Structure of Interest Rates: A New Methodology for Contingent Claims Valuation. Econometrica, 60, 77-105.
https://doi.org/10.2307/2951677
[30]  Rouah, F.D. (2013) Four Derivations of the Black-Scholes PDE.
https://api.semanticscholar.org/CorpusID:34908553
[31]  Malliaris, M. and Salchenberger, L. (1993) A Neural Network Model for Estimating Option Prices. Applied Intelligence, 3, 193-206.
https://doi.org/10.1007/bf00871937
[32]  Hornik, K., Stinchcombe, M. and White, H. (1989) Multilayer Feedforward Networks Are Universal Approximators. Neural Networks, 2, 359-366.
https://doi.org/10.1016/0893-6080(89)90020-8
[33]  Csáji, B.C. (2001) Approximation with Artificial Neural Networks. Faculty of Sciences, Eötvös Loránd University.
[34]  Leshno, M., Lin, V.Y., Pinkus, A. and Schocken, S. (1993) Multilayer Feedforward Networks with a Nonpolynomial Activation Function Can Approximate Any Function. Neural Networks, 6, 861-867.
https://doi.org/10.1016/s0893-6080(05)80131-5
[35]  Hutchinson, J.M., Lo, A.W. and Poggio, T. (1994) A Nonparametric Approach to Pricing and Hedging Derivative Securities via Learning Networks. The Journal of Finance, 49, 851-889.
https://doi.org/10.1111/j.1540-6261.1994.tb00081.x
[36]  Ruf, J. and Wang, W. (2020) Neural Networks for Option Pricing and Hedging: A Literature Review. The Journal of Computational Finance, 24, 1-46.
https://doi.org/10.21314/jcf.2020.390
[37]  Ferraz, J.D.M. (2022) Pricing Options Using the XGBoost Model. Master’s Thesis, Instituto Superior de Economia e Gestão.
[38]  Dhiman, A. and Hu, Y. (2023) Physics Informed Neural Network for Option Pricing.
[39]  Dugas, C., Rouah, A. and Stafford, J. (2009) Capturing the Volatility Smile with Deep Neural Networks. Journal of Computational Finance, 12, 1-28.
[40]  Liu, X., Yang, J. and Zhou, Z. (2019) Deep Learning-Based Modeling of the Implied Volatility Surface. Journal of Financial Markets, 43, 50-75.
[41]  Pohjonen, T. (2022) Pricing and Hedging Options with Neural Networks: A Case of FTSE 100 Index Options. Master’s Thesis, School of Business and Management, Business Administration.
[42]  Soni, M. (2020) What Is Decision Tree Regression?
https://maniksonituts.medium.com/what-is-decision-treeregression-dcd0ea40a323
[43]  Demajo, L.M. (2020) Explainable AI for Interpretable Credit Scoring.
https://www.researchgate.net/figure/XGBoost-model-Source-Self_fig2_350874464
[44]  Bengio, Y., Simard, P. and Frasconi, P. (1994) Learning Long-Term Dependencies with Gradient Descent Is Difficult. IEEE Transactions on Neural Networks, 5, 157-166.
https://doi.org/10.1109/72.279181
[45]  Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780.
https://doi.org/10.1162/neco.1997.9.8.1735
[46]  Cho, K., van Merrienboer, B., Bahdanau, D. and Bengio, Y. (2014) On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. Proceedings of SSST-8th Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha, October 2014, 103-111.
https://doi.org/10.3115/v1/w14-4012
[47]  Chung, J., Gulcehre, C., Cho, K. and Bengio, Y. (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling.
https://doi.org/10.48550/arXiv.1412.3555
[48]  Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L. and Polosukhin, I. (2017) Attention Is All You Need. Advances in Neural Infor-mation Processing Systems, 5998-6008.
[49]  Bahdanau, D., Cho, K. and Bengio, Y. (2014) Neural Machine Translation by Jointly Learning to Align and Translate.
[50]  Luong, T., Pham, H. and Manning, C.D. (2015). Effective Approaches to Attention-Based Neural Machine Translation. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, September 2015, 1412-1421.
https://doi.org/10.18653/v1/d15-1166
[51]  Liu, Z., Wang, Y., Vaidya, S., Ruehle, F., Halverson, J., Soljačić, M., Hou, T.Y. and Tegmark, M. (2024) KAN: Kolmogorov-Arnold Networks.
[52]  Bozorgasl, Z. and Chen, H. (2024) Wav-KAN: Wavelet Kolmogorov-Arnold Networks.
https://doi.org/10.48550/arXiv.2405.12832
[53]  Xu, J., Chen, Z., Yang, S., Li, J., Wang, W., Hu, X. and Ngai, E.C.-H. (2024) FourierKAN-GCF: Fourier Kolmogorov-Arnold Network—An Effective and Efficient Feature Transformation for Graph Collaborative Filtering.
[54]  Gist Noesis (Company) (2024) GitHub FourierKAN Repository.
https://github.com/GistNoesis/FourierKAN
[55]  Gu, A. and Dao, T. (2023) Mamba: Linear-Time Sequence Modeling with Selective State Spaces.
[56]  Dao, T. and Gu, A. (2024) Transformers Are SSMs: Generalized Models and Efficient Algorithms through Structured State Space Duality.
[57]  Ding, Z., Li, Y., He, Y., Norelli, A., Wu, J., Tresp, V., Ma, Y. and Bronstein, M. (2024) DyGMamba: Efficiently Modeling Long-Term Temporal Dependency on Continuous-Time Dynamic Graphs with State Space Models.
[58]  Wal, R., Byeon, W., Riach, D., Norick, B., Korthikanti, V., Dao, T., Gu, A., Hata-mizadeh, A., Singh, S., Narayanan, D., Kulshreshtha, G., Singh, V., Casper, J., Kautz, J., Shoeybi, M. and Catanzaro, B. (2024) An Empirical Study of Mamba-Based Language Models.
[59]  Shmuel, A., Glickman, O. and Lazebnik, T. (2024) A Comprehensive Benchmark of Machine and Deep Learning across Diverse Tabular Datasets.
https://www.arxiv.org/abs/2408.14817
[60]  Sabour, S., Frosst, N. and Hinton, G.E. (2017) Dynamic Routing between Capsules.
https://doi.org/10.48550/arXiv.1710.09829

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133