image: COMPARISON of Functions discovered by Symbolic Regression, and functions discovered by Symbolic Modeling
Credit: Xiangwu Zuo, Anxiao (Andrew) Jiang
In a breakthrough for artificial intelligence (AI) and finance, computer scientists from Texas A&M University have developed a machine learning based method called Symbolic Modeling to handle financial asset pricing tasks. Published in The Journal of Finance and Data Science, their approach reduces prediction errors compared to the widely used Fama-French 3-Factor model, while maintaining an acceptable model length.
Classic asset pricing models rely on linear combinations of manually selected financial factors, such as market volatility and company size. While effective, these models struggle with complex market dynamics. The proposed approach instead uses genetic programming and deep learning to automatically generate nonlinear expression that adapt to multiple datasets simultaneously.
“Most of the current financial asset pricing models are like a fixed recipe—you combine predetermined ingredients in set proportions. Symbolic Modeling is more like a master chef who creates entirely new recipes optimized for each diner,” explains Xiangwu Zuo, first author of this study, “The expressions generated by our method take into consideration for hidden relationships between market factors that humans might overlook.”
Zuo and the co-author, Andrew Jiang, generated an asset pricing model based on nearly four decades (1980–2018) of financial data from hundreds of companies, including well-known firms like Coca-Cola and ExxonMobil. The researchers tested their approach across both training and testing data, finding consistent improvements over traditional models.
“Our model achieves lower prediction errors compared to the classic asset pricing models,” shares Zuo. “Beyond just improved prediction accuracy, the model also reduces the “alpha” value—a crucial metric in finance representing unexplained returns.”
Notably, the models generated by the Symbolic Modeling method incorporated nonlinear factors combinations, which are absent in human-designed asset pricing models. “This allows better capture of nonlinear market behaviors,” adds Zuo.
What makes Symbolic Modeling particularly innovative is its ability to discover a unified mathematical model that can represent multiple assets simultaneously. Unlike traditional Symbolic Regression, which would create separate formulas for each company’s data, this new approach finds a single flexible expression that adapts to different datasets by adjusting its coefficients.
The authors see potential AI applications in financial asset pricing, portfolio optimization and development of trading strategies for complex market conditions. “We plan to expand our work by integrating the technique with more machine learning approaches to further enhance AI in finance,” says Zuo.
###
Contact the author: Xiangwu Zuo, Department of Computer Science, Texas A&M University, College Station, Texas, USA, dkflame@tamu.edu
The publisher KeAi was established by Elsevier and China Science Publishing & Media Ltd to unfold quality research globally. In 2013, our focus shifted to open access publishing. We now proudly publish more than 200 world-class, open access, English language journals, spanning all scientific disciplines. Many of these are titles we publish in partnership with prestigious societies and academic institutions, such as the National Natural Science Foundation of China (NSFC).
Journal
The Journal of Finance and Data Science
Method of Research
Computational simulation/modeling
Subject of Research
Not applicable
Article Title
Symbolic Modeling for financial asset pricing
COI Statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.