News Release

Generating asset pricing model via symbolic modeling—a machine learning-based approach

Peer-Reviewed Publication

KeAi Communications Co., Ltd.

COMPARISON OF FUNCTIONS DISCOVERED BY SYMBOLIC REGRESSION, AND FUNCTIONS DISCOVERED BY SYMBOLIC MODELING

image: 

COMPARISON of Functions discovered by Symbolic Regression, and functions discovered by Symbolic Modeling

view more 

Credit: Xiangwu Zuo, Anxiao (Andrew) Jiang

In a breakthrough for artificial intelligence (AI) and finance, computer scientists from Texas A&M University have developed a machine learning based method called Symbolic Modeling to handle financial asset pricing tasks. Published in The Journal of Finance and Data Science, their approach reduces prediction errors compared to the widely used Fama-French 3-Factor model, while maintaining an acceptable model length.

Classic asset pricing models rely on linear combinations of manually selected financial factors, such as market volatility and company size. While effective, these models struggle with complex market dynamics. The proposed approach instead uses genetic programming and deep learning to automatically generate nonlinear expression that adapt to multiple datasets simultaneously.

“Most of the current financial asset pricing models are like a fixed recipe—you combine predetermined ingredients in set proportions. Symbolic Modeling is more like a master chef who creates entirely new recipes optimized for each diner,” explains Xiangwu Zuo, first author of this study, “The expressions generated by our method take into consideration for hidden relationships between market factors that humans might overlook.”

Zuo and the co-author, Andrew Jiang, generated an asset pricing model based on nearly four decades (1980–2018) of financial data from hundreds of companies, including well-known firms like Coca-Cola and ExxonMobil. The researchers tested their approach across both training and testing data, finding consistent improvements over traditional models.

“Our model achieves lower prediction errors compared to the classic asset pricing models,” shares Zuo. “Beyond just improved prediction accuracy, the model also reduces the “alpha” value—a crucial metric in finance representing unexplained returns.”

Notably, the models generated by the Symbolic Modeling method incorporated nonlinear factors combinations, which are absent in human-designed asset pricing models. “This allows better capture of nonlinear market behaviors,” adds Zuo.

What makes Symbolic Modeling particularly innovative is its ability to discover a unified mathematical model that can represent multiple assets simultaneously. Unlike traditional Symbolic Regression, which would create separate formulas for each company’s data, this new approach finds a single flexible expression that adapts to different datasets by adjusting its coefficients.

The authors see potential AI applications in financial asset pricing, portfolio optimization and development of trading strategies for complex market conditions. “We plan to expand our work by integrating the technique with more machine learning approaches to further enhance AI in finance,” says Zuo.

###

Contact the author: Xiangwu Zuo, Department of Computer Science, Texas A&M University, College Station, Texas, USA, dkflame@tamu.edu

The publisher KeAi was established by Elsevier and China Science Publishing & Media Ltd to unfold quality research globally. In 2013, our focus shifted to open access publishing. We now proudly publish more than 200 world-class, open access, English language journals, spanning all scientific disciplines. Many of these are titles we publish in partnership with prestigious societies and academic institutions, such as the National Natural Science Foundation of China (NSFC).


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.