全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

高维因子模型两类“主成分”估计的比较——以S & P 500股票数据分析为例
Comparison of Two Types of Principal Component Estimation in High-Dimensional Factor Model—A Case Study of S & P 500 Stock Data Analysis

DOI: 10.12677/sa.2024.132045, PP. 453-460

Keywords: 高维因子模型,主成分估计,信息准则,特征值比值估计
High-Dimensional Factor Model
, Principal Component Estimation, Information Criterion, Eigenvalue Ratio Estimation

Full-Text   Cite this paper   Add to My Lib

Abstract:

高维因子模型在超高维度的大型数据集降维处理中发挥了重要作用。目前,高维因子模型有两种主成分估计方法,分别是基于协方差的主成分估计PCE和基于滞后自协方差的主成分估计LPCE。本文以S & P 500公司股票数据的高维因子建模为例,比较了PCE和LPCE在高维股票数据降维中的实际表现,其中因子个数通过信息准则法和特征值比值估计法确定。结果表明,在高维非平稳序列因子模型中,PCE的均方根误差和预测误差都比LPCE小,PCE得到的因子也比LPCE更能捕捉高维非平稳序列变化特征。在高维平稳序列因子模型中,PCE和LPCE的估计误差相同,两者的估计因子均能还原高维平稳序列的变化特征。此外,在确定因子个数时,信息准则倾向于高估因子个数,表现出严重的过拟合。特征值比值估计法的估计结果相对更准确和稳定,在PCE中倾向于放弃相对弱势的主成分,在LPCE中则倾向于将弱势的主成分视为因子。
High-dimensional factor models play a crucial role in dimensionality reduction of large datasets with ultra-high dimensions. Currently, there are two principal component methods for estimating high-dimensional factor models: Principal Component Estimation (PCE) based on covariance and Lagged Principal Component Estimation (LPCE) based on lagged autocovariance. This paper utilizes high-dimensional factor modeling of S & P 500 company stock data as a case study to compare the practical performance of PCE and LPCE in dimensionality reduction of high-dimensional stock data, where the number of factors is determined through the information criterion method and eigenvalue ratio method. Results indicate that in high-dimensional non-stationary sequence factor models, both the root mean square error and prediction error of PCE are smaller than LPCE. Additionally, factors obtained from PCE are more effective in capturing the characteristics of high-dimensional non-stationary sequence changes compared to LPCE. In high-dimensional stationary sequence factor models, the estimation errors of PCE and LPCE are identical, and both estimation methods can effectively capture the changing characteristics of high-dimensional stationary sequences. Furthermore, when determining the number of factors, the information criterion tends to overestimate the number of factors, indicating severe overfitting. The eigenvalue ratio method provides relatively more accurate and stable estimation results, with PCE tending to discard relatively weaker principal components, while LPCE tends to treat weaker principal components as factors.

References

[1]  廖春芳, 刘金山. 基于贝叶斯方法的高维因子模型在中国股市的应用[J]. 佛山科学技术学院学报(自然科学版), 2016, 34(6): 31-36.
[2]  李伯龙. 基于高维分位数因子模型的中国股市尾部系统风险分析[J]. 系统管理学报, 2021, 30(6): 1079-1087.
[3]  郑红景, 蒋梦梦, 周杰. 股票市场的高维动态因子模型及其实证分析[J]. 计算机工程与应用, 2020, 56(12): 243-249.
[4]  Bai, J. (2003) Inferential Theory for Factor Models of Large Dimensions. Econometrica, 71, 135-171.
https://doi.org/10.1111/1468-0262.00392
[5]  Lam, C., Yao, Q. and Bathia, N. (2011) Estimation of Latent Factors for High-Dimensional Time Series. Biometrika, 98, 901-918.
https://doi.org/10.1093/biomet/asr048
[6]  Bai, J. and Ng, S. (2002) Determining the Number of Factors in Approximate Factor Models. Econometrica, 70, 191-221.
https://doi.org/10.1111/1468-0262.00273
[7]  Lam, C. and Yao, Q. (2012) Factor Modeling for High-Dimensional Time Series: Inference for the Number of Factors. The Annals of Statistics, 40, 694-726.
https://doi.org/10.1214/12-AOS970
[8]  Ahn, S.C. and Horenstein, A.R. (2013) Eigenvalue Ratio Test for the Number of Factors. Econometrica, 81, 1203-1227.
https://doi.org/10.3982/ECTA8968
[9]  Xia, Q., Liang, R. and Wu, J. (2017) Transformed Contribution Ratio Test for the Number of Factors in Static Approximate Factor Models. Computational Statistics & Data Analysis, 112, 235-241.
https://doi.org/10.1016/j.csda.2017.03.005
[10]  Chamberlain, G. and Rothschild, M. (1983) Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets. Econometrica, 51, 1281-1304.
https://doi.org/10.2307/1912275

Full-Text

Contact Us

[email protected]

QQ:3279437679

WhatsApp +8615387084133