全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

A Focused Bayesian Information Criterion

DOI: 10.1155/2014/504325

Full-Text   Cite this paper   Add to My Lib

Abstract:

Myriads of model selection criteria (Bayesian and frequentist) have been proposed in the literature aiming at selecting a single model regardless of its intended use. An honorable exception in the frequentist perspective is the “focused information criterion” (FIC) aiming at selecting a model based on the parameter of interest (focus). This paper takes the same view in the Bayesian context; that is, a model may be good for one estimand but bad for another. The proposed method exploits the Bayesian model averaging (BMA) machinery to obtain a new criterion, the focused Bayesian model averaging (FoBMA), for which the best model is the one whose estimate is closest to the BMA estimate. In particular, for two models, this criterion reduces to the classical Bayesian model selection scheme of choosing the model with the highest posterior probability. The new method is applied in linear regression, logistic regression, and survival analysis. This criterion is specially important in epidemiological studies in which the objective is often to determine a risk factor (focus) for a disease, adjusting for potential confounding factors. 1. Introduction A variety of model selection criteria (Bayesian or frequentist) have been proposed in the literature; most of them aim at selecting a single model for any purposes. For an overview of model selection criteria, see the studies by Leeb and Poetscher [1] and Zucchini [2]; for inference after model selection, see the studies by Nguefack-Tsague [3], Nguefack-Tsague and Zucchini [4], Zucchini et al. [5], Behl et al. [6], and Nguefack-Tsague [7–9]. Allen [10], within the context of Mallows’ [11], developed a criterion that depends on a given prediction. In a frequentist approach, Claeskens and Hjort [12] developed a focused information criterion (FIC) for model selection which, unlike common model selection criteria that lead to a single model for all purposes, selects different models for different purposes. Thus Allen’s criterion can be considered as an early precursor of FIC. So far the FIC is gaining in popularity as evidenced by its applications in various fields and specific models. Some of these applications include missing response (Sun et al. [13]), energy substitution (Behl et al. [6]), economic applications (Behl et al. [14]), Tobit model (Zhang et al. [15]), additive partial models (Zhang and Liang [16]), volatility forecasting (Brownlees and Gallo [17]), and Cox proportional hazard regression models (Hjort and Claeskens [18]). Focused information criterion and model averaging can be found in the studies by Sueishi

References

[1]  H. Leeb and B. M. Poetscher, “Model selection,” in Handbook of Financial Time Series, T. G. Anderson, R. A. Davis, J.-P. Kreiss, and T. Mikosch, Eds., pp. 785–821, Springer, New York, NY, USA, 2008.
[2]  W. Zucchini, “An introduction to model selection,” Journal of Mathematical Psychology, vol. 44, no. 1, pp. 41–61, 2000.
[3]  G. Nguefack-Tsague, “An alternative derivation of some commons distributions functions: a post-model selection approach,” International Journal of Applied Mathematics and Statistics, vol. 42, no. 12, pp. 138–147, 2013.
[4]  G. Nguefack-Tsague and W. Zucchini, “Post-model selection inference and model averaging,” Pakistan Journal of Statistics and Operation Research, vol. 7, no. 2, pp. 347–361, 2011.
[5]  W. Zucchini, G. Claeskens, and G. Nguefack-Tsague, “Model selection,” in International Encyclopedia of Statistical Sciences, M. Lovric, Ed., chapter part 13, pp. 830–833, Springer, Berlin, Germany, 2011.
[6]  P. Behl, H. Dette, M. Frondel, and H. Tauchmann, “Energy substitution: when model selection depends on the focus,” Energy Economics, vol. 39, pp. 233–238, 2013.
[7]  G. Nguefack-Tsague, “On bootstrap and post-model selection inference,” International Journal of Mathematics and Computation, vol. 21, no. 4, pp. 51–64, 2013.
[8]  G. Nguefack-Tsague, “Estimation of a multivariate mean under model selection uncertainty,” Pakistan Journal of Statistics and Operation Research, vol. 10, no. 2, pp. 131–145, 2014.
[9]  G. Nguefack-Tsague, “On optimal weighting scheme in model averaging,” American Journal of Applied Mathematics and Statistics, vol. 2, no. 3, pp. 150–156, 2014.
[10]  D. M. Allen, “Mean square error of prediction as a criterion for selecting variables,” Technometrics, vol. 13, no. 3, pp. 469–475, 1971.
[11]  C. L. Mallows, “Some comments on ,” Technometrics, vol. 15, no. 4, pp. 661–675, 1973.
[12]  G. Claeskens and N. L. Hjort, “The focused information criterion,” Journal of the American Statistical Association, vol. 98, no. 464, pp. 900–916, 2003.
[13]  Z. Sun, Z. Su, and J. Ma, “Focused vector information criterion model selection and model averaging regression with missing response,” Metrika, vol. 77, no. 3, pp. 415–432, 2014.
[14]  P. Behl, H. Dette, M. Frondel, and H. Tauchmann, “Choice is suffering: a focused information criterion for model selection,” Economic Modelling, vol. 29, no. 3, pp. 817–822, 2012.
[15]  X. Zhang, A. T. Wan, and S. Z. Zhou, “Focused information criteria, model selection, and model averaging in a Tobit model with a nonzero threshold,” Journal of Business and Economic Statistics, vol. 30, no. 1, pp. 132–142, 2012.
[16]  X. Zhang and H. Liang, “Focused information criterion and model averaging for generalized additive partial linear models,” Annals of Statistics, vol. 39, no. 1, pp. 174–200, 2011.
[17]  C. T. Brownlees and G. M. Gallo, “On variable selection for volatility forecasting: the role of focused selection criteria,” Journal of Financial Econometrics, vol. 6, no. 4, pp. 513–539, 2008.
[18]  N. L. Hjort and G. Claeskens, “Focused information criteria and model averaging for the Cox’s hazard regression model,” Journal of the American Statistical Association, vol. 101, no. 476, pp. 1449–1464, 2006.
[19]  N. Sueishi, “Generalized empirical likelihood-based focused information criterion and model averaging,” Econometrics, vol. 1, no. 2, pp. 141–156, 2013.
[20]  J. Du, Z. Zhang, and T. Xie, “Focused information criterion and model averaging in quantile regression,” Communications in Statistics: Theory and Methods, vol. 42, no. 20, pp. 3716–3734, 2013.
[21]  P. Behl, G. Claeskens, and H. Dette, “Focused model selection in quantile regression,” Statistica Sinica, vol. 24, pp. 601–624, 2014.
[22]  G. Xu, S. Wang, and J. Z. Huang, “Focused information criterion and model averaging based on weighted composite quantile regression,” Scandinavian Journal of Statistics, vol. 41, no. 2, pp. 365–381, 2014.
[23]  R. E. Kass and A. E. Raftery, “Bayes factors,” Journal of the American Statistical Association, vol. 90, no. 430, pp. 773–795, 1995.
[24]  G. Schwarz, “Estimating the dimension of a model,” The Annals of Statistics, vol. 6, no. 2, pp. 461–464, 1978.
[25]  Y. Guan and M. Stephens, “Bayesian variable selection regression for genome-wide association studies and other large-scale problems,” The Annals of Applied Statistics, vol. 5, no. 3, pp. 1780–1815, 2011.
[26]  G. Nguefack-Tsague, “Using bayesian networks to model hierarchical relationships in epidemiological studies,” Epidemiology and Health, vol. 33, no. 1, Article ID e2011006, 2011.
[27]  C. M. Carvalho and J. G. Scott, “Objective Bayesian model selection in Gaussian graphical models,” Biometrika, vol. 96, no. 3, pp. 497–512, 2009.
[28]  B. L. Fridley, “Bayesian variable and model selection methods for genetic association studies,” Genetic Epidemiology Series B, vol. 33, no. 1, pp. 27–37, 2009.
[29]  C. P. Robert, The Bayesian choice, Springer, New York, NY, USA, 2001.
[30]  F. Liang, R. Paulo, G. Molina, M. A. Clyde, and J. O. Berger, “Mixtures of priors for Bayesian variable selection,” Journal of the American Statistical Association, vol. 103, no. 481, pp. 410–423, 2008.
[31]  J.-M. Bernardo and A. F. M. Smith, Bayesian Theory, John Wiley & Sons, 1994.
[32]  S. Geisser, Predictive Inference: An Introduction, Prentice Hall, New York, NY, USA, 1993.
[33]  M. A. Clyde and E. I. George, “Model uncertainty,” Statistical Science, vol. 19, no. 1, pp. 81–94, 2004.
[34]  A. E. Raftery, D. Madigan, and J. A. Hoeting, “Bayesian model averaging for linear regression models,” Journal of the American Statistical Association, vol. 92, no. 437, pp. 179–191, 1997.
[35]  E. E. Leamer, Specification Searches: Ad Hoc Inference with Experimental Data, John Wiley & Sons, New York, NY, USA, 1978.
[36]  D. Draper, “Assessment and propagation of model uncertainty,” Journal of the Royal Statistical Society Series B, vol. 57, no. 1, pp. 45–97, 1995.
[37]  D. Madigan and A. E Raftery, “Model selection and accounting for model uncertainty in graphical models using Occam's window,” Journal of the American Statistical Association, vol. 89, pp. 1535–1546, 1994.
[38]  I. J. Good, “Rational decisions,” Journal of the Royal Statistical Society Series B: Methodological, vol. 14, pp. 107–114, 1952.
[39]  J. A. Hoeting, D. Madigan, A. E. Raftery, and C. T. Volinsky, “Bayesian model averaging: a tutorial,” Statistical Science, vol. 14, no. 4, pp. 382–417, 1999.
[40]  M. A. Clyde, “Bayesian model averaging and model search strategies,” in Bayesian Statistics VI, pp. 157–185, Oxford University Press, Oxford, UK, 1999.
[41]  G. Nguefack-Tsague, “Bayesian estimation of a multivariate mean under model uncertainty,” International Journal of Mathematics and Statistics, vol. 13, no. 1, pp. 83–92, 2013.
[42]  R Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2013.
[43]  A. E. Raftery, I. S. Painter, and C. T. Volinsky, “BMA: an R package for Bayesian model averaging,” Rnews, vol. 5, no. 2, pp. 2–8, 2005.
[44]  I. Ehrlich, “Participation in illegitimate activities: a theoretical and empirical investigation,” Journal of Political Economy B, vol. 81, pp. 521–565, 1973.
[45]  A. E. Raftery, “Approximate Bayes factors and accounting for model uncertainty in generalised linear models,” Biometrika, vol. 83, no. 2, pp. 251–266, 1996.
[46]  D. W. Hosmer and S. Lemeshow, Applied Logistic Regression, Wiley, New York, NY, USA, 1989.
[47]  C. Candolo, “The focused information criterion in logistic regression to predict repair of dental restorations,” Revista Brasileira de Biometria, vol. 31, no. 4, pp. 547–557, 2014.
[48]  T. Therneau and P. Grambsch, Modeling Survival Data: Extending the Cox Model, University Press, Cambridge, Mass, USA, 2000.
[49]  C. T. Volinsky, D. Madigan, A. E. Raftery, and R. A. Kronmal, “Bayesian model averaging in proportional hazard models: assessing the risk of a stroke,” Journal of the Royal Statistical Society C: Applied Statistics, vol. 46, no. 4, pp. 433–448, 1997.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413