OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

Epidemiology Research International 2013

Robust Medical Test Evaluation Using Flexible Bayesian Semiparametric Regression Models

DOI: 10.1155/2013/131232

Adam J. Branscum,Wesley O. Johnson,Andre T. Baron

Full-Text Cite this paper Add to My Lib

Abstract:

The application of Bayesian methods is increasing in modern epidemiology. Although parametric Bayesian analysis has penetrated the population health sciences, flexible nonparametric Bayesian methods have received less attention. A goal in nonparametric Bayesian analysis is to estimate unknown functions (e.g., density or distribution functions) rather than scalar parameters (e.g., means or proportions). For instance, ROC curves are obtained from the distribution functions corresponding to continuous biomarker data taken from healthy and diseased populations. Standard parametric approaches to Bayesian analysis involve distributions with a small number of parameters, where the prior specification is relatively straight forward. In the nonparametric Bayesian case, the prior is placed on an infinite dimensional space of all distributions, which requires special methods. A popular approach to nonparametric Bayesian analysis that involves Polya tree prior distributions is described. We provide example code to illustrate how models that contain Polya tree priors can be fit using SAS software. The methods are used to evaluate the covariate-specific accuracy of the biomarker, soluble epidermal growth factor receptor, for discerning lung cancer cases from controls using a flexible ROC regression modeling framework. The application highlights the usefulness of flexible models over a standard parametric method for estimating ROC curves. 1. Introduction Bayesian analysis is often used in the support of epidemiologic research [1–7]. A contemporary area of research in the population health sciences involves the development and application of statistical methods for evaluating the accuracy of medical tests. With binary outcome data, statistical methods focus on estimating sensitivity and specificity, while with quantitative data, standard objects of interest are the receiver operating characteristic (ROC) curve and area under the curve (AUC). The ROC curve can be regarded as a graphical portrayal of the degree of separation between the distributions of test outcomes for “diseased” and nondiseased populations. The formula for an ROC curve depends on and , the sensitivity and specificity of the test at classification threshold . We proceed under the innocuous assumption that test outcomes tend to be higher for individuals in the diseased population. Let denote a continuous test outcome for a disease , where disease status is labeled as for disease absent and for disease present. In general, can be any continuously measured classifier that varies according to a cumulative

References

[1]	D. B. Dunson, “Commentary: practical advantages of Bayesian analysis of epidemiologic data,” American Journal of Epidemiology, vol. 153, no. 12, pp. 1222–1226, 2001.
[2]	S. Greenland, “Bayesian perspectives for epidemiological research: I. Foundations and basic methods,” International Journal of Epidemiology, vol. 35, no. 3, pp. 765–775, 2006.
[3]	S. Greenland, “Bayesian perspectives for epidemiological research. II. Regression analysis,” International Journal of Epidemiology, vol. 36, no. 1, pp. 195–202, 2007.
[4]	S. Greenland, “Bayesian perspectives for epidemiologic research: III. Bias analysis via missing-data methods,” International Journal of Epidemiology, vol. 38, no. 6, pp. 1662–1673, 2009.
[5]	A. Lawson, Bayesian Disease Mapping: Hierarchical Modeling in Spatial Epidemiology, CRC Press, Boca Raton, Fla, USA, 2009.
[6]	R. F. MacLehose, J. M. Oakes, and B. P. Carlin, “Turning the Bayesian crank,” Epidemiology, vol. 22, no. 3, pp. 365–367, 2011.
[7]	S. R. Cole, H. Chu, S. Greenland, G. Hamra, and D. B. Richardson, “Bayesian posterior distributions without Markov chains,” American Journal of Epidemiology, vol. 175, no. 5, pp. 368–375, 2012.
[8]	A. Erkanli, M. Sung, E. J. Costello, and A. Angold, “Bayesian semi-parametric ROC analysis,” Statistics in Medicine, vol. 25, no. 22, pp. 3905–3928, 2006.
[9]	T. E. Hanson, A. Kottas, and A. J. Branscum, “Modelling stochastic order in the analysis of receiver operating characteristic data: Bayesian non-parametric approaches,” Journal of the Royal Statistical Society C, vol. 57, no. 2, pp. 207–225, 2008.
[10]	A. J. Branscum, W. O. Johnson, T. E. Hanson, and I. A. Gardner, “Bayesian semiparametric ROC curve estimation and disease diagnosis,” Statistics in Medicine, vol. 27, no. 13, pp. 2474–2496, 2008.
[11]	T. E. Hanson, A. J. Branscum, and I. A. Gardner, “Multivariate mixtures of Polya trees for modeling ROC data,” Statistical Modelling, vol. 8, no. 1, pp. 81–96, 2008.
[12]	J. Gu, S. Ghosal, and A. Roy, “Bayesian bootstrap estimation of ROC curve,” Statistics in Medicine, vol. 27, no. 26, pp. 5407–5420, 2008.
[13]	C. Wang, B. W. Turnbull, Y. T. Gr？hn, and S. S. Nielsen, “Nonparametric estimation of ROC curves based on Bayesian models when the true disease state is unknown,” Journal of Agricultural, Biological, and Environmental Statistics, vol. 12, no. 1, pp. 128–146, 2007.
[14]	G. T. Fosgate, H. M. Scott, and E. R. Jordan, “Development of a method for Bayesian nonparametric ROC analysis with application to an ELISA for Johne's disease in dairy cattle,” Preventive Veterinary Medicine, vol. 81, no. 1–3, pp. 178–193, 2007.
[15]	V. Inácio, A. A. Turkman, C. T. Nakas, and T. A. Alonzo, “Nonparametric Bayesian estimation of the three-way receiver operating characteristic surface,” Biometrical Journal, vol. 53, no. 6, pp. 1011–1024, 2011.
[16]	I. V. de Carvalho, A. Jara, T. E. Hanson, and M. de Carvalho, “Bayesian nonparametric ROC regression modeling,” Bayesian Analysis, vol. 8, no. 3, pp. 623–646, 2013.
[17]	M. Ladouceur, E. Rahme, P. Bélisle, A. N. Scott, K. Schwartzman, and L. Joseph, “Modeling continuous diagnostic test data using approximate Dirichlet process distributions,” Statistics in Medicine, vol. 30, no. 21, pp. 2648–2662, 2011.
[18]	T. Hanson and W. O. Johnson, “Modeling regression error with a mixture of Polya trees,” Journal of the American Statistical Association, vol. 97, no. 460, pp. 1020–1033, 2002.
[19]	T. E. Hanson, “Inference for mixtures of finite Polya tree models,” Journal of the American Statistical Association, vol. 101, no. 476, pp. 1548–1565, 2006.
[20]	E. J. Bedrick, R. Christensen, and W. Johnson, “A new perspective on priors for generalized linear models,” Journal of the American Statistical Association, vol. 91, no. 436, pp. 1450–1460, 1996.
[21]	A. T. Baron, J. M. Lafky, C. H. Boardman et al., “Serum sErbB1 and epidermal growth factor levels as tumor biomarkers in women with stage III or IV epithelial ovarian cancer,” Cancer Epidemiology Biomarkers and Prevention, vol. 8, no. 2, pp. 129–137, 1999.
[22]	A. T. Baron, C. H. Boardman, J. M. Lafky et al., “Soluble epidermal growth factor receptor (SEG-FR) and cancer antigen 125 (CA125) as screening and diagnostic tests for epithelial ovarian cancer,” Cancer Epidemiology Biomarkers and Prevention, vol. 14, no. 2, pp. 306–318, 2005.
[23]	S. Geisser and W. F. Eddy, “A predictive approach to model selection,” Journal of the American Statistical Association, vol. 74, pp. 153–160, 1979.
[24]	R. Christensen, W. Johnson, A. Branscum, and T. Hanson, Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians, CRC Press, Boca Raton, Fla, USA, 2010.
[25]	R. Christensen, T. Hanson, and A. Jara, “Parametric nonparametric statistics: an introduction to mixtures of finite Polya trees,” The American Statistician, vol. 62, no. 4, pp. 296–306, 2008.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413