%0 Journal Article
%T Machine Learning Assisted Discovery of Novel Predictive Lab Tests Using Electronic Health Record Data
%A C. David Page
%A Finn Kuusisto
%A Ian Ross
%A Jeremy Weiss
%A Peggy L. Peissig
%A Ron Stewart
%A Ross Kleiman
%J Archive of "AMIA Summits on Translational Science Proceedings".
%D 2019
%X Epidemiological studies identifying biological markers of disease state are valuable, but can be time-consuming and expensive, and can require extensive intuition and expertise. Furthermore, not all hypothesized markers will be borne out in a study, suggesting that higher-quality initial hypotheses are crucial. In this work, we propose a high-throughput pipeline to produce a ranked list of high-quality hypothesized marker laboratory tests for diagnoses. Our pipeline generates a large number of candidate lab-diagnosis hypotheses derived from machine learning models, filters and ranks them according to their potential novelty using text mining, and corroborates final hypotheses with logistic regression analysis. We test our approach on a large electronic health record dataset and the PubMed corpus, and find several promising candidate hypotheses.
%U https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6568080/