%0 Journal Article %T An Application of Machine Learning to Thalassemia Diagnosis %A Sitan Liu %J Journal of Computer and Communications %P 211-230 %@ 2327-5227 %D 2024 %I Scientific Research Publishing %R 10.4236/jcc.2024.122013 %X Mediterranean anemia is a genetic disease that currently relies heavily on expert clinical experience to determine whether patients are affected. This method is overly reliant on expert experience and is not precise enough. This paper proposes two modeling methods to predict whether patients have Mediterranean anemia. The first method involves using Principal Component Analysis (PCA) to reduce the dimensionality of the data, followed by logistic regression modeling (PCA-LR) on the reduced dataset. The second method involves building a Partial Least Squares Regression (PLS) model. Experimental results show that the prediction accuracy of the PCA-LR model is 87.5% (degree = 2, ¦Ë=4), and the prediction accuracy of the PLS model is 92.5% (ncomp = 4), indicating good predictive performance of the models. %K Multicollinearity %K Statistical Analysis Models %K Data Mining %K PCA-LR %K PLS %U http://www.scirp.org/journal/PaperInformation.aspx?PaperID=131568