Cancer has become a major concern in recent years, and cancer genomics is now a key research direction in genetics and biomedicine. This paper applies machine learning methods to gene data from five cancer types (breast, kidney, colon, lung, and prostate), with the goal of building a robust classification model that distinguishes among them. Such a model could support earlier identification of each cancer type and thereby help reduce mortality.