A Machine Learning Approach to Predict Creatine Kinase Test Results

Zehra Nur Canbolat, Gökhan Silahtaroğlu, Özge Doğuç, Nevin Yılmaztürk


Most of the research done in the literature are based on statistical approaches and used for deriving reference limits based on lab results. As more data are available to the researchers, ML methods are more effectively used by the clinicians and practitioners to reduce cost and provide more accurate diagnoses. This study aims to contribute to the medical laboratory processes by providing an automated method in order to predict the lab results accurately by machine learning from the previous test results. All patient data obtained have been anonymized, and a total of 449,471 test results have been used to build an integrated dataset. A total of 107,646 unique patients’ data has been used. This study aims to predict the value range of the Creatine Kinase tests, which are taken in separate tubes and usually needs more processing time than the other tests do. Using the lab results and the Random Forest Algorithm, this study reports that the outcome of the Creatine Kinase test can be determined with 97% accuracy by using the AST and ALT test values. This is an important achievement for the practitioners and the patients, as this study submits significant reduction in Creating Kinase test evaluation time.


Laboratory Tests; Creatine Kinase; Data Mining; Machine Learning; Decision Tree.


Cabitza, Federico, and Giuseppe Banfi. “Machine Learning in Laboratory Medicine: Waiting for the Flood?” Clinical Chemistry and Laboratory Medicine (CCLM) 56, no. 4 (March 28, 2018): 516–524. doi:10.1515/cclm-2017-0287.

Jones, Richard G., Owen A. Johnson, and Gifford Batstone. "Informatics and the clinical laboratory." The Clinical Biochemist Reviews 35, no. 3 (2014): 177.

Horowitz, Gary L. “The Power of Asterisks.” Clinical Chemistry 61, no. 8 (August 1, 2015): 1009–1011. doi:10.1373/clinchem.2015.243048.

Connelly, D. P. "Embedding expert systems in laboratory information systems." American journal of clinical pathology 94, no. 4 Suppl 1 (1990): S7-14.

Yamamoto, Yoichiro, Akira Saito, Ayako Tateishi, Hisashi Shimojo, Hiroyuki Kanno, Shinichi Tsuchiya, Ken-ichi Ito, et al. “Quantitative Diagnosis of Breast Tumors by Morphometric Classification of Microenvironmental Myoepithelial Cells Using a Machine Learning Approach.” Scientific Reports 7, no. 1 (April 25, 2017). doi:10.1038/srep46732.

Badrick, Tony. "Evidence-based laboratory medicine." The Clinical Biochemist Reviews 34, no. 2 (2013): 43.

Luo, Yuan, Peter Szolovits, Anand S. Dighe, and Jason M. Baron. “Using Machine Learning to Predict Laboratory Test Results.” American Journal of Clinical Pathology 145, no. 6 (June 2016): 778–788. doi:10.1093/ajcp/aqw064.

Shih, Mu-Chin, Huey-Mei Chang, Ni Tien, Chiung-Tzu Hsiao, and Ching-Tien Peng. “Building and Validating an Autoverification System in the Clinical Chemistry Laboratory.” Laboratory Medicine 42, no. 11 (November 2011): 668–673. doi:10.1309/lm5am4iixc4oietd.

Lippi, Giuseppe, Antonella Bassi, and Chiara Bovo. “The Future of Laboratory Medicine in the Era of Precision Medicine.” Journal of Laboratory and Precision Medicine 1 (2016): 7–7. doi:10.21037/jlpm.2016.12.01.

B. Raheemi, ‘Data Mining and Knowledge Discovery in Healthcare and Medicine’, University of Ottawa, 2014.

Silahtaroğlu, Gökhan, and Nevin Yılmaztürk. “Data Analysis in Health and Big Data: A Machine Learning Medical Diagnosis Model Based on Patients’ Complaints.” Communications in Statistics - Theory and Methods (June 3, 2019): 1–10. doi:10.1080/03610926.2019.1622728.

Contreras, Ivan, and Josep Vehi. “Artificial Intelligence for Diabetes Management and Decision Support: Literature Review.” Journal of Medical Internet Research 20, no. 5 (May 30, 2018): e10775. doi:10.2196/10775.

Kang, YannaShen, and Mehmet Kayaalp. “Extracting Laboratory Test Information from Biomedical Text.” Journal of Pathology Informatics 4, no. 1 (2013): 23. doi:10.4103/2153-3539.117450.

Hao, Tianyong, Hongfang Liu, and Chunhua Weng. “Valx: A System for Extracting and Structuring Numeric Lab Test Comparison Statements from Text.” Methods of Information in Medicine 55, no. 03 (2016): 266–275. doi:10.3414/me15-01-0112.

Bing, Lidong, Tak-Lam Wong, and Wai Lam. “Unsupervised Extraction of Popular Product Attributes from E-Commerce Web Sites by Considering Customer Reviews.” ACM Transactions on Internet Technology 16, no. 2 (April 20, 2016): 1–17. doi:10.1145/2857054.

Shinzato, Keiji, and Satoshi Sekine. "Unsupervised extraction of attributes and their values from product description." In Proceedings of the Sixth International Joint Conference on Natural Language Processing, (2013): 1339-1347.

Silahtaroğlu, Gökhan, and Zehra Nur Canbolat. “An Early Prediction And Diagnosis Of Sepsis in Intensive Care Units: An Unsupervised Machine Learning Model.” Mugla Journal of Science and Technology (June 15, 2020). doi:10.22531/muglajsci.643554.

Nelson, David W., Anders Rudehill, Robert M. MacCallum, Anders Holst, Michael Wanecek, Eddie Weitzberg, and Bo-Michael Bellander. “Multivariate Outcome Prediction in Traumatic Brain Injury with Focus on Laboratory Values.” Journal of Neurotrauma 29, no. 17 (November 20, 2012): 2613–2624. doi:10.1089/neu.2012.2468.

Razavian, Narges, Saul Blecker, Ann Marie Schmidt, Aaron Smith-McLallen, Somesh Nigam, and David Sontag. “Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.” Big Data 3, no. 4 (December 2015): 277–287. doi:10.1089/big.2015.0020.

Yuan, Cao, Cheng Ming, and Hu Chengjin. “UrineCART, a Machine Learning Method for Establishment of Review Rules Based on UF-1000i Flow Cytometry and Dipstick or Reflectance Photometer.” Clinical Chemistry and Laboratory Medicine 50, no. 12 (January 1, 2012). doi:10.1515/cclm-2012-0272.

Goldstein, Benjamin A., Ann Marie Navar, and Rickey E. Carter. “Moving Beyond Regression Techniques in Cardiovascular Risk Prediction: Applying Machine Learning to Address Analytic Challenges.” European Heart Journal (July 19, 2016): ehw302. doi:10.1093/eurheartj/ehw302.

Doguc Özge, Canbolat Z.N. and Yilmazturk N. '"Intelligent Lab Test Approval Support System". In Proceeding of International Aegean Symposiums on Social Sciences & Humanities, Izmir Turkey 2020.

Delanaye, Pierre, Etienne Cavalier, and Hans Pottel. “Serum Creatinine: Not So Simple!” Nephron 136, no. 4 (2017): 302–308. doi:10.1159/000469669.

Bloom, B., J. Pott, Y. Freund, J. Grundlingh, and T. Harris. “The Agreement Between Abnormal Venous Lactate and Arterial Lactate in the ED: a Retrospective Chart Review.” The American Journal of Emergency Medicine 32, no. 6 (June 2014): 596–600. doi:10.1016/j.ajem.2014.03.007.

Brancaccio, P., N. Maffulli, and F. M. Limongelli. “Creatine Kinase Monitoring in Sport Medicine.” British Medical Bulletin 81–82, no. 1 (February 6, 2007): 209–230. doi:10.1093/bmb/ldm014.

"Creatine Kinase: Reference Range, Interpretation, Collection and Panels". Available online: https://emedicine.medscape.com/article/2074023-overview (accessed on 16 January 2020).

Denis, Francois, Anne Laurent, Rémi Gilleron, and Marc Tommasi. "Text classification and co-training from positive and unlabeled examples." In Proceedings of the ICML 2003 workshop: the continuum from labeled to unlabeled data, pp. 80-87. 2003.

Agatonovic-Kustrin, S, and R Beresford. “Basic Concepts of Artificial Neural Network (ANN) Modeling and Its Application in Pharmaceutical Research.” Journal of Pharmaceutical and Biomedical Analysis 22, no. 5 (June 2000): 717–727. doi:10.1016/s0731-7085(99)00272-1.

Estruch, V., C. Ferri, J. Hernández-Orallo, and M.J. Ramírez-Quintana. “Web Categorisation Using Distance-Based Decision Trees.” Electronic Notes in Theoretical Computer Science 157, no. 2 (May 2006): 35–40. doi:10.1016/j.entcs.2005.12.043.

Fletcher, G.P., and C.J. Hinde. “Interpretation of Neural Networks as Boolean Transfer Functions.” Knowledge-Based Systems 7, no. 3 (September 1994): 207–214. doi:10.1016/0950-7051(94)90007-8.

Xhemali, D., J HINDE, C., & G STONE, R. (2009). Naïve bayes vs. decision trees vs. neural networks in the classification of training web pages. D. XHEMALI, CJ HINDE and Roger G. STONE," Naive Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages", International Journal of Computer Science Issues, IJCSI, Volume 4, Issue 1, pp16-23, September 2009, 4(1).

Zou, Jinming, Yi Han, and Sung-Sau So. “Overview of Artificial Neural Networks.” Artificial Neural Networks (2008): 14–22. doi:10.1007/978-1-60327-101-1_2.

Chen, Enhong, Zhengya Zhang, Xufa Wang, and Jie Yang. “Index Based Document Classification with CC4 Neural Networks.” Intelligent Agent Technology (September 2001). doi:10.1142/9789812811042_0039.

Fletcher, G.P., and C.J. Hinde. “Using Neural Networks as a Tool for Constructing Rule Based Systems.” Knowledge-Based Systems 8, no. 4 (August 1995): 183–189. doi:10.1016/0950-7051(95)96215-d.

Obermeyer, Ziad, and Ezekiel J. Emanuel. “Predicting the Future — Big Data, Machine Learning, and Clinical Medicine.” New England Journal of Medicine 375, no. 13 (September 29, 2016): 1216–1219. doi:10.1056/nejmp1606181.

Forsting, Michael. “Machine Learning Will Change Medicine.” Journal of Nuclear Medicine 58, no. 3 (February 2, 2017): 357–358. doi:10.2967/jnumed.117.190397.

Lidbury, Brett A., Alice M. Richardson, and Tony Badrick. “Assessment of Machine-Learning Techniques on Large Pathology Data Sets to Address Assay Redundancy in Routine Liver Function Test Profiles.” Diagnosis 2, no. 1 (February 1, 2015): 41–51. doi:10.1515/dx-2014-0063.

Cismondi, F., L.A. Celi, A.S. Fialho, S.M. Vieira, S.R. Reti, J.M.C. Sousa, and S.N. Finkelstein. “Reducing Unnecessary Lab Testing in the ICU with Artificial Intelligence.” International Journal of Medical Informatics 82, no. 5 (May 2013): 345–358. doi:10.1016/j.ijmedinf.2012.11.017.

Ardakani, Ali Abbasian, Alireza Rajabzadeh Kanafi, U. Rajendra Acharya, Nazanin Khadem, and Afshin Mohammadi. “Application of Deep Learning Technique to Manage COVID-19 in Routine Clinical Practice Using CT Images: Results of 10 Convolutional Neural Networks.” Computers in Biology and Medicine 121 (June 2020): 103795. doi:10.1016/j.compbiomed.2020.103795.

Luo, Yuan, Peter Szolovits, Anand S. Dighe, and Jason M. Baron. “Using Machine Learning to Predict Laboratory Test Results.” American Journal of Clinical Pathology 145, no. 6 (June 2016): 778–788. doi:10.1093/ajcp/aqw064.

Kumwilaisak, Kanya, Alberto Noto, Ulrich H. Schmidt, Clare I. Beck, Claudia Crimi, Kent Lewandrowski, and Luca M. Bigatello. “Effect of Laboratory Testing Guidelines on the Utilization of Tests and Order Entries in a Surgical Intensive Care Unit*.” Critical Care Medicine 36, no. 11 (November 2008): 2993–2999. doi:10.1097/ccm.0b013e31818b3a9d.

Gortmaker, Steven L., Arthur F. Bickford, Herbert O. Mathewson, Karin Dumbaugh, and Peter C. Tirrell. “A Successful Experiment to Reduce Unnecessary Laboratory Use in a Community Hospital.” Medical Care 26, no. 6 (June 1988): 631–642. doi:10.1097/00005650-198806000-00011.

Baigelman, W., S. J. Bellin, L. A. Cupples, D. Dombrowski, and J. Coldiron. “Overutilization of Serum Electrolyte Determinations in Critical Care Units.” Intensive Care Medicine 11, no. 6 (December 1985): 304–308. doi:10.1007/bf00273541.

Van der Aalst, W. M. P., V. Rubin, H. M. W. Verbeek, B. F. van Dongen, E. Kindler, and C. W. Günther. “Process Mining: a Two-Step Approach to Balance between Underfitting and Overfitting.” Software & Systems Modeling 9, no. 1 (November 25, 2008): 87–111. doi:10.1007/s10270-008-0106-z.

Full Text: PDF

DOI: 10.28991/esj-2020-01231


  • There are currently no refbacks.

Copyright (c) 2020 Zehra Nur Canbolat, Gökhan Silahtaroğlu, Özge Doğuç, Nevin Yılmaztürk