Comparing Supervised Machine Learning Models for Covid-19 patient detection using a Combination of Clinical and Laboratory Dataset

Document Type : Original Article


1 Department of Information Technology, Tarbiat Modares University, Tehran, Iran

2 Department of Internal Medicine, School of Medicine, Shahid Beheshti University of Medical Sciences, Imam Hossein Hospital, Tehran, Iran


COVID-19 is a new variant of SARS-COV-2 which can lead to mild to severe infection in humans. Despite the remarkable efforts to contain the epidemic, the virus spread rapidly around the world and its prevalence continued with different degrees of clinical symptoms in many countries. Although common strategies including prevention, diagnosis, and care are necessary to curb this epidemic, early and accurate diagnosis can play an important role in reducing the speed of the epidemic. In this regard, the use of technologies based on artificial intelligence can be of great help. For this reason, since the outbreak of COVID-19, many researchers have tried to use machine learning techniques as a subset of artificial intelligence for the early diagnosis of COVID-19. Considering the importance and role of using clinical and laboratory data in the diagnosis of people with covid-19, in this paper K-NN, SVM, decision tree, random forest, Naive Bayes, neural network and XGBoost models are the most common machine learning models, and a dataset containing 1354 records consisting of clinical and laboratory data of patients in Imam Hossein Hospital in Tehran has been used to diagnose patients with covid-19. The results of this research indicate that based on the evaluation criteria, XGBoost and K-NN models have the most accuracy among the mentioned models and can be considered suitable predictive models for the diagnosis of COVID-19.