GET THE APP

The Application of Artificial Intelligence Technique (CNN-Alexnet) in Diagnosing COVID-19 Using Chest X-ray Images

Journal of Research in Medical and Dental Science
eISSN No. 2347-2367 pISSN No. 2347-2545

All submissions of the EM system will be redirected to Online Manuscript Submission System. Authors are requested to submit articles directly to Online Manuscript Submission System of respective journal.

Research - (2021) Volume 9, Issue 5

The Application of Artificial Intelligence Technique (CNN-Alexnet) in Diagnosing COVID-19 Using Chest X-ray Images

Maryam Muhammed1, Moussa Mahamat Boukar1, Saleh Elyakubu Aldullahi1 and Senol Dane2*

*Correspondence: Senol Dane, Department of Physiology, Faculty of Basic Medical Sciences, College of Health Sciences, Nile University of Nigeria, Abuja, Nigeria, Email:

Author info »

Abstract

Background: The coronavirus which initially appeared in China in December 2019 was later declared global pandemic in the year 2020. It has caused a devastating effect on daily lives, public health, and the global economy. Early detection of positive cases is overly critical to prevent further spread of the pandemic and to quickly treat affected patients in isolation. Which is why introduction to fast and accurate alternative of diagnosing the virus is very vital. Methods: An AI technique called deep learning which is most applied to analyze visual imagery like radiological images, This AI technique uses convolutional neural networks (CNN) to analyze the images, AlexNet is the CNN model used for this research. Several studies suggest that medical images contain salient information about the Covid-19 virus, which is why applying such advanced artificial intelligence (AI) techniques coupled with radiological imaging can be helpful for the accurate detection of this disease with a huge potential to address the problem of a limited to no specialized physicians in remote areas like Nigeria’s most vulnerable regions. Results: Initially, the model gave high accuracy of 97.97%, this was suspected to be overfitting. This was corrected by increasing the dataset and applying cross validation thereby reducing noise by giving a lower accuracy to 85% and also increasing its specificity. Conclusions: The aim of the study was to introduce an alternative way of diagnosing the Covid-19 asides from the PCR that is currently the most popular one, this has been archived by our working system and the waiting time has been reduced from 24-48hours to 58 minutes. Secondly, to identify a suitable model in Deep learning in medical science and to measure the performance and to access the effectiveness of the chosen model Alexnet in terms of accuracy, precision, recall &F1score. We archived this by striking a balance in the high percentile number of the following terms and reducing it to a more believable, reliable, and accurate figure.

Keywords

Artificial Intelligence, Deep learning, Convolutional neural networks, Pandemic

Introduction

The novel Covid-19 virus came to light after a cluster of patients with pneumonia of unknown cause was linked to a wet market in Wuhan city, Hubei Province, China. Since emerging in Wuhan, in December 2019, the coronavirus disease (Covid-19) epidemic caused by severe acute respiratory syndrome coronavirus 2 (SARSCoV- 2) has progressed rapidly into a pandemic [1,2]. The novel Covid-19 virus quickly spread around the world [3]. On 30th January 2020, the World Health Organization (WHO) declared the Chinese outbreak of Covid-19 to be a Public Health Emergency of International Concern posing a high risk to countries with vulnerable health systems. The virus adversely affects the health and welfare of the world's population, killing many people and impacting the economy of nations worldwide. The novel Covid-19 virus belongs to a large group of dangerous viruses [4]. The Covid-19 is classified as such disease due to its nature which can cause cold, like the SARS coronavirus (SARS-CoV). The disease is named Covid-19 and the virus is termed SARSCoV- 2. The infectious disease caused by this type of virus was named Covid-19 by the World Health Organization (WHO) on February 11, 2020 [5]. This new type of virus spread from Wuhan to much of China in just 30 days, which in no short time spread rapidly around the world and became a pandemic [6].

The most common symptoms of Covid-19 include fever, cough, headache, dyspnea, dizziness, muscle pain, and fatigue [7]. Hence, the spread of Covid-19 may be interrupted by early detection, isolation, prompt treatment, and the implementation of a robust contact tracing apps. Early detection of Covid-19 is crucially significant and vital in curbing the spread of the novel coronavirus.

Nowadays, applying artificial intelligence techniques such as deep learning algorithms for diagnosis in the medical field have gained popularity by becoming a supplementary tool for health practitioners [8-11]. Therefore, the deep learning techniques was adopted in the analysis of Chest X-rays for Covid-19 detection in this study. This application can be considered as the primary diagnostic tool in epicenters of the pandemic to reduce staff exposure to the virus and test result wait time [12].

In this study, we work with data consisting of Chest X-ray images because of its past application in the diagnosis for Covid-19 and disease progression evaluation in hospital admitted cases [13]. Hence, there are portable X-ray units widely available and can be easily accessed in most primary healthcare facilities, which makes X-ray imaging to be widely available including in remote areas. Chest X-rays can also be operated in more isolated rooms limiting staff exposure to the deadly virus. Sometimes, the patient’s clinical situation does not warrant a CT scan; therefore, Chest X-rays are a better choice for the initial assessment of infection cases.

Materials and Methods

Subjects

In the process of preparing this research, we have done an in-depth study of related works, literature reviews that are evaluated using a qualitative research method. The performance of the proposed methods is also evaluated with some metrics like, accuracy, precision, recall, and F1 score. It is difficult to access real patients records from the healthcare facilities. Thus, I have acquired some image data of patients from online repositories i.e., secondary data was used [14, 15]. The secondary data contains 3 categories of sample data: Normal chest Xray, Pneumonia X-ray samples and positive Covid-19 X-ray samples.

The dataset consists of 3800 total X-ray images; 1200 confirmed COVID-19 patient chest X-ray images (Figure 1), 1300 pneumonia patient chest X-ray images (Figure 2) and 1300 normal (healthy) people chest X-ray images (Figure 3).

medical-dental-science-COVID-19

Figure 1: A chest X-ray image of a patient with COVID-19.

medical-dental-science-patient-pneumonia

Figure 2: A chest X-ray image of a patient with pneumonia.

medical-dental-science-normal-chest

Figure 3: A normal chest X-ray image.

The dataset used in training the model will be organized into 3 folders (training, test data & cross validation) which contains subfolders for each image category (COVID-19 (positive)/ normal (negative) and pneumonia). The proposed model is trained with 70% of the data, testing will be done with 20% and cross validation 10%.

Convolutional neural network

Using un-supervised learning, the image classification model used during this project is the CNN model, this was selected due to its high recognition accuracy. Image classification in CNN works by training a dataset filled with labelled images and learns from them without human intervention, then responds back with an output. This is later tested to confirm if the model has learnt properly. Thus, why there is a training dataset and a test data set. We used pretrained network with transfer learning because it is much faster and easier than training from scratch as it requires minimum amount of data and computational resources. One of the major advantages of CNN and is its high accuracy in Object detection. It processes and classifies objects in images and since our data is mainly X-ray images, this model fits perfectly.

AlexNet

AlexNet is a Convolutional Neural Network that is 8 layers, it can be used to load a pretrained version of a network trained on large Image datasets up to a million from the ImageNet database. AlexNet can easily classify images into 1000 objects categories. The network takes an image as input, then outputs a label for the object in the image. AlexNet as a pre-trained model has gained a rich feature representation to classify variety of images ranging from various applications. The first five layers, the convolutional layers, from the pre-trained AlexNet CNN model are saved as fixed feature extractors, while the last three layers are the fully connected layers [16]. The network training options were set at 20 epoch(s). A complete pass of the algorithm over the entire training set is called an epoch. The mini-batch size was set as ‘default’. Mini-batch size is the subset of the training dataset used by the SGDM to update the network parameters. In contrast, the learning rate of training the model was set at 0.001 which basically signifies the rate of adjusting the weights of the network to the gradient.

Results

This section of our study presents the results that were obtained during the experiment that was conducted on building our COVID-19, pneumonia and normal patients’ diagnosis model using X-ray images as input parameters. We further evaluate the implementation of our models in detail, showing the effectiveness of our proposed framework.

The pre-trained AlexNet CNN model we employed for the purpose of transfer learning achieved 93.8% training accuracy with training loss value of 0.198 at 0.001 learning rate. Furthermore, we conducted this experiment several times using the same model with different training options and obtained the results discussed in this study.

Given such high accuracy figure, overfitting is suspected to have occurred, this can be due to many reasons one of which is suspected to be the system cramming previous solutions instead of learning from it. However, this problem is overcome by Cross-validation and Stratified cross validation. Initially we selected images at random for training from each dataset, we later changed using cross validation by selecting the images 1:1. Stratified cross validation was also applied to help break down the dataset into smaller fractions in other to avoid the sets from overlapping. These are all included to increase the efficiency of our model. The proposed model is then trained using 70% of the dataset for training, 20% for test and the remaining 10% for validation.

The initial prediction result was giving an accuracy of 96%. This accuracy level was way too high. One of 2 things was suspected to cause this high number, the first was that there was too much noise in the data hence the high result, second was the system was not actually learning but rather cramming results and since it was selecting data randomly it is possible it kept on selecting the same images.

This led to the introduction of cross validation. Instead of the images being selected at random, it selects images 1:1 to enable all the data images to be trained there by archiving the main goal of cross validation which is to test the model’s ability to predict new data in this case ‘Test data’ that was not used to train the model initially in order to flag problems like overfitting or bias selection and to give an insight on how the model will generalize an independent dataset.

The dataset was also increased from 1000 to 3800 to enable better results. The learning rate was reduced from 0.0001 to 0.000001. Increasing the iteration to 14 and each having 20 epochs. All these changes enabled us to estimate the accuracy of the model. finalizing the accuracy to 85% (Table 1).

Table 1. Performance of the proposed method.

Network Accuracy Precision Recall F1 Score
AlexNet 85.0% 86.3% 86.1% 86.2%

Discussion

Psychological stress due to COVID-19 pandemic can result in fear and worry among people about their health and financial conditions. It can also cause changes in eating patterns, sleeping problems, concentration difficulties, and exacerbation of chronic health problems, mental health conditions, usage of tobacco, alcohol, and other substances [17]. The current pandemic-related coping strategies may harm mental health, such as decreased well-being and increased depression and anxiety symptoms [18,19], insomnia, and anger [20-22]. Also, inactivity due to COVID-19 disease can have a negative effect on physical and mental health and coping with stress and anxiety during isolation time [23,24]. Besides, there were some negative lifestyle changes due to the COVID-19 pandemic [25]. Furthermore, the fairly big changes in food consumption preferences were reported [26]. Also, in another study, there was a significant decrease in family incomes and a significant increase in family expenditures during the pandemic outbreak [27]. Also, Nigerian women entrepreneurs experienced the negative effect of COVID-19 outbreak on their businesses [28]. Duration and severity of loss of smell and loss of taste and durations of fatigue and headache symptoms was higher in women than in men. These results can be attributed the gender related differences in depression [19,29] and anxiety [18,19]. In previous studies, the relationships of some environmental and hereditary factors such as gender, education, physical abnormalities, handedness, marital status, visual memory, and salivary testosterone with some psychologies including self-esteem, alexithymia, depression [29-36]. Besides, in a recent study, some COVID-19 symptoms were higher and more severe in women than in men [37].

This study details the metrics and measures used to evaluate the performance of the chosen model (AlexNet) in diagnosing covid19. We used the CNN model AlexNet for features extraction on image data to train for diagnosis of patients using X-ray images. The study indicates the effectiveness of using technological tools as an alternative to curbing the spread of Covid19 by conducting rapid diagnosis to identify people with the disease.

Using 3 types of data (Normal, Pneumonia, Covid19) Xray images, the AlexNet model was modified, and layers 23 & 25 were selected to solve the problem statement. The data was partitioned into 70% training, 20% testing & 10% validation. The initial prediction result was giving an accuracy of 96%. This accuracy level was way too high 2 things was suspected to cause this high number, the first was that there was too much noise in the data hence the high result, second was the system was not actually learning but rather cramming results. This led to the introduction of cross validation. Instead of the images being selected at random, it selects images 1:1 to flag problems like overfitting or bias selection and to give an insight on how the model will generalize an independent dataset. The dataset was also increased from 1000 to 3800 images to enable better results. The learning rate was reduced from 0.0001 to 0.000001. Increasing the iteration to 14 and each having 20 epochs. All these changes enabled us to estimate the accuracy of the model. finalizing the accuracy to 85%.

Conclusion

In this study, the main aim was to introduce an alternative way of diagnosing the Covid-19 asides from the PCR that is currently the most popular one, reason being the long waiting time for results. This has been archived by our working system and the waiting time has been reduced from 24-48hours to 58 minutes.

Secondly, to identify a suitable model in Deep learning in medical science and to measure the performance of the algorithm using a general guideline on evaluating such methods. We compared this model (Alex Net) to others and later modified it to work with our data in other to solve the problem statement which is diagnosing Covid19.

Lastly, was to access the effectiveness of the chosen model Alexnet in terms of accuracy, precision, recall &F1score. We archived this by striking a balance in the high percentile number of the following terms and reducing it to a more realistic and reliable figure.

References

Author Info

Maryam Muhammed1, Moussa Mahamat Boukar1, Saleh Elyakubu Aldullahi1 and Senol Dane2*

1Department of Computer Science, Faculty of Natural and Applied Sciences, Nile University of Nigeria, Abuja, Nigeria
2Department of Physiology, Faculty of Basic Medical Sciences, College of Health Sciences, Nile University of Nigeria, Abuja, Nigeria
 

Received: 03-Apr-2021 Accepted: 10-May-2021 Published: 17-May-2021

http://sacs17.amberton.edu/