DEEP LEARNING-BASED EARLY DETECTION OF DIABETIC RETINOPATHY USING FUNDUS IMAGES

DR. T. KUMARESAN; RAMYA.  R; DR.S.ZULAIKHA BEEVI; MALINI. M; DINESHBABU.  V; K. SARANYA

Authors

DR. T. KUMARESAN PROFESSOR, DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING KGISL INSTITUTE OF TECHNOLOGY, COIMBATORE
RAMYA. R ASSOCIATE PROFESSOR, DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING, BANNARI AMMAN INSTITUTE OF TECHNOLOGY, SATHYAMANGALAM-638401, ERODE.
DR.S.ZULAIKHA BEEVI PROFESSOR, DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE VEL TECH HIGH TECH DR.RANGARAJANDR.SAKUNTHALA ENGINEERING COLLEGE AVADI, CHENNAI-600062, TAMIL NADU, INDIA
MALINI. M ASSOCIATE PROFESSOR AND HEAD, DEPARTMENT OF COMPUTER SCIENCE AND BUSINESS SYSTEMS AKSHAYA COLLEGE OF ENGINEERING AND TECHNOLOGY, COIMBATORE TAMILNADU
DINESHBABU. V ASSISTANT PROFESSOR, DEPARTMENT OF INFORMATION TECHNOLOGY, KARPAGAM INSTITUTE OF TECHNOLOGY, COIMBATORE, TAMILNADU
K. SARANYA ASSOCIATE PROFESSOR, DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING, BANNARI AMMAN INSTITUTE OF TECHNOLOGY, SATHYAMANGALAM, ERODE, TAMILNADU, INDIA

Keywords:

Diabetic Retinopathy (DR), Deep Learning, EfficientNetV2, Vision Transformer (ViT), Grad-CAM, Automated Screening.

Abstract

This paper proposes a deep learning model that together with high-resolution fundus images can be used to detect diabetic retinopathy (DR) at an early stage. The efficiency of the proposed architecture is to utilize EfficientNetv2 to extract representation in a usefully robust fashion and a Vision Transformer (ViT) to incorporate long-range spatial contextual information to improve lesion localization and classification performance. Data augmentation and gradually resizing images during the training process in the model helps to fix the class imbalance to enhance generalization. Besides, it is combined with Grad-CAM to enable interpretability by visual recognition of areas affecting the predictions. It has been trained and tested on more than 100 000 fundus images gathered on EyePACS and Messidor-2. It attains a high classification performance (AUC-ROC, 96.3%, sensitivity, 94.1%, specificity, 92.8%) impressive than the standard CNN epitomes like ResNet 50 and DenseNet 121. Qualitative Grad-CAM heatmaps validate the value of the model to identify clinically relevant areas that validates its capability of applications in transparent and scalable DR screening in real-world low-resource scenarios.

DEEP LEARNING-BASED EARLY DETECTION OF DIABETIC RETINOPATHY USING FUNDUS IMAGES

Authors

Keywords:

Abstract

Downloads

How to Cite

Issue

Section

License