OTONet: Deep Neural Network for Precise Otoscopy Image Classification
Document Type
Article
Publication Title
IEEE Access
Abstract
Otoscopy is a diagnostic procedure to visualize the external ear canal and eardrum, facilitating the detection of various ear pathologies and conditions. Timely otoscopy image classification offers significant advantages, including early detection, reduced patient anxiety, and personalized treatment plans. This paper introduces a novel OTONet framework specifically tailored for otoscopy image classification. It leverages octave 3D convolution and a combination of feature and region-focus modules to create an accurate and robust classification system capable of distinguishing between various otoscopic conditions. This architecture is designed to efficiently capture and process the spatial and feature information present in otoscopy images. Using a public otoscopy dataset, OTONet has reached a classification accuracy of 99.3% and an F1 score of 99.4% across 11 classes of ear conditions. A comparative analysis demonstrates that OTONet surpasses other established machine learning models, including ResNet50, ResNet50v2, VGG16, Dense-Net169, and ConvNeXtTiny, across various evaluation metrics. The research's contribution to improved diagnostic accuracy reduced human error, expedited diagnostics, and its potential for telemedicine applications.
First Page
7734
Last Page
7746
DOI
10.1109/ACCESS.2024.3351668
Publication Date
1-1-2024
Recommended Citation
Rao, Divya; Singh, Rohit; Kamath, Sudiksha Kottachery; and Pendekanti, Sanjeev Kushal, "OTONet: Deep Neural Network for Precise Otoscopy Image Classification" (2024). Open Access archive. 7330.
https://impressions.manipal.edu/open-access-archive/7330