Optic disc and cup segmentation with combined residual context encoding and path augmentation
Objective Ophthalmic image segmentation is an important part of medical image analysis.Among these,optic disc(OD)and optic cup(OC)segmentation are crucial technologies for the intelligent diagnosis of glaucoma,which can cause irreversible damage to the eyes and is the second leading cause of blindness worldwide.The primary glaucoma screening method is the evaluation of OD and OC based on fundus images.The cup disc ratio(CDR)is one of the most rep-resentative glaucoma detection features.In general,eyes with CDR greater than 0.65 are considered to have glaucoma.With the continuous development of deep learning,U-Net and its variant models,including superpixel classification and edge segmentation,have been widely used in OD and OC segmentation tasks.However,the segmentation accuracy of OD and OC is limited,and their efficiency is low during training due to the loss of spatial information caused by continuous con-volution and pooling operations.To improve the accuracy and training efficiency of OD and OC segmentation,we proposed the residual context path augmentation U-Net(RCPA-Net),which can capture deeper semantic feature information and solve the problem of unclear edge localization.Method RCPA-Net includes three modules:feature coding module(FCM),residual atrous convolution(RAC)module,and path augmentation module(PAM).First,the FCM block adopts the ResNet34 network as the backbone network.By introducing the residual module and attention mechanism,the model is enabled to focus on the region of interest,and the efficient channel attention(ECA)is adopted to the squeeze and excita-tion(SE)module.The ECA module is an efficient channel attention module that avoids dimensionality reduction and cap-tures cross-channel features effectively.Second,the RAC block is used to obtain the context feature information of a wider layer.Inspired by Inception-V4 and context encoder network(CE-Net),we fuse cavity convolution into the inception series network and stack convolution blocks.Traditional convolution is replaced with cavity convolution,such that the receptive field increases while the number of parameters remains the same.Finally,to shorten the information path between the low-level and top-level features,the PAM block uses an accurate low-level positioning signal and lateral connection to enhance the entire feature hierarchy.To solve the problem of extremely unbalanced pixels and generate the final segmentation map,we propose a new multi-label loss function based on the dice coefficient and focal loss.This function improves the pixel ratio between the OD/OC and background regions.In addition,we enhance the training data by flipping the image and adjusting the ratio of length and width.Then,the input images are processed using the contrast-limited adaptive histogram equalization method,and each resultant image is fused with its original one and then averaged to form a new three-channel image.This step aims to enhance image contrast and enrich image information.In the experimental stage,we use Adam optimization instead of the stochastic gradient descent method to optimize the model.The number of samples selected for each training stage is eight,and the weight decay is 0.000 1.During training,the learning rate is adjusted adaptively in accordance with the number of samples selected each time.In outputting the prediction results,the maximum connected region in OD and OC is selected to obtain the final segmentation result.Result Four datasets(ORIGA,Drishti-GS1,Ref-uge,and RIM-ONE-R1)are employed to validate the performance of the proposed method.Then,the results are compared with various state-of-the-art methods,including U-Net,M-Net,and CE-Net.The ORIGA dataset contains 650 color fun-dus images of 3 072 × 2 048 pixels,and the ratio of the training set to the test set is 1∶1 during the experiment.The Drishti-GS1 dataset contains 101 images,including 31 normal images and 70 diseased images.The fundus images are divided into two datasets,Groups A and B,which include 50 training samples and 51 testing samples,respectively.The 400 fundus images in the Refuge dataset are also divided into two datasets.Group A includes 320 training samples,while Group B includes 80 testing samples.The Jaccard index and F-measure score are used in the experimentation to evaluate the results of OD and OC segmentation.The results indicate that in the ORIGA dataset,the Jaccard index and F-measure of the proposed method in OD/OC segmentation are 0.939 1/0.794 8 and 0.968 6/0.885 5,respectively.In the Drishti-GS1 dataset,the results in OD/OC segmentation are 0.951 3/0.863 3 and 0.975 0/0.926 6,respectively.In the Refuge dataset,the results are 0.929 8/0.828 8 and 0.963 6/0.906 3,respectively.In the RIM-ONE-R1 dataset,the results of OD segmentation are 0.929 0 and 0.962 8.The results of the proposed method on the four datasets are all better than those of its counterparts,and the performance of the network is significantly improved.In addition,we conduct ablation experi-ments for the primary modules proposed in the network,where we perform comparative experiments with respect to the loca-tion of the modules,the parameters in the model,and other factors.The results of the ablation experiments demonstrate the effectiveness of each proposed module in RCPA-Net.Conclusion In this study,we propose RCPA-Net,which com-bines the advantages of deep segmentation models.The images predicted using RCPA-Net are closer to the real results,providing more accurate segmentation of OD and OC than several state-of-the-art methods.The experimentation demon-strates the high effectiveness and generalization ability of RCPA-Net.
optic disc and optic cup segmentationdeep learningattention mechanismresidual atrous convolutionpath augmentation