首页期刊导航|虚拟现实与智能硬件(中英文)
期刊信息/Journal information
虚拟现实与智能硬件(中英文)
虚拟现实与智能硬件(中英文)
虚拟现实与智能硬件(中英文)/Journal Virtual Reality&Intelligent Hardware
正式出版
收录年代

    A review of medical ocular image segmentation

    Lai WEIMenghan HU
    181-202页
    查看更多>>摘要:Deep learning has been extensively applied to medical image segmentation,resulting in significant advancements in the field of deep neural networks for medical image segmentation since the notable success of U-Net in 2015.However,the application of deep learning models to ocular medical image segmentation poses unique challenges,especially compared to other body parts,due to the complexity,small size,and blurriness of such images,coupled with the scarcity of data.This article aims to provide a comprehensive review of medical image segmentation from two perspectives:the development of deep network structures and the application of segmentation in ocular imaging.Initially,the article introduces an overview of medical imaging,data processing,and performance evaluation metrics.Subsequently,it analyzes recent developments in U-Net-based network structures.Finally,for the segmentation of ocular medical images,the application of deep learning is reviewed and categorized by the type of ocular tissue.

    ARGA-Unet:Advanced U-net segmentation model using residual grouped convolution and attention mechanism for brain tumor MRI image segmentation

    Siyi XUNYan ZHANGSixu DUANMingwei WANG...
    203-216页
    查看更多>>摘要:Background Magnetic resonance imaging(MRI)has played an important role in the rapid growth of medical imaging diagnostic technology,especially in the diagnosis and treatment of brain tumors owing to its non-invasive characteristics and superior soft tissue contrast.However,brain tumors are characterized by high non-uniformity and non-obvious boundaries in MRI images because of their invasive and highly heterogeneous nature.In addition,the labeling of tumor areas is time-consuming and laborious.Methods To address these issues,this study uses a residual grouped convolution module,convolutional block attention module,and bilinear interpolation upsampling method to improve the classical segmentation network U-net.The influence of network normalization,loss function,and network depth on segmentation performance is further considered.Results In the experiments,the Dice score of the proposed segmentation model reached 97.581%,which is 12.438%higher than that of traditional U-net,demonstrating the effective segmentation of MRI brain tumor images.Conclusions In conclusion,we use the improved U-net network to achieve a good segmentation effect of brain tumor MRI images.

    Intelligent diagnosis of atrial septal defect in children using echocardiography with deep learning

    Yiman LIUSize HOUXiaoxiang HANTongtong LIANG...
    217-225页
    查看更多>>摘要:Background Atrial septal defect(ASD)is one of the most common congenital heart diseases.The diagnosis of ASD via transthoracic echocardiography is subjective and time-consuming.Methods The objective of this study was to evaluate the feasibility and accuracy of automatic detection of ASD in children based on color Doppler echocardiographic static images using end-to-end convolutional neural networks.The proposed depthwise separable convolution model identifies ASDs with static color Doppler images in a standard view.Among the standard views,we selected two echocardiographic views,i.e.,the subcostal sagittal view of the atrium septum and the low parasternal four-chamber view.The developed ASD detection system was validated using a training set consisting of 396 echocardiographic images corresponding to 198 cases.Additionally,an independent test dataset of 112 images corresponding to 56 cases was used,including 101 cases with ASDs and 153 cases with normal hearts.Results The average area under the receiver operating characteristic curve,recall,precision,specificity,F1-score,and accuracy of the proposed ASD detection model were 91.99,80.00,82.22,87.50,79.57,and 83.04,respectively.Conclusions The proposed model can accurately and automatically identify ASD,providing a strong foundation for the intelligent diagnosis of congenital heart diseases.

    Combining machine and deep transfer learning for mediastinal lymph node evaluation in patients with lung cancer

    Hui XIEJianfang ZHANGLijuan DINGTao TAN...
    226-238页
    查看更多>>摘要:Background The prognosis and survival of patients with lung cancer are likely to deteriorate with metastasis.Using deep-learning in the detection of lymph node metastasis can facilitate the noninvasive calculation of the likelihood of such metastasis,thereby providing clinicians with crucial information to enhance diagnostic precision and ultimately improve patient survival and prognosis.Methods In total,623 eligible patients were recruited from two medical institutions.Seven deep learning models,namely Alex,GoogLeNet,Resnet18,Resnet101,Vgg16,Vgg19,and MobileNetv3(small),were utilized to extract deep image histological features.The dimensionality of the extracted features was then reduced using the Spearman correlation coefficient(r≥0.9)and Least Absolute Shrinkage and Selection Operator.Eleven machine learning methods,namely Support Vector Machine,K-nearest neighbor,Random Forest,Extra Trees,XGBoost,LightGBM,Naive Bayes,AdaBoost,Gradient Boosting Decision Tree,Linear Regression,and Multilayer Perceptron,were employed to construct classification prediction models for the filtered final features.The diagnostic performances of the models were assessed using various metrics,including accuracy,area under the receiver operating characteristic curve,sensitivity,specificity,positive predictive value,and negative predictive value.Calibration and decision-curve analyses were also performed.Results The present study demonstrated that using deep radiomic features extracted from Vgg16,in conjunction with a prediction model constructed via a linear regression algorithm,effectively distinguished the status of mediastinal lymph nodes in patients with lung cancer.The performance of the model was evaluated based on various metrics,including accuracy,area under the receiver operating characteristic curve,sensitivity,specificity,positive predictive value,and negative predictive value,which yielded values of 0.808,0.834,0.851,0.745,0.829,and 0.776,respectively.The validation set of the model was assessed using clinical decision curves,calibration curves,and confusion matrices,which collectively demonstrated the model's stability and accuracy.Conclusion In this study,information on the deep radiomics of Vgg16 was obtained from computed tomography images,and the linear regression method was able to accurately diagnose mediastinal lymph node metastases in patients with lung cancer.

    Automatic detection of breast lesions in automated 3D breast ultrasound with cross-organ transfer learning

    Lingyun BAOZhengrui HUANGZehui LINYue SUN...
    239-251页
    查看更多>>摘要:Background Deep convolutional neural networks have garnered considerable attention in numerous machine learning applications,particularly in visual recognition tasks such as image and video analyses.There is a growing interest in applying this technology to diverse applications in medical image analysis.Automated three-dimensional Breast Ultrasound is a vital tool for detecting breast cancer,and computer-assisted diagnosis software,developed based on deep learning,can effectively assist radiologists in diagnosis.However,the network model is prone to overfitting during training,owing to challenges such as insufficient training data.This study attempts to solve the problem caused by small datasets and improve model detection performance.Methods We propose a breast cancer detection framework based on deep learning(a transfer learning method based on cross-organ cancer detection)and a contrastive learning method based on breast imaging reporting and data systems(BI-RADS).Results When using cross organ transfer learning and BIRADS based contrastive learning,the average sensitivity of the model increased by a maximum of 16.05%.Conclusion Our experiments have demonstrated that the parameters and experiences of cross-organ cancer detection can be mutually referenced,and contrastive learning method based on BI-RADS can improve the detection performance of the model.

    Face animation based on multiple sources and perspective alignment

    Yuanzong MEIWenyi WANGXi LIUWei YONG...
    252-266页
    查看更多>>摘要:Background Face image animation generates a synthetic human face video that harmoniously integrates the identity derived from the source image and facial motion obtained from the driving video.This technology could be beneficial in multiple medical fields,such as diagnosis and privacy protection.Previous studies on face animation often relied on a single source image to generate an output video.With a significant pose difference between the source image and the driving frame,the quality of the generated video is likely to be suboptimal because the source image may not provide sufficient features for the warped feature map.Methods In this study,we propose a novel face-animation scheme based on multiple sources and perspective alignment to address these issues.We first introduce a multiple-source sampling and selection module to screen the optimal source image set from the provided driving video.We then propose an inter-frame interpolation and alignment module to further eliminate the misalignment between the selected source image and the driving frame.Conclusions The proposed method exhibits superior performance in terms of objective metrics and visual quality in large-angle animation scenes compared to other state-of-the-art face animation methods.It indicates the effectiveness of the proposed method in addressing the distortion issues in large-angle animation.