Objective: To establish a prediction model for the identifying of cataplexy facial features based on clinical shooting videos by using a deep learning image recognition network ResNet-18. Methods: A cross-sectional study. Twenty-five narcolepsy type 1 patients who were first diagnosed and never received treatment and 25 healthy controls recruited by advertisement in the Second Affiliated Hospital of Nanchang University from 2020 to 2023.After image preprocessing, a total of 1 180 images were obtained, including 583 cataplexy faces and 597 normal faces.90% were selected as the training set and validation set, and then expanded the data by 5 times.80% of the expanded data set was extracted as the training set and 20% as the validation set, that is, the number of the training set was (583+597)×0.9×0.8×5=4 248, the number of the validation set was (583+597)×0.9×0.2×5=1 062. The data sets for training and validation were used train parameters to establish the model and were trained through the five-fold cross-validation method, to establish the ResNet-18 cataplexy face recognition model via transfer learning.10% (118 images) of the original non-amplified images were extracted as the test set. The test set data did not participate in data enhancement and model training, and was only used to evaluate the final performance of the model. Finally, ResNet-18 was compared with VGG-16, ResNet-34 and Inception V3 deep learning models, and the receiver operating characteristic curve was used to evaluate the value of ResNet-18 image recognition network in cataplexy face recognition. Results: Among 25 patients with narcolepsy type 1, 15 were males and 10 were females, aged [M (Q1, Q3)] of 14.0(11.0, 20.5) years.Among 25 healthy controls, 14 were males and 11 were females, with a median age of 16.0(14.4, 23.0) years.The overall accuracy of ResNet-18 image recognition network in the test set was 90.9%, the sensitivity was 96.4% and the specificity was 85.2%. The area under the ROC curve was 0.99(95%CI:0.96-1.00). The ResNet-18 model parameter amount was 11.69 M, the floating point operation amount was 1 824.03 M, and the single image recognition time was 5.9 ms. Conclusions: The cataplexy face prediction model built based on the deep learning image recognition network ResNet-18 has a high accuracy in identifying cataplexy faces.
目的: 应用深度学习图像识别网络ResNet-18,基于临床拍摄视频,建立猝倒面容预测模型。 方法: 本研究为横断面研究,收集2020至2023年在南昌大学第二附属医院首诊未经治疗的1型发作性睡病患者25例及健康对照25名,采集的图像预处理后,共获得1 180张图片,其中583张猝倒面容,597张正常面容。从中抽取90%作为训练集与验证集,随后数据扩增5倍,扩充后的数据集抽取80%作为训练集,20%作为验证集,即训练集数量为(583+597)×0.9×0.8×5=4 248,验证集数量为(583+597)×0.9×0.2×5=1 062,训练集与验证集用于训练参数建立模型,并通过五折交叉验证法进行训练,构建采用迁移学习方式的ResNet-18猝倒面容识别模型。原未扩增前图像抽取10%(118张)作为测试集,测试集数据不参与数据增强和模型训练,仅用于测试模型最终效果。最后将ResNet-18与VGG-16、ResNet-34和Inception V3深度学习模型进行比较,用受试者工作特征曲线评估ResNet-18图像识别网络在猝倒面容识别中的价值。 结果: 25例1型发作性睡病患者中,男15例,女10例,年龄[M(Q1,Q3)]为14.0(11.0,20.5)岁;25名健康对照者中,男14名,女11名,年龄16.0(14.4,23.0)岁。ResNet-18图像识别网络在测试集中的总体准确率为90.9%,灵敏度为96.4%,特异度为85.2%,受试者工作特征曲线下面积为0.99(95%CI:0.96~1.00)。ResNet-18模型参数量为11.69 M,浮点运算量为1 824.03 M,单张图片识别时间为5.9 ms。 结论: 基于深度学习图像识别网络ResNet-18构建的猝倒面容预测模型在猝倒面容的识别上有较高的准确率。.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.3