Chinese Journal of Hospital Statistics (中国医院统计), 2024, Vol. 31, Issue 1: 61-66, 71. DOI: 10.3969/j.issn.1006-5253.2024.01.012

Research on simulated case teaching in the course of Medical Multiple Statistical Methods

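Wang Yaru (王亚茹) 1, Wang Jiu (王玖) 1, Liu Haixia (刘海霞) 1, Hou Jing (侯静) 1, Zhang Li (张莉) 1, Li Zhi (李智) 1, Sun Hongwei (孙红卫) 1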
Author information

  • 1. School of Public Health, Binzhou Medical University, Yantai, Shandong 264003, China

Abstract

Objective Multivariate statistical concepts are complex, and students often find them difficult to understand. This article uses simulation experiments to examine how the characteristics of the data affect analysis results, in order to strengthen students' understanding of the principles of multivariate statistics and help them apply multivariate statistical methods correctly.

Methods Six simulation experiments were designed. R 4.2.2 was used to generate the simulated data and carry out the statistical analyses, exploring the influence of data characteristics on six statistical analysis methods.

Results The simulation results show the following. In multiple linear regression, multicollinearity among the independent variables makes the estimated regression coefficients inaccurate. In principal component analysis and factor analysis, the correlation structure among the variables affects the dimensionality reduction. In logistic regression, nonlinear effects of the independent variables on the dependent variable must be taken into account; otherwise the accuracy of the estimates suffers. In Cox regression, the results are substantially biased when the censoring proportion is too large, especially once it reaches 60%. The correlation structure among the variables affects the results of canonical correlation analysis. When the class proportions differ greatly, Bayes discriminant analysis and logistic regression are the more suitable methods for discrimination, and Bayes discriminant analysis is the more flexible of the two because it incorporates prior probabilities.

Conclusion Applying multivariate statistical methods requires full consideration of their conditions of use, including the characteristics of the data. With the simulation approach, students are encouraged to run the simulation experiments themselves and so gain an intuitive grasp of abstract multivariate concepts, which helps them understand and apply multivariate statistical methods.
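The multicollinearity finding reported in the Results can be illustrated with a small simulation. The sketch below is not the authors' experiment: the paper used R 4.2.2, whereas this uses only the Python standard library, and the sample size, number of replicates, true coefficients (1, 2, 3), correlation values (0 and 0.95), and function names are all hypothetical choices made for illustration. It repeatedly generates two predictors with a chosen correlation, fits a two-predictor least-squares model, and compares how much the estimated coefficient of the first predictor varies across replicates.

```python
import math
import random
import statistics


def fit_two_predictor_ols(x1, x2, y):
    """Return the OLS slopes (b1, b2) by solving the centered normal equations."""
    m1, m2, my = statistics.fmean(x1), statistics.fmean(x2), statistics.fmean(y)
    s11 = sum((a - m1) ** 2 for a in x1)
    s22 = sum((b - m2) ** 2 for b in x2)
    s12 = sum((a - m1) * (b - m2) for a, b in zip(x1, x2))
    s1y = sum((a - m1) * (c - my) for a, c in zip(x1, y))
    s2y = sum((b - m2) * (c - my) for b, c in zip(x2, y))
    det = s11 * s22 - s12 ** 2  # shrinks toward 0 as x1 and x2 become collinear
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    return b1, b2


def slope_sd(rho, n=100, reps=500, seed=2024):
    """Empirical SD of the b1 estimate when corr(x1, x2) = rho (assumed setup)."""
    rng = random.Random(seed)
    b1_estimates = []
    for _rep in range(reps):
        x1, x2, y = [], [], []
        for _obs in range(n):
            z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
            a = z1
            b = rho * z1 + math.sqrt(1 - rho ** 2) * z2  # corr(a, b) = rho
            x1.append(a)
            x2.append(b)
            # Hypothetical true model: y = 1 + 2*x1 + 3*x2 + standard normal noise
            y.append(1.0 + 2.0 * a + 3.0 * b + rng.gauss(0, 1))
        b1, _b2 = fit_two_predictor_ols(x1, x2, y)
        b1_estimates.append(b1)
    return statistics.stdev(b1_estimates)


sd_low = slope_sd(rho=0.0)
sd_high = slope_sd(rho=0.95)
print(f"SD of b1 estimates, rho = 0.00: {sd_low:.3f}")
print(f"SD of b1 estimates, rho = 0.95: {sd_high:.3f}")
```

In theory the sampling variance of the slope scales with 1 / (1 - rho^2), so the spread at rho = 0.95 should be roughly three times the spread at rho = 0. Students can vary `rho`, `n`, and `reps` to see how the instability grows as the predictors approach collinearity.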

Key words

simulation experiment / multivariate statistics / concepts and principles / teaching

Funding

Shandong Province High-Quality Professional Degree Case Library Construction Project (SDYAL2022183)

Publication year

2024
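Journal: Chinese Journal of Hospital Statistics (中国医院统计)
Sponsors: Center for Health Statistics and Information, Ministry of Health; Binzhou Medical University
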
Impact factor: 0.564
ISSN: 1006-5253
Number of references: 18