基于机器学习的服装生产线员工效率预测

Employee efficiency prediction of garment production line based on machine learning

扫码查看

原文链接

维普
万方数据

中文摘要：在服装生产线中,管理者通常凭借直觉和经验进行工人调度和工序编排,缺少基于历史生产相关数据的分析,难以进行产前预判.为此,充分利用历史生产数据,使用机器学习技术科学地预判工人产前效率,以提高生产线的平衡率.首先,收集了某工厂13个订单的526个生产数据并通过分位数划分法对效率进行等级划分.其次,基于生产数据的特征,在员工生产效率预测任务中选择了随机森林集成学习模型,并与其它8个模型进行了综合比较.最后,通过递归式特征消除法,从15个初始特征中筛选出实现模型最大预测性能的最优特征组以优化模型.优化后结果显示,随机森林模型展现出优异的预测性能,在回归任务中,验证集R2值为0.836,而均方根误差值为0.116;在分类任务中,其验证集平衡F分数值为0.823.研究结果表明,使用随机森林模型可以实现产前工人效率的有效预测,预测结果可避免管理者在调度时做出错误决策,同时为生产线的优化和柔性调度提供参考.

外文摘要：Objective The significant impact of variations in employee productivity on the balance of apparel production lines has prompted the need for a solution to address the shortfall in achieving targeted productivity levels under manually scheduled operations lacking historical data analysis support.This research aims to utilize machine learning models to predict actual employee efficiency,providing management with valuable insights for goal setting and decision-making to enhance production profitability and prevent erroneous decisions to some extent.Method In order to achieve efficiency prediction,this research conducted on-site surveys at factory A,gathering 526 historical production records from 13 orders.Through feature engineering,15 initial prediction datasets were constructed,and efficiency levels were categorized using quantile division.Subsequently,considering the production data characteristics,RandomForest regression and classification models were selected for efficiency prediction.In order to validate the predictive performance of the model,it was compared with eight other models.Pearson and Spearman correlation coefficient analyses were performed to investigate the impact of variables on the model predictions.Finally,recursive feature elimination was employed to optimize the model by selecting the optimal feature subset from the initial feature set for maximum predictive performance.Results Using a random split function,20％of the prediction dataset was set aside for validation,while the remaining 80％was divided into training and testing sets for ten-fold cross-validation.R2 and RMSE were chosen as regression metrics,and F1 score was selected as the classification metric.The RandomForest regression model demonstrated the optimal predictive performance,showing the smallest range of fit and root mean square error in ten-fold cross-validation,with a fitting goodness value of 0.826 and an RMSE value of 0.126.In the classification task,the random forest model exhibited higher predictive performance compared to most models,with a balanced F1 score of 0.809 in the validation set,slightly lower than the gradient boosting classification model.Prior to model optimization,correlation coefficient and feature importance analyses revealed the crucial role of the auxiliary variable"annual efficiency"in predictions.Based on variable analysis,recursive feature elimination was employed to select the optimal feature parameter set for both the RandomForest regression and classification models.In the regression task,the RandomForest model achieved the optimal parameter combination with eight features,yielding a validation set R2value of 0.836.In the classification task,the growth curve of the random forest model's predictive performance was relatively gradual,using nine features to form the optimal parameter combination,resultingin a validation F1 score of 0.823.In the optimization results,setting the threshold for the difference between RandomForestRegressor predictions and actual results to 30％identified only three outliers,accounting for 3.16％of the data.For the RandomForestClassifier model,the classification results indicated a very low recall rate for sample 3,contributing to the relatively lower F1 score.Conclusion Through comparative experiments on predictive performance,the RandomForest model was selected as the optimal optimization model.Recursive feature elimination was chosen for model optimization based on the analysis of variable impacts on efficiency prediction.The results demonstrate that machine learning can accurately predict employee efficiency.Due to limitations imposed by the experimental factory,parameter collection was restricted.Future efficiency prediction research could consider adding more feature parameters to enhance model generalization.Additionally,considering the influence of time series,recurrent neural networks(RNNs)could be employed for modeling production efficiency prediction.In the future,we will continue to optimize this predictive model and apply it to the scheduling and arrangement of actual apparel assembly line workers.

外文关键词：

garment production datamachine learningprenatal efficiencyrecursive feature eliminationflexible scheduling

作者：

鞠宇、王朝晖、李博一、叶勤文

展开 >

作者单位：

东华大学服装与艺术设计学院,上海 200051

现代服装设计与技术教育部重点实验室,上海 200051

上海市纺织智能制造与工程一带一路国际联合实验室,上海 200051

关键词：

服装生产数据机器学习产前效率预测递归式特征消除柔性调度

基金：

上海市科委"科技创新行动计划一带一路"国际合作项目

项目编号：

21130750100

出版年：

2024

DOI：

10.13475/j.fzxb.20230601001

纺织学报

中国纺织工程学会

纺织学报

CSTPCD北大核心

影响因子：0.699

ISSN：0253-9721

年,卷(期)：2024.45(5)