Research on the application of machine learning related techniques in the classification of Astragali Radix characterized by flavonoids
Objective:To establish a three classification model for cultivated,semi-wild,and wild Astragali Radix characterized by flavonoids,and explore and evaluate the application of techniques of automated machine learning and data augmentation in the field of drug analysis.Methods:Firstly,correlation analysis and principal component analysis were conducted on the flavonoid content data of Astragali Radix,and models of decision tree and logistic regression were established to analyze the importance of flavonoid components based on the models.Then,using the AutoGluon framework with 5 as num_bag_folds,2 sets of 30 models respectively through 64 batches of real data and 600 batches of virtual data generated based on real data with the TVAE table data gen-eration algorithm for training were obtained,and these models were evaluated by accuracy.Results:The analysis of machine learning models,indicated that formononetin,campanulin and onospin played the important roles in the quality control of Astragali Radix,especially for the source grade control.The accuracy of model prediction showed that the models based on Neural Net and tree-model always had the best classification effect for Astragali Radix.The virtual data generated by data augmentation technique is basically consistent with the actual data in terms of the accuracy trend of the model training process.Conclusion:Related techniques of machine learning have good application value in the classification of Astragali Radix characterized by flavonoids.