
A virtual sample generation algorithm supporting machine learning with a small-sample dataset: A case study for rubber materials

Machine learning (ML) is widely used in the field of materials informatics. However, the limited size of available datasets is a key bottleneck when using machine learning methods to predict material properties or to reverse-design high-performance materials. To address this lack of training samples, we propose a virtual sample generation algorithm based on a Gaussian mixture model (GMM-VSG). The core idea of the algorithm is to generate virtual samples by fitting the distribution of the original samples. To verify the performance of the GMM-VSG algorithm, we used an open rubber composite dataset (24 samples) to build a machine learning model that predicts the wear resistance of rubber materials from their mechanical properties. The results show that, after applying our algorithm, the R² of the prediction model reached 0.95 and the prediction accuracy increased by 41%. This shows that the proposed algorithm can effectively improve the prediction accuracy of data-driven models trained on small samples.
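
The core idea stated in the abstract, fitting the distribution of the original samples and then drawing virtual samples from it, can be illustrated with a minimal sketch. This is not the authors' GMM-VSG implementation: it assumes diagonal covariances, a fixed number of components, a plain EM loop, and toy two-column data standing in for the 24 rubber samples. The function names `fit_diag_gmm` and `sample_virtual` are hypothetical, and each row is assumed to hold features and target together so that virtual rows are drawn from their joint distribution. Only the Python standard library is used to keep the sketch dependency-free; a library implementation such as scikit-learn's `GaussianMixture` would likely be more robust in practice.

```python
import math
import random


def fit_diag_gmm(data, k, iters=50, seed=0, var_floor=1e-6):
    """Fit a k-component Gaussian mixture with diagonal covariances via EM."""
    rng = random.Random(seed)
    n, d = len(data), len(data[0])
    # Initialise means at k distinct original samples, variances at the global variance.
    means = [list(x) for x in rng.sample(data, k)]
    gmean = [sum(x[j] for x in data) / n for j in range(d)]
    gvar = [max(sum((x[j] - gmean[j]) ** 2 for x in data) / n, var_floor)
            for j in range(d)]
    variances = [list(gvar) for _ in range(k)]
    weights = [1.0 / k] * k

    for _ in range(iters):
        # E-step: responsibilities of each component for each sample (log space).
        resp = []
        for x in data:
            logp = []
            for c in range(k):
                lp = math.log(weights[c])
                for j in range(d):
                    v = variances[c][j]
                    lp -= 0.5 * (math.log(2 * math.pi * v)
                                 + (x[j] - means[c][j]) ** 2 / v)
                logp.append(lp)
            top = max(logp)
            p = [math.exp(l - top) for l in logp]
            total = sum(p)
            resp.append([pc / total for pc in p])

        # M-step: re-estimate weights, means and variances from responsibilities.
        for c in range(k):
            nc = max(sum(r[c] for r in resp), 1e-12)
            weights[c] = nc / n
            for j in range(d):
                mu = sum(r[c] * x[j] for r, x in zip(resp, data)) / nc
                var = sum(r[c] * (x[j] - mu) ** 2 for r, x in zip(resp, data)) / nc
                means[c][j] = mu
                variances[c][j] = max(var, var_floor)
    return weights, means, variances


def sample_virtual(weights, means, variances, n_virtual, seed=1):
    """Draw virtual samples: pick a component by weight, then sample each feature."""
    rng = random.Random(seed)
    virtual = []
    for _ in range(n_virtual):
        c = rng.choices(range(len(weights)), weights=weights)[0]
        virtual.append([rng.gauss(m, math.sqrt(v))
                        for m, v in zip(means[c], variances[c])])
    return virtual


# Toy stand-in for a 24-sample dataset (hypothetical values, not the rubber data).
toy_rng = random.Random(42)
original = ([[toy_rng.gauss(1.0, 0.2), toy_rng.gauss(5.0, 0.5)] for _ in range(12)]
            + [[toy_rng.gauss(3.0, 0.3), toy_rng.gauss(2.0, 0.4)] for _ in range(12)])

weights, means, variances = fit_diag_gmm(original, k=2)
virtual = sample_virtual(weights, means, variances, n_virtual=100)
augmented = original + virtual  # 24 original rows plus 100 virtual rows
```

The augmented set would then be used to train the downstream regressor, while evaluation should still rely on held-out original samples only, since virtual samples inherit any bias in the fitted mixture. The number of components and the number of virtual samples are free choices in this sketch; how the paper selects them is not stated in the abstract.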

Small sample machine learning; Gaussian mixture model; Virtual sample generation; MODEL

Shen, Lijun; Qian, Quan

Shanghai Univ

2022

Computational Materials Science

EI; SCI
ISSN: 0927-0256
Year, Volume (Issue): 2022, 211