Robotics & Machine Learning Daily News2024,Issue(Aug.1) :31-31.

Dongseo University Researcher Adds New Findings in the Area of Machine Learning (An Oversampling Technique with Descriptive Statistics)

Robotics & Machine Learning Daily News2024,Issue(Aug.1) :31-31.

Dongseo University Researcher Adds New Findings in the Area of Machine Learning (An Oversampling Technique with Descriptive Statistics)

扫码查看

Abstract

By a News Reporter-Staff News Editor at Robotics & Machine Learning DailyNews Daily News – A new study on artificial intelligence is now available. According to news reporting outof Dongseo University by News Rx editors, research stated, “Oversampling is often applied as a means towin a better knowledge model.”The news reporters obtained a quote from the research from Dongseo University: “ Several oversamplingmethods based on synthetic instances have been suggested, a nd SMOTE is one of the representativeoversampling methods that can generate syn thetic instances of a minor class. Until now, the oversampleddata has been used conventionally to train machine learning models without statistical analysis, s o itis not certain that the machine learning models will be fine for unseen cas es in the future. However,because such synthetic data is different from the ori ginal data, we may wonder how much it resemblesthe original data so that the ov ersampled data is worth using to train machine learning models. For thispurpose , I conducted this study on a representative dataset called wine data in the UCI machine learningrepository, which is one of the datasets that has been experim ented with by many researchers in researchfor knowledge discovery models. I gen erated synthetic data iteratively using SMOTE, and I comparedthe synthetic data with the original data of wine to see if it was statistically reliable using a box plotand t-test. Moreover, since training a machine learning model by supply ing more high-quality traininginstances increases the probability of obtaining a machine learning model with higher accuracy, it was alsochecked whether a bet ter machine learning model of random forests can be obtained by generating muchmore synthetic data than the original data and using it for training the random forests.”

Key words

Dongseo University/Cyborgs/Emerging Te chnologies/Machine Learning

引用本文复制引用

出版年

2024
Robotics & Machine Learning Daily News

Robotics & Machine Learning Daily News

ISSN:
段落导航相关论文