Prediction of Wind Turbine Lubricating Oil's Acid Value by Ordinary Least Square Method Based on Attenuated Total Reflectance-Fourier Transform Infrared Spectroscopy Through Higher-Order Derivative Combined with Angular Metric
To address the key challenges in multivariate statistical modeling,a higher-order derivative approach combined with vector space angle multiplicative error correction was proposed for establishing an acid value prediction ordinary least squares(OLS)regression model based on attenuated total reflectance-Fourier transform infrared(ATR-FTIR)spectroscopy.By using acid values measured by potentiometric titration as reference,ATR-FTIR spectroscopy was utilized for direct calibration and prediction of acid values on 96 kinds of lubricating oil samples from a wind turbine.Firstly,the simulated hyperbolic(SH)method was employed to obtain accurate fourth derivative spectrum,resolving overlapping bands and enhancing spectral selectivity.Then,from the calibration set(48 samples),informative spectral regions were identified based on correlation coefficients.Next,the sample with the highest acid value was selected as the reference and1/(1+tan(θ/2))was used as the metric relation of the spectrum to suppress the multiplicity error caused by factors such as the change of effective optical path in ATR-FTIR spectroscopy.After pretreatment of the spectrum by the method of fourth-order derivative combined with angular quantity,the number of variables decreased from 1737 to 8,and the matrix condition number decreased from 1.85×1015 to 56.34,which effectively eliminated the collinearity issue for OLS regression.Direct OLS modeling on spectral preprocessed data achieved a determination coefficient of 0.981 for 47 validation samples,with a relative error range of-8.38%-8.22%,outperforming the commonly used partial least squares(PLS)method(Determination coefficient of 0.865,relative error of-27.82%-22.38%).It was proved that effective data preprocessing significantly improved the prediction accuracy of the model.Furthermore,when the number of calibration set was compressed to 25 and the number of validation set was expanded to 70,the model retained 8 variables with a condition number of 42.60,the determination coefficient of validation set was 0.972,and the relative error ranged from-10.80%to 12.31%.Comparing with the PLS method(Determination coefficient of 0.724,relative error of-34.26%-53.84%),the improvement was more obvious,which showed that the method could still have high prediction accuracy even with fewer modeling samples as well as robustness against multiplicative error interference.
Lubricating oilAcid valueAttenuated total reflectance-Fourier transform infrared spectroscopyHigher-order derivativeVector space angleCollinearityOrdinary least squares regression