环境化学2024,Vol.43Issue(12) :4250-4264.DOI:10.7524/j.issn.0254-6108.2023062602

应用小样本机器学习方法定量解析大气水溶性类腐殖质氧化潜势的关键来源

Applying the few-shot learning method to quantify crucial sources of oxygen potential of water-soluble humic-like substances

洪一航 曹芳 范美益 薛永文 赵祝钰 吴继炎 章炎麟
环境化学2024,Vol.43Issue(12) :4250-4264.DOI:10.7524/j.issn.0254-6108.2023062602

应用小样本机器学习方法定量解析大气水溶性类腐殖质氧化潜势的关键来源

Applying the few-shot learning method to quantify crucial sources of oxygen potential of water-soluble humic-like substances

洪一航 1曹芳 2范美益 3薛永文 2赵祝钰 2吴继炎 2章炎麟2
扫码查看

作者信息

  • 1. 南京信息工程大学,教育部气候与环境变化国际合作联合实验室,大气环境中心,南京,210044
  • 2. 南京信息工程大学,教育部气候与环境变化国际合作联合实验室,大气环境中心,南京,210044;南京信息工程大学应用气象学院,南京,210044
  • 3. 南京信息工程大学,教育部气候与环境变化国际合作联合实验室,大气环境中心,南京,210044;南京信息工程大学应用气象学院,南京,210044;香港理工大学土木环境学系,空气质量研究室,中国香港,999077
  • 折叠

摘要

水溶性类腐殖质(water-soluble humic-like substances,HULISWS)是大气颗粒物中一类危害人体健康的有机物.氧化潜势(oxygen potential,OP)可以用于衡量颗粒物对细胞内分子的氧化能力,精确解析HULISWS的OP来源可以助力有害污染物的精准减排工作.本文于2016年秋、冬季节在东北平原逐日采集了大气细颗粒物(PM2.5)样品,并使用正定矩阵因子分解(positive-definite matrix factorization,PMF)模型量化了 PM2.5中HULISWS来源的变化特征.尽管此前很多研究解析了 HULISWS中OP的潜在来源,但传统源解析方法难以准确拟合OP与物质来源关系,导致其解析结果存在较大不确定性.随机森林(random forest,RF)是一种可以拟合非线性关系的机器学习算法,可以对OP来源进行解析.然而,RF算法在较小样本训练下通常会增加泛化误差,导致模型不确定性较大.针对这些问题,本文提出并验证了一种通过强化RF模型对特征变量的识别来提升其泛化能力的小样本学习(few-shot learning,FSL)方法[FSL-RF].通过FSL-RF拟合HULISWS的OP与PMF源解析结果来建立来源与OP的关系,并使用置换变量重要性量化了各来源对OP的贡献.结果表明,生物质燃烧贡献了 HULISWS浓度和HULISWS中OP的72%和63%.此外,烹饪排放贡献了 4%的HULISWS浓度,对HULISWS中OP的贡献为19%.目前,尽管生物质燃烧仍然是东北地区大气HULISWS对人体造成细胞损伤的主要方式,但对烹饪排放的减排对人体细胞损伤的控制更加有效.

Abstract

Water-soluble humic-like substances(HULISWS)are a group of organic compounds which is harmful to human health.Oxygen potential(OP)could evaluate the oxygen ability of aerosols in the lungs and quantifying its source could help the precise emission reduction works.Here,the daily aerosol samples were collected in Northeast China Plain during autumn and winter.In this work,the sources of HULISWS concentration were firstly quantified using the positive-definite matrix factorization model(PMF).Although lots of work using PMF quantified the sources of the OP of HULISWS,the non-linear relation between concentration and OP could cause lots of uncertainties.Random forest(RF)algorithm,which is an easy tool to fit the complex non-linear relationship,was used to quantify the potential sources of OP.However,the generalization error will be much higher when the sample size is small.Here,we conducted a few-shot learning(FSL)method which improved the learning ability of the RF model by strengthening the recognition of characteristic variables[FSL-RF].Combining FSL-RF with PMF,the contribution of sources to the oxygen potential(OP)of HULISWS was quantified.The results indicated that biomass burning emission contributed 72%of mass concentration and 63%to OP of HULISWS.Besides,cooking emissions,which contributed 4%of the mass concentration of HULISWS,contributed 19%to OP of HULISWS.Our results showed that although biomass burning emissions domaint the OP of HULISWS,reducing the cooking emission might be the crucial way to reduce the OP of HULISWS in the Northeast China Plain.

关键词

类腐殖质/氧化潜势/源解析/机器学习/小样本学习

Key words

humic-like substances/oxygen potential/source apportionment/machine learning/few-shot learning

引用本文复制引用

出版年

2024
环境化学
中国科学院生态环境研究中心

环境化学

CSTPCDCSCD北大核心
影响因子:1.049
ISSN:0254-6108
段落导航相关论文