ULEO: Unified Language of Experiment Operations for Representation of Synthesis Protocols
[Objective] This study addresses the unified representation of experimental operation verbs in synthesis protocols, in order to provide high-quality experimental protocol data for AI for science and robotics. [Methods] We used a collaborative approach, driven by both data and expert knowledge, to identify and standardize experimental operation verbs in synthesis-related literature and patent texts. First, we used open-source large language models such as ChatGLM2-6B to identify experimental operation verbs. Then, we combined Wu-Palmer similarity and cosine similarity to standardize these verbs. Finally, we assessed their classification accuracy with expert knowledge. [Results] The study identified 149 operation verbs for inorganic synthesis experiments and 141 operation verbs for organic synthesis experiments. Expert review showed that many of the 124 operation terms shared by both groups have no distinct category characteristics. We therefore merged the two categories into a single set of 166 experimental operation verbs (149 + 141 - 124) representing the operations in organic, inorganic, and hybrid synthesis experiments. [Limitations] The study used only basic prompt engineering techniques to direct the large model to recognize experimental operation verbs in publicly accessible datasets. It focused on operation terms involved in synthesis, engineering, and basic steps, and does not cover operation terms in dynamic, analytical, and name reactions. [Conclusions] This study establishes a unified language for representing experimental operations in synthesis, applicable to organic, inorganic, and hybrid synthesis reactions. It could inform the future development of robotic scientific experiments.
Unified Language of Experiment Operations; AI for Science; Synthesis Experimental Protocols; Experiment Operations; Science Robotics
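
The verb standardization step in the Methods combines Wu-Palmer similarity with cosine similarity. The abstract does not say how the two scores are combined, so the following is only a minimal sketch under stated assumptions: the toy taxonomy, the two-dimensional vectors, the weighted average, and the weight alpha are all hypothetical and are not taken from the study.

```python
import math

# Hypothetical toy taxonomy (child -> parent); not taken from the study.
PARENT = {
    "operation": None,
    "separate": "operation",
    "filter": "separate",
    "centrifuge": "separate",
    "heat": "operation",
    "reflux": "heat",
}

# Hypothetical 2-D embeddings, chosen only to keep the arithmetic readable.
VECTORS = {
    "filter": [1.0, 0.0],
    "centrifuge": [0.8, 0.6],
    "reflux": [0.0, 1.0],
}


def path_to_root(node):
    """Return the chain node -> ... -> root, with the node itself first."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def depth(node):
    """Depth counted in nodes, so the root has depth 1."""
    return len(path_to_root(node))


def wu_palmer(a, b):
    """Wu-Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    ancestors_a = set(path_to_root(a))
    # Walking up from b, the first node shared with a is the deepest common ancestor.
    lcs = next(n for n in path_to_root(b) if n in ancestors_a)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def combined_similarity(a, b, alpha=0.5):
    """Assumed combination rule: a weighted average of the two scores."""
    return alpha * wu_palmer(a, b) + (1 - alpha) * cosine(VECTORS[a], VECTORS[b])


if __name__ == "__main__":
    for pair in [("filter", "centrifuge"), ("filter", "reflux")]:
        print(pair, round(combined_similarity(*pair), 3))
```

In this sketch, verbs whose combined score exceeds a chosen threshold would be candidates for merging under one standard verb, with the final decision left to expert review as described in the Methods.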