Objective To mine the medical health knowledge,provide a practical experience to support health science popularization knowledge for artificial intelligence and other fields.Methods Based on Natural Language Processing(NLP)technology,the structure of popular science articles accumulated by Xuhui District Center for Disease Control and Prevention from January 2010 to January 2021 were split,read and understand,entity recognition,etc.The processing process included document pre-processing,feature extraction,paragraph screening,read and understand,answer sorting,review,and release.Results A total of 5 395 questions and answers were obtained through direct document structure splitting;by reading and understanding,857 questions and answers were obtained;by extracting digital Q&A,1 668 questions were obtained,forming a preliminary medical health knowledge base in the form of Q&A.Conclusion Natural Language Processing(NLP)technology provides an effective way to produce a large number of language materials for AI technology.
Natural language processingMedical knowledgeCorpusArtificial intelligence