基于生成式人工智能大语言模型的文献来源药品不良反应报告自动生成平台研究与应用

doi:10.19803/j.1672-8629.20250291

摘要/Abstract

摘要： 目的利用生成式人工智能技术,探索文献来源药品不良反应报告的自动化生成方法,以提升报告处理效率与准确性。方法基于行业内成熟的基础大语言模型及表格解析、OCR识别等前沿技术,从由中国知网、万方数据中下载的约700篇中文文献中提取药品不良反应报告基本要素,标注数据训练集（6 925条）进行不良反应报告生成场景训练,形成能理解学术文献并自动生成不良反应报告的定制化模型。平台通过多轮算法迭代优化模型性能,结合人工复核,采用人工智能3个算法性能指标数值（召回率、精确度、F₁值）和报告处理前后耗时对比来评估研究结果。结果经过4轮算法迭代,定制化模型的召回率、准确率和F₁值分别达到97.1%、90.1%和93.5%,均满足设定标准。平均报告处理时长由传统人工80 min缩短至45 min,效率提升77.8%。结论生成式人工智能为文献来源药品不良反应报告的智能识别与高效生成,对药物警戒自动化、智能化转型提供了新工具参考。本平台仍面临无法处理复杂排版及异常数据等局限,需持续优化数据集与算法,并严格遵循伦理与监管要求,确保数据合规。

关键词: 药品不良反应, 个例安全性报告, 生成式人工智能, 大语言模型, 机器学习

Abstract: Objective To leverage generative artificial intelligence to explore how to automate the generation of adverse drug reaction (ADR) reports from medical literature in order to enhance the efficiency and accuracy of the report processing workflow. Methods Based on the mature basic large language model in this industry, table parsing, OCR recognition and other advanced technologies, and using the annotated data training set (6 925 pieces) retrieved from about 700 published articles in Chinese downloaded from CNKI and WanFang Database, related training was carried out under an ADR report generation-specific scenario. A model was established that could understand academic publications and automatically generate individual case safety reports (ICSRs). This platform was optimized through multiple rounds of algorithmic iterations and assisted by manual review. The research results were evaluated by comparing the values of three AI algorithm performance indicators (recall rate, precision, F₁ value) and the report processing time. Results After four rounds of algorithm iterations, the recall rate, precision and F₁ value of this generative AI model for ADR reports reached 97.1%, 90.1% and 93.5% respectively, all meeting the project’s acceptance standards. The processing time for ICSRs was reduced from 80 minutes taken by conventional manual methods to 45 minutes, which was boosted by 77.8%. Conclusion Generative artificial intelligence offers new tools for the intelligent identification and efficient generation of ADR reports sourced from literature and plays a significant role in driving the automation and intelligent transformation of pharmacovigilance. However, this platform has such limitations as the complexity of formatting and abnormal data. Constant optimization of datasets and algorithms is required, along with strict adherence to ethical and regulatory requirements to ensure data compliance.

Key words: Adverse Drug Reactions, Individual Case Safety Reports, Generative Artificial Intelligence, Large Language Models, Machine Learning

中图分类号:

R951
R994.11

刘芹, 章嘉一, 吴烜, SOANES Nigel. 基于生成式人工智能大语言模型的文献来源药品不良反应报告自动生成平台研究与应用[J]. 中国药物警戒, 2025, 22(9): 1040-1044.

LIU Qin, ZHANG Jiayi, WU Xuan, SOANES Nigel. An Automatic Generation Platform for Adverse Drug Reaction Reports from Literature Sources Based on the Generative Artificial Intelligence Large Language Model[J]. Chinese Journal of Pharmacovigilance, 2025, 22(9): 1040-1044.

导出引用管理器 EndNote|Ris|BibTeX

链接本文: https://zgywjj.magtechjournal.com/CN/10.19803/j.1672-8629.20250291

https://zgywjj.magtechjournal.com/CN/Y2025/V22/I9/1040

参考文献

[1] National Center for ADR Monitoring, China. Annual Reports on Adverse Drug Reaction Monitoring (2023)[EB/OL]. (2024-03-26)[2024-06-27]. https://www.cdr-adr.org.cn/drug_1/aqjs_1/drug_aqjs_sjbg/202403/t20240326_50614.html.
[2] NMPA. Good Pharmacovigilance Practice[EB/OL]. (2021-05-13)[2024-06-25]. https://www.nmpa.gov.cn/xxgk/fgwj/xzhgfxwj/20210513151827179.html.
[3] ZHU L, ZHU Y, LIU LH, et al.Common Problems with Post-marketing Adverse Drug Reaction Reports from Literature[J]. Chinese Journal of Pharmacovigilance(中国药物警戒), 2021, 18(12): 1155-1158.
[4] NMPA. List of Typical Application Scenarios of Artificial Intelligence in Drug Regulation[EB/OL]. (2024-06-13)[2024-06-18]. https://www.nmpa.gov.cn/xxgk/fgwj/xzhgfxwj/20210513151827179.html.
[5] AI KUWAITI A, NAZER K, AI-REEDY A, et al.A Review of the Role of Artificial Intelligence in Healthcare[J]. Journal of Personalized Medicine, 2023, 13(6): 951.
[6] The Ministry of Industry and Information Technology and Six Other Departments. Implementation Plan for Digital and Intelligent Transformation of the Pharmaceutical Industry (2025-2030)[EB/OL]. (2025-04-24)[2025-08-18]. https://www.nmpa.gov.cn/xxgk/fgwj/qita/20250425170354149.html.
[7] CIOMS. Artificial Intelligence in Pharmacovigilance Proposal[EB/OL]. (2022-02-02)[2024-07-03]. https://cioms.ch/wp-content/uploads/2022/05/CIOMS-Concept-Note-on-AI-in-PV_final-02.02.2022.pdf.
[8] NÉVÉOL A, DALIANIS H, VELUPILLAI S, et al. Clinical Natural Language Processing in Languages Other Than English: Opportunities and Challenges[J]. Journal of Biomedical Semantics, 2018, 9(1): 1-13.
[9] GONG YS, CAI KP, WANG ZQ, et al.Semi-Automatic Text Category Labelling Method Based on Machine Learning[J].Chinese Journal of Engineering Mathematics(工程数学学报), 2021, 38(6): 750-762.
[10] GU JM, TANG CQ, JIANG Y, et al.A Case Report of Acute Kidney Injury Caused by Semaglutide in Combination with Multiple Medications[J]. Chinese Journal of Pharmacoepidemiology(药物流行病学杂志), 2025, 34(7): 835-840.
[11] LI WH, HAKEN B.Computer Vision-ECCV 2020 Workshops[C]. Glasgow, United Kingdom: Springer, 2021.
[12] GU H, JIANG HD.Retinal Hemorrhage Caused by Gefitinib[J]. Adverse Drug Reactions Journal(药物不良反应杂志), 2021, 23(1): 52-53.
[13] TOMOVIC S, PAVLOVIC K, BAJCETA M.Aligning Document Layouts Extracted with Different OCR Engines with Clustering Approach[J]. Egyptian Informatics Journal, 2021, 22: 239-338.
[14] BRUCE P. The Future of Pharmacovigilance is Proactive[EB/OL]. (2024-03-04)[2024-07-16]. https://www.contractpharma.com/the-future-of-pharmacovigilance-is-proactive-210022/.
[15] WANG T, ZHENG MJ, LIU HL, et al.Current Applications of Artificial Intelligence in Pharmacovigilance in the USA and Implications[J]. Chinese Journal of Pharmacovigilance(中国药物警戒), 2023, 20(10): 1129-1133.
[16] LIANG HS, CHEN JQ.Research on Ethical Problems of Application of Generative Artificial Intelligence in Medical Field[J]. Journal of Jinzhou Medical University (Social Science Edition)(锦州医科大学学报社会科学版), 2024, 22(3): 19-22.
[17] The State Council of The People’s Republic of CHINA. Notice by Six Departments Including the Ministry of Science and Technology of Issuing the Guiding Opinions on Accelerating Scenario Innovation and Promoting High-Quality Economic Development with High-Level Application of Artificial Intelligence[EB/OL]. (2022-07-29)[2024-07-10]. https://www.gov.cn/zhengce/zhengceku/2022-08/12/content_5705154.htm.
[18] CHENG L. The Status, Challenges and Prospects of Generative Artificial Intelligence Governance[J]. People's Tribune(人民论坛), 2024, 2: 76-81.
[19] CIOMS. Artificial intelligence in Pharmacovigilance Proposal Working Group Report[EB/OL]. (2025-05-01)[2025-08-14]. https://CIOMS-WG-XIV_Draft-report-for-Public-Consultation_1May-2025.pdf.
[20] ZHANG X.Algorithmic Governance Challenges and Governance-Based Supervision of Generative Artificial Intelligence[J]. Modern Law Science, 2023, 45(3): 108-123.
[21] CHEN HG, AN SF.Research on the Risks and Governance of Generative Artificial Intelligence Training Data[J]. Journal of Hainan Open University(海南开放大学学报), 2024, 94(1): 110-118.
[22] T/SBIAORG 0002-2025 0002-2025. Specifications of AI-Assisted Data Extraction for Individual Case Safety Reports in Pharmacovigilance Based on Large Language Models(基于大语言模型的药物警戒个例安全性报告人工智能辅助信息提取规范)[S]. Beijing: Standards Press of China, 2025.
[23] WANG T, ZHANG L, WANG Q, et al.Analysis of the Current Application Status and Enlightenment of Artificial Intelligence in Post-marketing Pharmacovigilance in China[J]. Herald of Medicine(医药导报), 2025, 44(4): 560-564.
[24] Laborp Drug Development. The Future of Pharmacovigilance[EB/OL]. (2019-09-12)[2024-07-16]. https://www.labcorp.com/future-pharmacovigilance.