中国药物警戒 ›› 2018, Vol. 15 ›› Issue (6): 343-347.

• 安全与合理用药 • 上一篇    下一篇

基于混合模型的药品不良反应数据遮蔽效应消除方法研究

魏建香1, 张剑吟2, 刘美含1, 李明3,4, 孙骏3, 徐厚明3   

  1. 1南京邮电大学物联网学院,江苏 南京 210003;
    2南京邮电大学计算机学院,江苏 南京 210023;
    3江苏省药品不良反应监测中心,江苏 南京 210002;
    4中国药科大学,江苏 南京 210009
  • 收稿日期:2018-08-03 修回日期:2018-08-03 出版日期:2018-06-15 发布日期:2018-08-03
  • 作者简介:魏建香,男,博士,副教授·硕导,药品安全风险评估。
  • 基金资助:
    国家社会科学基金(14BTQ036):大数据环境下药品安全突发事件预警与应急管理研究; 国家食品药品监督管理局药品评价中心委托项目:信号检测数据分类依据的研究

Study on Data Masking Effect Reduction Method for Adverse Drug Reaction Based on Hybrid Model

WEI Jian-xiang1, ZHANG Jian-yin2, LIU Mei-han1, LI Ming3,4, SUN Jun3, XU Hou-ming3   

  1. 1 School of Internet of Things, Nanjing University of Posts and Telecommunications, Jiangsu Nanjing 210003, China;
    2 School of Computer Science, Nanjing University of Posts and Telecommunications, Jiangsu Nanjing 210023, China;
    3 Jiangsu Center for ADR Monitoring, Jiangsu Nanjing 210002, China;
    4 China Pharmaceutical University, Jiangsu Nanjing 210009, China
  • Received:2018-08-03 Revised:2018-08-03 Online:2018-06-15 Published:2018-08-03

摘要: 目的 针对传统不相称性测定法在药品不良反应信号检测上存在数据遮蔽现象问题,提出了一种基于混合模型的遮蔽效应消除方法。方法 该模型融合了Lasso Logistic回归、移除报告及IC方法。首先利用Lasso Logistic回归法进行数据的首轮移除;第二轮则移除发生频次小于等于4的报告;接着采用IC算法进行信号检测;最后利用我国2010年药品不良反应监测报告中的中药数据进行实验并基于已知不良反应库和国家中心信息通报进行结果的验证。结果 实验结果表明,与单一方法比较,此方法在召回率、查准率和F指标上均有良好的表现。结论 该混合模型不仅有效地提高了药品不良反应信号检测的准确性,而且具有早期发现真阳性信号的能力。本研究为药品不良反应信号检测提供了一种更可靠的方法。

关键词: 数据遮蔽效应, 混合模型, Lasso Logistic回归, 信号检测

Abstract: Objective To propose a data masking effect reduction method based on hybrid model to solve the problem of masking in adverse drug reactions (ADRs) signal detection by traditional disproportionality measurement. Methods The model was mixed together with Lasso Logistic Regression(LLR), report removal and IC method. The first round of data removal was carried out by Lasso Logistic Regression method and the second round was to remove the reports which frequency is less than or equal to 4. Then, IC algorithm was used for signal detection. Finally, traditional Chinese medicine data of ADR monitoring reports of China in 2010 was used to conduct experiments, and the Results were tested and verified based on known adverse reaction database and Information Bulletin of National ADR Center. Results The experimental Results showed that this model had better performances in recall rate, precision and F index by comparison with each single method. Conclusion The hybrid model not only improves the accuracy of the ADR signal detection, but also has the ability to detect more true positive signals at early stage. This research may provide a more reliable method for ADR signal detection.

Key words: data masking effect, hybrid model, Lasso Logistic Regression, signal detection

中图分类号: