Chinese Journal of Pharmacovigilance ›› 2025, Vol. 22 ›› Issue (12): 1410-1417.
DOI: 10.19803/j.1672-8629.20250512

• Orginal Article • Previous Articles     Next Articles

Prediction of Severe Adverse Drug Reactions Based on Spontaneous Reporting Data-Driven Machine Learning Models

LIU Xi1, LI Chen1,2, TIAN Yuan1, CHEN Mengli1,2*   

  1. 1Department of Pharmacy, Medical Supplies Centre of PLA General Hospital, Beijing 100853, China;
    2PLA ADR Monitoring Center, Beijing 100853, China
  • Received:2025-08-01 Published:2025-12-19

Abstract: Objective To construct an intelligent model for prediction of severe adverse drug reactions (ADR) based on spontaneous ADR reporting data in order to enhance the efficiency of pharmacovigilance, identify high-risk adverse reactions earlier, and optimize the allocation of healthcare resources. Methods A retrospective analysis was conducted of 4 144 spontaneous ADR reports. The DeepSeek large language model (LLM) was applied to standardize diagnostic information and names of drugs. Twenty-four clinical features were selected to process data on age, time variables, and unordered categories with feature engineering. Ten machine learning algorithms, including Bernoulli Naive Bayes (BNB) and Random Forest (RF), were compared. Besides, the effectiveness of such sampling techniques as SMOTE, ADASYN, and TomekLinks in addressing data imbalance (10.4% severe ADR) was evaluated. Model performance was evaluated with the area under the precision-recall curve (AUPRC) as primary metrics, and the area under the curve (AUC) and true positive rate (TPR) as secondary metrics. Feature contributions were analyzed by using SHAP values. Results The combination of BNB and TomekLinks delivered the best performance. In internal validation, AUC=0.921, AUPRC=0.757, and TPR=0.626 were achieved, compared with AUPRC=0.711 and AUC=0.901 in external validation, suggesting good generalization ability. SHAP analysis revealed that hospitalizations or prolonged hospital stay, age, and immune abnormalities or infections indicated significant positive influence, while insignificant impact on preexisting diseases, damage to the gastrointestinal system and the skin and its accessories, and recovery/improvement were indicators of negative influence. Conclusion Undersampling techniques, particularly TomekLinks, outperform oversampling methods for high-dimensional sparse features. The BNB algorithm, a classic classification method based on Bayes' theorem, continues to excel in classification efficiency among various algorithms. Limitations include potential bias from single-center data and insufficient sample size for severe ADR. A more accurate early warning system should be established by integrating multi-center data, taking molecular features of drugs into consideration and leveraging natural language processing technologies.

Key words: Adverse Drug Reaction, Severity, Prediction Model, Machine Learning, Bernoulli Naive Bayes, DeepSeek

CLC Number: