In the healthcare industry, the ever-increasing volume of clinical trial data presents challenges for ensuring drug safety and detecting adverse drug reactions (ADRs). This study aims to address the c Show more
In the healthcare industry, the ever-increasing volume of clinical trial data presents challenges for ensuring drug safety and detecting adverse drug reactions (ADRs). This study aims to address the challenge of accurately detecting Serious Adverse Events (SAEs) in pharmacovigilance, a critical component in ensuring drug safety during and after clinical trials. The key problem lies in the underreporting and delayed detection of Adverse Drug Reactions (ADRs) due to the heterogeneous nature of medical data, class imbalance, and the limited scope of traditional monitoring techniques. This study proposes a hybrid AI-driven framework that integrates structured (e.g., patient demographics, lab results) and unstructured data (e.g., clinical notes) to detect ADRs using advanced deep learning and NLP methods. The objective is to outperform traditional signal detection methods and provide interpretable predictions to aid clinicians in real-time. By leveraging advanced Machine Learning (ML) and Deep Learning (DL) techniques, including Random Forests, Gradient Boosting Machines, and Convolutional Neural Networks (CNNs), our model aims to identify potential ADRs across different patient subgroups. Through meticulous feature engineering and the application of techniques to address data imbalance, our model demonstrates improved accuracy and interpretability in predicting ADRs. The CNN model achieved an accuracy of 85 %, outperforming traditional models, such as Logistic Regression (78 %) and Support Vector Machines (80 %). These findings suggest that specific demographic and clinical factors significantly influence the likelihood of adverse reactions, offering valuable insights for targeted monitoring and risk mitigation strategies[11]. This research underscores the potential of predictive modeling to enhance pharmacovigilance efforts and ensure safer clinical trial outcomes.•The research methodology includes a comparison of supervised learning algorithms, such as Logistic Regression, Random Forest, Gradient Boost, CNN, and genetic algorithms, to identify patterns and anomalies in clinical trial data. BERT and GPT, were also employed to provide the functionality of textual interactions over medical data.•Performance metrics such as accuracy, precision, recall, and F1-score were systematically applied to evaluate each model's performance. Among the models tested, the CNN model with BERT achieved the highest accuracy, providing valuable insights into the potential of deep learning for enhancing pharmacovigilance practices.•These findings suggest that an inclusion of diverse clinical data when supplied to advanced ML and NLP techniques can significantly improve the detection of ADRs, leading to better alignment with the fundamental principles of Good Clinical Practice (GCP). Show less
2024 · Current Drug Targets · Bentham Science · added 2026-04-21
Background: Drug discovery is a complex and expensive procedure involving several
timely and costly phases through which new potential pharmaceutical compounds must pass to get
approved. One of these Show more
Background: Drug discovery is a complex and expensive procedure involving several
timely and costly phases through which new potential pharmaceutical compounds must pass to get
approved. One of these critical steps is the identification and optimization of lead compounds,
which has been made more accessible by the introduction of computational methods, including
deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the
vast landscape of interaction between proteins and ligands and predict their affinity, helping in the
identification of lead compounds.
ARTICLE HISTORY
Objective: This survey fills a gap in previous research by comprehensively analyzing the most
commonly used datasets and discussing their quality and limitations. It also offers a comprehensive classification of the most recent DL methods in the context of protein-ligand binding affinity
prediction (BAP), providing a fresh perspective on this evolving field.
Received: June 07, 2024
Revised: August 11, 2024
Accepted: August 19, 2024
Methods: We thoroughly examine commonly used datasets for BAP and their inherent characteristics. Our exploration extends to various preprocessing steps and DL techniques, including graph
neural networks, convolutional neural networks, and transformers, which are found in the literaDOI:
10.2174/0113894501330963240905083020 ture. We conducted extensive literature research to ensure that the most recent deep learning approaches for BAP were included by the time of writing this manuscript.
Results: The systematic approach used for the present study highlighted inherent challenges to
BAP via DL, such as data quality, model interpretability, and explainability, and proposed considerations for future research directions. We present valuable insights to accelerate the development
of more effective and reliable DL models for BAP within the research community.
Conclusion: The present study can considerably enhance future research on predicting affinity between protein and ligand molecules, hence further improving the overall drug development process. Show less