2025 · Cui et al. BioData Mining · BioMed Central · added 2026-04-21
Deep learning, a cornerstone of artificial intelligence, is driving rapid advancements in computational biology. Protein-protein interactions (PPIs) are fundamental regulators of biological functions. Show more
Deep learning, a cornerstone of artificial intelligence, is driving rapid advancements in computational biology. Protein-protein interactions (PPIs) are fundamental regulators of biological functions. With the inclusion of deep learning in PPI research, the field is undergoing transformative changes. Therefore, there is an urgent need for a comprehensive review and assessment of recent developments to improve analytical methods and open up a wider range of biomedical applications. This review meticulously assesses deep learning progress in PPI prediction from 2021 Show less
2025 · International journal of molecular sciences · MDPI · added 2026-04-21
Academic Editor: Sabrina Venditti Received: 22 May 2025 Revised: 5 July 2025 N6-methyladenosine (m6A) represents the most common and thoroughly investigated RNA modification and exerts essential funct Show more
Academic Editor: Sabrina Venditti Received: 22 May 2025 Revised: 5 July 2025 N6-methyladenosine (m6A) represents the most common and thoroughly investigated RNA modification and exerts essential functions in regulating gene expression through influencing the RNA stability, the translation efficiency, alternative splicing, and nuclear export processes. The rapid development of high-throughput sequencing approaches, including miCLIP and MeRIP-seq, has profoundly transformed epitranscriptomics research. Show less
2025 · Li et al. BMC Genomics · BioMed Central · added 2026-04-21
Predicting protein‒protein interactions (PPIs) plays a crucial role in understanding biological processes. Although biological experimental methods can identify PPIs, they are costly, time-consuming, Show more
Predicting protein‒protein interactions (PPIs) plays a crucial role in understanding biological processes. Although biological experimental methods can identify PPIs, they are costly, time-consuming, labor-intensive, and often lack stability. In contrast, computational approaches for PPI prediction, particularly deep learning methods, can efficiently learn representations from protein sequences. However, the generalizability, robustness, and stability of computational PPI prediction models still need improvement, especially for species with limited verified PPI Show less
2025 · Frontiers in pharmacology · Frontiers · added 2026-04-21
Background/ObjectivesNew computational methods, based on statistical, machine learning, and deep learning techniques using drug-related entities (e.g., genes, protein bindings, etc.), help reduce the Show more
Background/ObjectivesNew computational methods, based on statistical, machine learning, and deep learning techniques using drug-related entities (e.g., genes, protein bindings, etc.), help reduce the costs of in-vitro experiments through drug-drug interaction prediction (DDIp). This review examines recent advances in DDIp. It presents an in-depth review of the state-of-the-art studies relating to semi-supervised, supervised, self-supervised learning, and other techniques such as graph-based learning and matrix factorization methods for predicting DDIs. All possible interactions between drugs are not known, and accurately predicting interactions is even more difficult due to the complex nature of drug-drug interactions (DDI).MethodsOf the 49 papers published in Web of Science in the last 6 years, 24 papers were considered relevant based on information presented in their titles and abstracts. The included articles focus specifically on predicting DDIs using a type of machine learning algorithm. Excluded articles focused on drug discovery, drug repurposing, molecular representation, or the extraction of biomedical interactions. The methodology, results limitations, and future research directions were studied for each paper. Common challenges, limitations, and future research directions were analyzed.Results and conclusionThe main limitations are class imbalance, poor performance on new drugs, limited explainability, and the need for additional data sources. Show less
In the healthcare industry, the ever-increasing volume of clinical trial data presents challenges for ensuring drug safety and detecting adverse drug reactions (ADRs). This study aims to address the c Show more
In the healthcare industry, the ever-increasing volume of clinical trial data presents challenges for ensuring drug safety and detecting adverse drug reactions (ADRs). This study aims to address the challenge of accurately detecting Serious Adverse Events (SAEs) in pharmacovigilance, a critical component in ensuring drug safety during and after clinical trials. The key problem lies in the underreporting and delayed detection of Adverse Drug Reactions (ADRs) due to the heterogeneous nature of medical data, class imbalance, and the limited scope of traditional monitoring techniques. This study proposes a hybrid AI-driven framework that integrates structured (e.g., patient demographics, lab results) and unstructured data (e.g., clinical notes) to detect ADRs using advanced deep learning and NLP methods. The objective is to outperform traditional signal detection methods and provide interpretable predictions to aid clinicians in real-time. By leveraging advanced Machine Learning (ML) and Deep Learning (DL) techniques, including Random Forests, Gradient Boosting Machines, and Convolutional Neural Networks (CNNs), our model aims to identify potential ADRs across different patient subgroups. Through meticulous feature engineering and the application of techniques to address data imbalance, our model demonstrates improved accuracy and interpretability in predicting ADRs. The CNN model achieved an accuracy of 85 %, outperforming traditional models, such as Logistic Regression (78 %) and Support Vector Machines (80 %). These findings suggest that specific demographic and clinical factors significantly influence the likelihood of adverse reactions, offering valuable insights for targeted monitoring and risk mitigation strategies[11]. This research underscores the potential of predictive modeling to enhance pharmacovigilance efforts and ensure safer clinical trial outcomes.•The research methodology includes a comparison of supervised learning algorithms, such as Logistic Regression, Random Forest, Gradient Boost, CNN, and genetic algorithms, to identify patterns and anomalies in clinical trial data. BERT and GPT, were also employed to provide the functionality of textual interactions over medical data.•Performance metrics such as accuracy, precision, recall, and F1-score were systematically applied to evaluate each model's performance. Among the models tested, the CNN model with BERT achieved the highest accuracy, providing valuable insights into the potential of deep learning for enhancing pharmacovigilance practices.•These findings suggest that an inclusion of diverse clinical data when supplied to advanced ML and NLP techniques can significantly improve the detection of ADRs, leading to better alignment with the fundamental principles of Good Clinical Practice (GCP). Show less
Computational drug discovery is essential for screening
potential treatments and reducing the costs and time associated with
proposing or combining drugs for disease management. Despite the
extensive Show more
Computational drug discovery is essential for screening
potential treatments and reducing the costs and time associated with
proposing or combining drugs for disease management. Despite the
extensive research conducted in this field, it remains an emerging area,
particularly with the advent of machine learning, deep learning, and large
language models (LLMs). This systematic review examines the
integration of machine learning and deep learning techniques in drug
discovery, concentrating on three critical areas: drug−drug interactions
(DDIs), drug-target interactions (DTIs), and adverse drug reactions
(ADRs). The review analyzes over 100 papers published between 2020
and 2025, categorizing the methods into deep learning, machine learning,
graph learning, and hybrid models. It highlights the transformative impact
of natural language processing (NLP) and LLMs in extracting meaningful
insights from biomedical literature and chemical data. Furthermore, this work introduces key databases and data sets widely utilized
in drug discovery. Additionally, this review identifies gaps in the existing research, such as the lack of comprehensive studies that
simultaneously address DDI, DTI, and ADR extraction, and it proposes a more holistic approach to fill these gaps. The paper
concludes by thoroughly evaluating various models, underscoring their performance metrics. Show less
Drug-drug interactions (DDIs) can produce unpredictable pharmacological effects and lead to adverse
events that have the potential to cause irreversible damage to the organism. Traditional methods to
Show more
Drug-drug interactions (DDIs) can produce unpredictable pharmacological effects and lead to adverse
events that have the potential to cause irreversible damage to the organism. Traditional methods to
detect DDIs through biological or pharmacological analysis are time-consuming and expensive, therefore,
there is an urgent need to develop computational methods to effectively predict drug-drug interactions.
Currently, deep learning and knowledge graph techniques which can effectively extract features of entities have been widely utilized to develop DDI prediction methods. In this research, we aim to systematically review DDI prediction researches applying deep learning and graph knowledge. The available
biomedical data and public databases related to drugs are firstly summarized in this review. Then, we
discuss the existing drug-drug interactions prediction methods which have utilized deep learning and
knowledge graph techniques and group them into three main classes: deep learning-based methods,
knowledge graph-based methods, and methods that combine deep learning with knowledge graph.
We comprehensively analyze the commonly used drug related data and various DDI prediction methods,
and compare these prediction methods on benchmark datasets. Finally, we briefly discuss the challenges
related to drug-drug interactions prediction, including asymmetric DDIs prediction and high-order DDI
prediction. Show less
2024 · Current Drug Targets · Bentham Science · added 2026-04-21
Background: Drug discovery is a complex and expensive procedure involving several
timely and costly phases through which new potential pharmaceutical compounds must pass to get
approved. One of these Show more
Background: Drug discovery is a complex and expensive procedure involving several
timely and costly phases through which new potential pharmaceutical compounds must pass to get
approved. One of these critical steps is the identification and optimization of lead compounds,
which has been made more accessible by the introduction of computational methods, including
deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the
vast landscape of interaction between proteins and ligands and predict their affinity, helping in the
identification of lead compounds.
ARTICLE HISTORY
Objective: This survey fills a gap in previous research by comprehensively analyzing the most
commonly used datasets and discussing their quality and limitations. It also offers a comprehensive classification of the most recent DL methods in the context of protein-ligand binding affinity
prediction (BAP), providing a fresh perspective on this evolving field.
Received: June 07, 2024
Revised: August 11, 2024
Accepted: August 19, 2024
Methods: We thoroughly examine commonly used datasets for BAP and their inherent characteristics. Our exploration extends to various preprocessing steps and DL techniques, including graph
neural networks, convolutional neural networks, and transformers, which are found in the literaDOI:
10.2174/0113894501330963240905083020 ture. We conducted extensive literature research to ensure that the most recent deep learning approaches for BAP were included by the time of writing this manuscript.
Results: The systematic approach used for the present study highlighted inherent challenges to
BAP via DL, such as data quality, model interpretability, and explainability, and proposed considerations for future research directions. We present valuable insights to accelerate the development
of more effective and reliable DL models for BAP within the research community.
Conclusion: The present study can considerably enhance future research on predicting affinity between protein and ligand molecules, hence further improving the overall drug development process. Show less
2023 · Bioinformatics · Oxford University Press · added 2026-04-21
Motivation: Screening new drug–target interactions (DTIs) by traditional experimental methods is costly and time-consuming. Recent advances in knowledge graphs, chemical linear notations, and genomic Show more
Motivation: Screening new drug–target interactions (DTIs) by traditional experimental methods is costly and time-consuming. Recent advances in knowledge graphs, chemical linear notations, and genomic data enable researchers to develop computational-based-DTI models, which play a pivotal role in drug repurposing and discovery. However, there still needs to develop a multimodal fusion DTI model that integrates available heterogeneous data into a unified framework. Results: We developed MDTips, a multimodal-data-based DTI prediction system, by fusing the knowledge graphs, gene expression profiles, and Show less
Drug discovery (DD) is a time-consuming and expensive process. Thus, the industry
employs strategies such as drug repositioning and drug repurposing, which allows the application of
already approved d Show more
Drug discovery (DD) is a time-consuming and expensive process. Thus, the industry
employs strategies such as drug repositioning and drug repurposing, which allows the application of
already approved drugs to treat a different disease, as occurred in the first months of 2020, during the
COVID-19 pandemic. The prediction of drug–target interactions is an essential part of the DD process
because it can accelerate it and reduce the required costs. DTI prediction performed in silico have used
approaches based on molecular docking simulations, including similarity-based and network- and
graph-based ones. This paper presents MPS2IT-DTI, a DTI prediction model obtained from research
conducted in the following steps: the definition of a new method for encoding molecule and protein
sequences onto images; the definition of a deep-learning approach based on a convolutional neural
network in order to create a new method for DTI prediction. Training results conducted with the
Davis and KIBA datasets show that MPS2IT-DTI is viable compared to other state-of-the-art (SOTA)
approaches in terms of performance and complexity of the neural network model. With the Davis
dataset, we obtained 0.876 for the concordance index and 0.276 for the MSE; with the KIBA dataset,
we obtained 0.836 and 0.226 for the concordance index and the MSE, respectively. Moreover, the
MPS2IT-DTI model represents molecule and protein sequences as images, instead of treating them as
an NLP task, and as such, does not employ an embedding layer, which is present in other models.
Academic Editors: Kyriakos
Kachrimanis, David Barlow, Jakub Show less
Drug discovery aims at finding new compounds with specific chemical properties for the treatment of diseases. In the last years, the approach used in this search presents an important component in com Show more
Drug discovery aims at finding new compounds with specific chemical properties for the treatment of diseases. In the last years, the approach used in this search presents an important component in computer science with the skyrocketing of machine learning techniques due to its democratization. With the objectives set by the Precision Medicine initiative and the new challenges generated, it is necessary to establish robust, standard and reproducible computational methodologies to achieve the objectives set. Currently, predictive models based on Machine Learning have gained great importance in the step prior to preclinical studies. This stage manages to drastically reduce costs and research times in the discovery of new drugs. This review article focuses on how these new methodologies are being used in recent years of research. Analyzing the state of the art in this field will give us an idea of where cheminformatics will be developed in the short term, the limitations it presents and the positive results it has achieved. This review will focus mainly on the methods used to model the molecular data, as well as the biological problems addressed and the Machine Learning algorithms used for drug discovery in recent years. Show less
The US National Library of Medicine (NLM) uses the Medical Subject Headings (MeSH) (see Note 1 ) to index almost all 24 million citations in MEDLINE, which greatly facilitates the application of biome Show more
The US National Library of Medicine (NLM) uses the Medical Subject Headings (MeSH) (see Note 1 ) to index almost all 24 million citations in MEDLINE, which greatly facilitates the application of biomedical information retrieval and text mining. Large-scale automatic MeSH indexing has two challenging aspects: the MeSH side and citation side. For the MeSH side, each citation is annotated by only 12 (on average) out of all 28,000 MeSH terms. For the citation side, all existing methods, including Medical Text Indexer (MTI) by NLM, deal with text by bag-of-words, which cannot capture semantic and context-dependent information well. To solve these two challenges, we developed the MeSHLabeler and DeepMeSH. By utilizing "learning to rank" (LTR) framework, MeSHLabeler integrates multiple types of information to solve the challenge in the MeSH side, while DeepMeSH integrates deep semantic representation to solve the challenge in the citation side. MeSHLabeler achieved the first place in both BioASQ2 and BioASQ3, and DeepMeSH achieved the first place in both BioASQ4 and BioASQ5 challenges. DeepMeSH is available at http://datamining-iip.fudan.edu.cn/deepmesh . Show less