Bi ML catalysis coordination-chemistry

Hybrid Computational Strategy for Predicting Complex Ligand–Metal Architectures

Galymzhan Moldagulov, Kisung Lee, Sanzhar Nurgaliyev, Assanali Salem, Anatolii Kuznietsov, Bartosz A. Grzybowski · 2026 · Angewandte Chemie International Edition · Wiley · added 2026-04-20

Understanding how metals coordinate to organic ligands is a precondition for the rational design of metal complexes and catalysts. Whereas certain types of ligands are capable of just one easy‐to‐predict coordination modality, others may present tens and sometimes even hundreds of coordination options (mono‐, bi‐, or polydentate), and predicting the correct one may be a challenge even to seasoned chemists. The current paper describes a “hybrid” computational approach in which a machine learning (ML) algorithm learns to predict complex coordination patterns using knowledge‐based “rules” derived from the Cambridge Structural Database (CSD). This model is applicable to a broad scope of ligands (including hemilabile and haptic ones as well as those with denticity > 6) and different metals at different oxidation states. The algorithm's code is disclosed and can be readily deployed in RDKit via our RDMetallics Python wrapper. It is also deployed as a publicly accessible web portal for demonstration and use.

no PDF DOI: 10.1002/anie.202524655 📎 SI

Machine learning prediction of acute toxicity with in vivo experiments on tetrazole derivatives

Danil P. Zarezin, Alexander A. Shtil, Valentin G. Nenajdenko, Sophia S. Borisevich · 2026 · RSC Medicinal Chemistry · Royal Society of Chemistry · added 2026-04-20

This study investigates the application of machine learning techniques to predict the toxicity of tetrazole derivatives, aiding in the identification of environmental risks from chemical exposure. Utilizing LD50 data sourced from the scientific literature and the ChemIDplus database, regression models were developed to forecast acute intraperitoneal toxicity in mice. A machine learning regression model for acute intraperitoneal toxicity in mice was constructed and validated on a test dataset, achieving high accuracy (R² = 0.76 and MSE below 10⁻⁴) and surpassing most of the comparable literature models. Molecular descriptors were computed via Mordred software to explore quantitative structure–activity relationships, and additionally, the model's robustness was demonstrated by measuring the acute toxicity of tetrazole derivatives synthesized through the azido-Ugi reaction.

📄 PDF DOI: 10.1039/D5MD00757G

ML synthesis tetrazole

An in-depth review of AI-powered advancements in cancer drug discovery.

Minh Huu Nhat Le, Phat Ky Nguyen, Thi Phuong Trang Nguyen, Hien Quang Nguyen, Dao Ngoc Hien Tam, Han Hong Huynh, Phat Kim Huynh, Nguyen Quoc Khanh Le · 2025 · Biochimica et biophysica acta. Molecular basis of disease · Elsevier · added 2026-04-20

The convergence of artificial intelligence (AI) and genomics is redefining cancer drug discovery by facilitating the development of personalized and effective therapies. This review examines the transformative role of AI technologies, including deep learning and advanced data analytics, in accelerating key stages of the drug discovery process: target identification, drug design, clinical trial optimization, and drug response prediction. Cutting-edge tools such as DrugnomeAI and PandaOmics have made substantial contributions to therapeutic target identification, while AI's predictive capabilities are driving personalized treatment strategies. Additionally, advancements like AlphaFold highlight AI's capacity to address intricate challenges in drug development. However, the field faces significant challenges, including the management of large-scale genomic datasets and ethical concerns surrounding AI deployment in healthcare. This review underscores the promise of data-centric AI approaches and emphasizes the necessity of continued innovation and interdisciplinary collaboration. Together, AI and genomics are charting a path toward more precise, efficient, and transformative cancer therapeutics.

no PDF DOI: 10.1016/j.bbadis.2025.167680

Biophysics-based protein language models for protein engineering - Nature Methods

Sam Gelman, Bryce Johnson, Chase R. Freschlin, Arnav Sharma, Sameer D’Costa, John Peters, Anthony Gitter, Philip A. Romero · 2025 · Nature Publishing Group · Nature · added 2026-04-20

Mutational effect transfer learning (METL) is a protein language model framework that unites machine learning and biophysical modeling. Transformer-based neural networks are pretrained on biophysical simulation data to capture fundamental relationships between protein sequence, structure and energetics.

📄 PDF DOI: 10.1038/s41592-025-02776-2

Studying Noncovalent Interactions in Molecular Systems with Machine Learning

Serhii Tretiakov, AkshatKumar Nigam, Robert Pollice · 2025 · Chemical Reviews · ACS Publications · added 2026-04-20

Noncovalent interactions (NCIs) is an umbrella term for a multitude of typically weak interactions within and between molecules. Despite the low individual energy contributions, their collective effect significantly influences molecular behavior. Accordingly, understanding these interactions is crucial across fields like catalysis, drug design, materials science, and environmental chemistry. However, predicting NCIs is challenging, requiring at least molecular mechanics-level pairwise energy contributions or efficient quantum mechanical electron correlation treatment. In this review, we investigate the application of machine learning (ML) to study NCIs in molecular systems, an emerging research field. ML excels at modeling complex nonlinear relationships, and is capable of integrating vast data sets from experimental and theoretical sources. It offers a powerful approach for analyzing interactions across scales, from small molecules to large biomolecular assemblies. Specifically, we examine data sets characterizing NCIs, compare molecular featurization techniques, assess ML models predicting NCIs explicitly, and explore inverse design approaches. ML enhances predictive accuracy, reduces computational costs, and reveals overlooked interaction patterns. By identifying current challenges and future opportunities, we highlight how ML-driven insights could revolutionize this field. Overall, we believe that recent proof-of-concept studies foreshadow exciting developments for the study of NCIs in the years to come.

no PDF DOI: 10.1021/acs.chemrev.4c00893 📎 SI

ML anticancer coordination-chemistry

Machine Learning for Anticancer Activity Prediction of Transition Metal Complexes

Lev Krasnov, Dmitry Malikov, Marina Kiseleva, Ekaterina Nykhrikova, Sergei Tatarin, Stanislav Bezzubov · 2025 · ChemRxiv · added 2026-04-20

no PDF DOI: 10.26434/chemrxiv-2025-1nqvm-v2 📎 SI

Navigating protein landscapes with a machine-learned transferable coarse-grained model - Nature Chemistry

Nicholas E. Charron, Klara Bonneau, Aldo S. Pasos-Trejo +18 more · 2025 · Nature Publishing Group · Nature · added 2026-04-20

The development of a universal protein coarse-grained model has been a long-standing challenge. A coarse-grained model with chemical transferability has now been developed by combining deep-learning methods with a large and diverse training set of all-atom protein simulations. The model can be used for extrapolative molecular dynamics on new sequences.

📄 PDF DOI: 10.1038/s41557-025-01874-0

Drug and Clinical Candidate Drug Data in ChEMBL.

Fiona M I Hunter, Harris Ioannidis, A Patrícia Bento, Nicolas Bosc, Sybilla Corbett, Eloy Felix, M Paula Magarinos, Emma Manners, Ines A Smit, Marleen De Veij, Noel M O'Boyle, Barbara Zdrazil, Andrew R Leach · 2025 · Journal of Medicinal Chemistry · ACS Publications · added 2026-04-20

ChEMBL is a large-scale, open-access, FAIR database of bioactive molecules with drug-like properties. ChEMBL 35 contains 17,500 approved drugs, and drugs that are progressing through the clinical development pipeline. Drug curation has formed an integral part of the core offering of the ChEMBL database since its inception. The paper is a reference guide to present the principles of why the ChEMBL drug data has been curated in a particular manner so that data users can better understand the nature of the data. The drug data include information on: names, synonyms and trade names, chemical structure or biological sequence, data sources, indications, mechanisms, warnings and drug properties such as maximum phase of development, type of molecule, prodrug status and first approval. The integrated nature of the drug data within the context of a bioactivity resource enables the wide use of the data set in drug discovery, AI and machine learning.

no PDF DOI: 10.1021/acs.jmedchem.5c00920 📎 SI

ML prodrug

An open-source screening platform accelerates discovery of drug combinations - Nature Communications

William C. Wright, Min Pan, Gregory A. Phelps, Jonathan Low, Duane Currier, Ankita Sanjali, Marlon Trotter, Jihye Hwang, Richard H. Chapple, Xueying Liu, Declan Bennett, Yinwen Zhang, Richard E. Lee, Taosheng Chen, Paul Geeleher · 2025 · Nature Publishing Group · Nature · added 2026-04-20

Drug combination discovery remains slow and challenging. Here, the authors introduce Combocat, an open-source framework that combines acoustic liquid handling protocols with machine learning to achieve ultrahigh-throughput drug combination screening; as proof of concept, they use Combocat to screen 9,045 drug combinations in a neuroblastoma cell line.

📄 PDF DOI: 10.1038/s41467-025-66223-8

AI-driven pharmacovigilance: Enhancing adverse drug reaction detection with deep learning and NLP.

Khemani DB, Malave DS, Shinde S, Shukla M, Shikalgar R, Talwar H · 2025 · MethodsX · Elsevier · added 2026-04-20

In the healthcare industry, the ever-increasing volume of clinical trial data presents challenges for ensuring drug safety and detecting adverse drug reactions (ADRs). This study aims to address the challenge of accurately detecting Serious Adverse Events (SAEs) in pharmacovigilance, a critical component in ensuring drug safety during and after clinical trials. The key problem lies in the underreporting and delayed detection of ADRs due to the heterogeneous nature of medical data, class imbalance, and the limited scope of traditional monitoring techniques. This study proposes a hybrid AI-driven framework that integrates structured (e.g., patient demographics, lab results) and unstructured data (e.g., clinical notes) to detect ADRs using advanced deep learning and NLP methods. The objective is to outperform traditional signal detection methods and provide interpretable predictions to aid clinicians in real time. By leveraging advanced Machine Learning (ML) and Deep Learning (DL) techniques, including Random Forests, Gradient Boosting Machines, and Convolutional Neural Networks (CNNs), our model aims to identify potential ADRs across different patient subgroups. Through meticulous feature engineering and the application of techniques to address data imbalance, our model demonstrates improved accuracy and interpretability in predicting ADRs. The CNN model achieved an accuracy of 85%, outperforming traditional models such as Logistic Regression (78%) and Support Vector Machines (80%). These findings suggest that specific demographic and clinical factors significantly influence the likelihood of adverse reactions, offering valuable insights for targeted monitoring and risk mitigation strategies [11]. This research underscores the potential of predictive modeling to enhance pharmacovigilance efforts and ensure safer clinical trial outcomes.
• The research methodology includes a comparison of supervised learning algorithms, such as Logistic Regression, Random Forest, Gradient Boost, CNN, and genetic algorithms, to identify patterns and anomalies in clinical trial data. BERT and GPT were also employed to provide the functionality of textual interactions over medical data.
• Performance metrics such as accuracy, precision, recall, and F1-score were systematically applied to evaluate each model's performance. Among the models tested, the CNN model with BERT achieved the highest accuracy, providing valuable insights into the potential of deep learning for enhancing pharmacovigilance practices.
• These findings suggest that supplying diverse clinical data to advanced ML and NLP techniques can significantly improve the detection of ADRs, leading to better alignment with the fundamental principles of Good Clinical Practice (GCP).

📄 PDF DOI: 10.1016/j.mex.2025.103460 📎 SI

From Detection to Prediction: Advances in m6A Methylation Analysis Through Machine Learning and Deep Learning with Implications in Cancer.

Jin R, Zou Q, Luo X · 2025 · International journal of molecular sciences · MDPI · added 2026-04-20

N6-methyladenosine (m6A) represents the most common and thoroughly investigated RNA modification and exerts essential functions in regulating gene expression through influencing the RNA stability, the translation efficiency, alternative splicing, and nuclear export processes. The rapid development of high-throughput sequencing approaches, including miCLIP and MeRIP-seq, has profoundly transformed epitranscriptomics research. These techniques facilitate the detailed transcriptome-wide profiling of m6A modifications, shedding light on their crucial roles in diverse biological pathways. This review comprehensively examines the identification, mechanisms of regulation, and functional consequences of m6A modifications. It emphasizes their critical roles in physiological contexts, encompassing immune function, neuronal development, and the differentiation of stem cells. Additionally, the review discusses the contributions of m6A dysregulation to pathological conditions, including cancer, neurodegenerative diseases, and disorders of metabolism. We also discuss the development and application of machine-learning algorithms for m6A site prediction, emphasizing the integration of sequence-based, structural, and evolutionary conservation features to enhance the predictive accuracy. Furthermore, the potential of applying the findings from m6A research in precision medicine and drug development is examined. By synthesizing the current knowledge and emerging trends, this review aims to provide a comprehensive understanding of m6A biology and its translational potential, offering new perspectives for future research and therapeutic innovation.

📄 PDF DOI: 10.3390/ijms26146701 📎 SI

Spatial recognition and semi-quantification of epigenetic events in pancreatic cancer subtypes with multiplexed molecular imaging and machine learning.

Krzysztof Szymoński, Natalia Janiszewska, Kamila Sofińska, Katarzyna Skirlińska-Nosek, Dawid Lupa, Michał Czaja, Marta Urbańska, Katarzyna Jurkowska, Kamila Konik, Marta Olszewska, Dariusz Adamek, Kamil Awsiuk, Ewelina Lipiec · 2025 · Scientific reports · Nature · added 2026-04-20

Genomic alterations are the driving force behind pancreatic cancer (PC) tumorigenesis, but they do not fully account for its diverse phenotypes. Investigating the epigenetic landscapes of PC offers a more comprehensive understanding and could identify targeted therapies that enhance patient survival. In this study, we have developed a promising new methodology of spatial epigenomics that integrates multiplexed molecular imaging with convolutional neural networks. Then, we used it to map epigenetic modification levels in the six most prevalent PC subtypes. We analyzed and semi-quantified the resulting molecular data, revealing significant variability in their epigenomes. DNA and histone modifications, specifically methylation and acetylation, were investigated. Using the same technique, we examined DNA conformational changes to further elucidate the transcriptional regulatory mechanisms involved in PC differentiation. Our results revealed that the foamy-gland and squamous-differentiated subtypes exhibited significantly increased global levels of epigenetic modifications and elevated Z-DNA ratios. Overall, our findings may suggest a potentially reduced efficacy of therapeutics targeting epigenetic regulators for these subtypes. Conversely, the conventional ductal PC subtype has emerged as a promising candidate for treatment with epigenetic modulators.

📄 PDF DOI: 10.1038/s41598-025-90087-z 📎 SI

ML imaging

Machine learning-based drug-drug interaction prediction: a critical review of models, limitations, and data challenges.

Gheorghita FI, Bocanet VI, Iantovics LB · 2025 · Frontiers in pharmacology · Frontiers · added 2026-04-20

New computational methods, based on statistical, machine learning, and deep learning techniques using drug-related entities (e.g., genes, protein bindings, etc.), help reduce the costs of in-vitro experiments through drug-drug interaction prediction (DDIp). This review examines recent advances in DDIp. It presents an in-depth review of the state-of-the-art studies relating to semi-supervised, supervised, self-supervised learning, and other techniques such as graph-based learning and matrix factorization methods for predicting DDIs. Not all possible interactions between drugs are known, and accurately predicting interactions is even more difficult due to the complex nature of drug-drug interactions (DDIs).

📄 PDF DOI: 10.3389/fphar.2025.1632775 📎 SI

ML amino-acid review

A Systematic Review of Drug-Related Interactions Utilizing Deep Learning and LLMs for Prediction and Mitigation.

Behrooznia S, Hooshmand M · 2025 · ACS Omega · ACS Publications · added 2026-04-20

Computational drug discovery is essential for screening potential treatments and reducing the costs and time associated with proposing or combining drugs for disease management. Despite the extensive research conducted in this field, it remains an emerging area, particularly with the advent of machine learning, deep learning, and large language models (LLMs). This systematic review examines the integration of machine learning and deep learning techniques in drug discovery, concentrating on three critical areas: drug-drug interactions (DDIs), drug-target interactions (DTIs), and adverse drug reactions (ADRs). The review analyzes over 100 papers published between 2020 and 2025, categorizing the methods into deep learning, machine learning, graph learning, and hybrid models. It highlights the transformative impact of natural language processing (NLP) and LLMs in extracting meaningful insights from biomedical literature and chemical data. Furthermore, this work introduces key databases and data sets widely utilized in drug discovery. Additionally, this review identifies gaps in the existing research, such as the lack of comprehensive studies that simultaneously address DDI, DTI, and ADR extraction, and it proposes a more holistic approach to fill these gaps. The paper concludes by thoroughly evaluating various models, underscoring their performance metrics.

📄 PDF DOI: 10.1021/acsomega.5c04997 📎 SI

MPIDNN-GPPI: multi-protein language model with an improved deep neural network for generalized protein‒protein interaction prediction.

Li Y, Wang C, Gu H, Long Z, Fan M, Li L · 2025 · BMC genomics · BioMed Central · added 2026-04-20

Predicting protein‒protein interactions (PPIs) plays a crucial role in understanding biological processes. Although biological experimental methods can identify PPIs, they are costly, time-consuming, labor-intensive, and often lack stability. In contrast, computational approaches for PPI prediction, particularly deep learning methods, can efficiently learn representations from protein sequences. However, the generalizability, robustness, and stability of computational PPI prediction models still need improvement, especially for species with limited verified PPI data. Protein embeddings generated by protein language models can extract features from protein sequences and reflect hierarchical biological structures, making them suitable for predicting PPIs. Therefore, in this study, we propose a novel protein sequence-based PPI prediction framework designed for generalized PPI assessment by integrating two protein language models (PLMs) and an enhanced deep neural network (MPIDNN-GPPI). Specifically, the sequences are embedded using two protein language models, Ankh and ESM-2. A deep neural network is then used to learn representations from the feature vectors produced by PLMs. Subsequently, a multi-head attention mechanism is introduced to capture long-range dependencies and fuse them with DNN-derived representations. Finally, a deep neural network is applied to assess the probability of interaction between two proteins. To evaluate the performance of MPIDNN-GPPI, nine PPI datasets were collected from the STRING database, covering a diverse set of species: five datasets from mammals (D. melanogaster, C. elegans, S. cerevisiae, H. sapiens, and M. musculus), and four datasets from plants (O. sativa, A. thaliana, G. max, and Z. mays). When trained on H. sapiens, MPIDNN-GPPI achieved AUC values of 0.959, 0.966, 0.954, and 0.916 on independent test sets for M. musculus, D. melanogaster, C. elegans, and S. cerevisiae, respectively. These results represent the best performance among all PPI models compared in this study. Similarly, when trained on O. sativa, the model achieved AUC values of 0.96, 0.95, and 0.913 on independent datasets for A. thaliana, G. max, and Z. mays, respectively. Ablation experiments demonstrated that models combining Ankh and ESM-2 outperformed those relying on a single protein language model. Furthermore, MPIDNN-GPPI, which incorporates multi-head attention and deep neural networks (DNN), achieved superior performance compared to models using DNN alone. These findings indicate that MPIDNN-GPPI possesses strong generalization capability for cross-species PPI prediction. The proposed model, trained on one species, can be effectively applied to accurately predict PPIs in other species.

📄 PDF DOI: 10.1186/s12864-025-12228-y 📎 SI

Recent advances in deep learning for protein-protein interaction: a review.

Cui J, Yang S, Yi L, Xi Q, Yang D, Zuo Y · 2025 · BioData mining · BioMed Central · added 2026-04-20

Deep learning, a cornerstone of artificial intelligence, is driving rapid advancements in computational biology. Protein-protein interactions (PPIs) are fundamental regulators of biological functions. With the inclusion of deep learning in PPI research, the field is undergoing transformative changes. Therefore, there is an urgent need for a comprehensive review and assessment of recent developments to improve analytical methods and open up a wider range of biomedical applications. This review meticulously assesses deep learning progress in PPI prediction from 2021 to 2025. We evaluate core architectures (GNNs, CNNs, RNNs) and pioneering approaches: attention-driven Transformers, multi-task frameworks, multimodal integration of sequence and structural data, transfer learning via BERT and ESM, and autoencoders for interaction characterization. Moreover, we examined enhanced algorithms for dealing with data imbalances, variations, and high-dimensional feature sparsity, as well as industry challenges (including shifting protein interactions, interactions with non-model organisms, and rare or unannotated protein interactions), and offered perspectives on the future of the field. In summary, this review systematically summarizes the latest advances and existing challenges in deep learning in the field of protein interaction analysis, providing a valuable reference for researchers in the fields of computational biology and deep learning.

📄 PDF DOI: 10.1186/s13040-025-00457-6 📎 SI

ML amino-acid review

Network-based multi-omics integrative analysis methods in drug discovery: a systematic review.

Jiang W, Ye W, Tan X, Bao YJ · 2025 · BioData mining · BioMed Central · added 2026-04-20

The integration of multi-omics data from diverse high-throughput technologies has revolutionized drug discovery. While various network-based methods have been developed to integrate multi-omics data, systematic evaluation and comparison of these methods remain challenging. This review aims to analyze network-based approaches for multi-omics integration and evaluate their applications in drug discovery. We conducted a comprehensive review of literature (2015-2024) on network-based multi-omics integration methods in drug discovery, and categorized methods into four primary types: network propagation/diffusion, similarity-based approaches, graph neural networks, and network inference models. We also discussed the applications of the methods in three scenarios of drug discovery, including drug target identification, drug response prediction, and drug repurposing, and finally evaluated the performance of the methods by highlighting their advantages and limitations in specific applications. While network-based multi-omics integration has shown promise in drug discovery, challenges remain in computational scalability, data integration, and biological interpretation. Future developments should focus on incorporating temporal and spatial dynamics, improving model interpretability, and establishing standardized evaluation frameworks.

📄 PDF DOI: 10.1186/s13040-025-00442-z 📎 SI

UniProt: the Universal Protein Knowledgebase in 2025.

Bateman A, Martin MJ, Orchard S, Magrane M, Adesina A, Ahmad S, Bowler-Barnett EH, Bye-A-Jee H, Carpentier D, Denny P, Fan J, Garmiri P, Gonzales LJDC, Hussein A, Ignatchenko A, Insana G, Ishtiaq R, Joshi V, Jyothi D, Kandasaamy S, Lock A, Luciani A, Luo J, Lussi Y, Marin JSM, Raposo P, Rice DL, Santos R, Speretta E, Stephenson J, Totoo P, Tyagi N, Urakova N, Vasudev P, Warner K, Wijerathne S, Yu CW, Zaru R, Bridge AJ, Aimo L, Argoud-Puy G, Auchincloss AH, Axelsen KB, Bansal P, Baratin D, Batista Neto TM, Blatter MC, Bolleman JT, Boutet E, Breuza L, Gil BC, Casals-Casas C, Echioukh KC, Coudert E, Cuche B, de Castro E, Estreicher A, Famiglietti ML, Feuermann M, Gasteiger E, Gaudet P, Gehant S, Gerritsen V, Gos A, Gruaz N, Hulo C, Hyka-Nouspikel N, Jungo F, Kerhornou A, Mercier PL, Lieberherr D, Masson P, Morgat A, Paesano S, Pedruzzi I, Pilbout S, Pourcel L, Poux S, Pozzato M, Pruess M, Redaschi N, Rivoire C, Sigrist CJA, Sonesson K, Sundaram S, Sveshnikova A, Wu CH, Arighi CN, Chen C, Chen Y, Huang H, Laiho K, Lehvaslaiho M, McGarvey P, Natale DA, Ross K, Vinayaka CR, Wang Y, Zhang J · 2025 · Nucleic acids research · Oxford University Press · added 2026-04-20

The aim of the UniProt Knowledgebase (UniProtKB; https://www.uniprot.org/) is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication, we describe ongoing changes to our production pipeline to limit the sequences available in UniProtKB to high-quality, non-redundant reference proteomes. We continue to manually curate the scientific literature to add the latest functional data and use machine learning techniques. We also encourage community curation to ensure key publications are not missed. We provide an update on the automatic annotation methods used by UniProtKB to predict information for unreviewed entries describing unstudied proteins. Finally, updates to the UniProt website are described, including a new tab linking protein to genomic information. In recognition of its value to the scientific community, the UniProt database has been awarded Global Core Biodata Resource status.

📄 PDF DOI: 10.1093/nar/gkae1010 📎 SI

Morphological profiling for drug discovery in the era of deep learning

2024 · Briefings in Bioinformatics · Oxford University Press · added 2026-04-20

Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at the single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial improvements in analyzing large-scale high-content images at high throughput. These efforts have facilitated understanding of compound mechanism of action, drug repurposing, characterization of cell morphodynamics under perturbation, and ultimately contributing to the development of novel therapeutics. In this review, we provide a comprehensive overview of the recent advances in the field of morphological profiling. We summarize the image profiling analysis workflow, survey a broad spectrum of analysis strategies encompassing feature engineering– and deep learning–based approaches, and introduce publicly available benchmark datasets. We place a particular emphasis on the application of deep learning in this pipeline, covering cell segmentation, image representation learning, and multimodal learning. Additionally, we illuminate the application of morphological profiling in phenotypic drug discovery and highlight potential challenges and opportunities in this field.

no PDF DOI: 10.1093/bib/bbae284

Predictive Modelling in pharmacokinetics: from in-silico simulations to personalized medicine.

Ajita Paliwal, Smita Jain, Sachin Kumar, Pranay Wal, Madhusmruti Khandai, Prasanna Shama Khandige, Vandana Sadananda, Md Khalid Anwer, Monica Gulati, Tapan Behl, Shriyansh Srivastava · 2024 · Expert opinion on drug metabolism & toxicology · Taylor & Francis · added 2026-04-20

INTRODUCTION: Pharmacokinetic parameters assessment is a critical aspect of drug discovery and development, yet challenges persist due to limited training data. Despite advancements in machine learning and in-silico predictions, scarcity of data hampers accurate prediction of drug candidates' pharmacokinetic properties. AREAS COVERED: The study highlights current developments in human pharmacokinetic prediction, talks about attempts to apply synthetic approaches for molecular design, and searches several databases, including Scopus, PubMed, Web of Science, and Google Scholar. The article stresses importance of rigorous analysis of machine learning model performance in assessing progress and explores molecular modeling (MM) techniques, descriptors, and mathematical approaches. Transitioning to clinical drug development, article highlights AI (Artificial Intelligence) based computer models optimizing trial design, patient selection, dosing strategies, and biomarker identification. In-silico models, including molecular interactomes and virtual patients, predict drug performance across diverse profiles, underlining the need to align model results with clinical studies for reliability. Specialized training for human specialists in navigating predictive models is deemed critical. Pharmacogenomics, integral to personalized medicine, utilizes predictive modeling to anticipate patient responses, contributing to more efficient healthcare system. Challenges in realizing potential of predictive modeling, including ethical considerations and data privacy concerns, are acknowledged. EXPERT OPINION: AI models are crucial in drug development, optimizing trials, patient selection, dosing, and biomarker identification and hold promise for streamlining clinical investigations.

no PDF DOI: 10.1080/17425255.2024.2330666

Structure and dynamics in drug discovery - npj Drug Discovery

Wei, Haixin, McCammon, J. Andrew · 2024 · Nature Publishing Group · Nature · added 2026-04-20

The roles of computing in structure-based drug discovery are considered, from early studies based on some of the first experimental structures of enzyme-inhibitor complexes, through the use of advanced molecular dynamics simulations and machine learning methods. This perspective aims to explore the history, current trends, and future directions of these methodologies.

📄 PDF DOI: 10.1038/s44386-024-00001-2

Merging Metabolic Modeling and Imaging for Screening Therapeutic Targets in Colorectal Cancer.

Niki Tavakoli, Emma J Fong, Abigail Coleman, Yu-Kai Huang, Mathias Bigger, Michael E Doche, Seungil Kim, Heinz-Josef Lenz, Nicholas A Graham, Paul Macklin, Stacey D Finley, Shannon M Mumenthaler · 2024 · bioRxiv : the preprint server for biology · Cold Spring Harbor Laboratory · added 2026-04-20

Cancer-associated fibroblasts (CAFs) play a key role in metabolic reprogramming and are well-established contributors to drug resistance in colorectal cancer (CRC). To exploit this metabolic crosstalk, we integrated a systems biology approach that identified key metabolic targets in a data-driven method and validated them experimentally. This process involved a novel machine learning-based method to computationally screen, in a high-throughput manner, the effects of enzyme perturbations predicted by a computational model of CRC metabolism. This approach reveals the network-wide effects of metabolic perturbations. Our results highlighted hexokinase (HK) as a crucial target, which subsequently became our focus for experimental validation using patient-derived tumor organoids (PDTOs). Through metabolic imaging and viability assays, we found that PDTOs cultured in CAF-conditioned media exhibited increased sensitivity to HK inhibition, confirming the model predictions. Our approach emphasizes the critical role of integrating computational and experimental techniques in exploring and exploiting CRC-CAF crosstalk.

no PDF DOI: 10.1101/2024.05.24.595756 📎 SI

ML imaging

Z-DNA formation in promoters conserved between human and mouse are associated with increased transcription reinitiation rates - Scientific Reports

Beknazarov, Nazar, Konovalov, Dmitry, Herbert, Alan, Poptsova, Maria · 2024 · Nature Publishing Group · Nature · added 2026-04-20

A long-standing question concerns the role of Z-DNA in transcription. Here we use a deep learning approach DeepZ that predicts Z-flipons based on DNA sequence, structural properties of nucleotides and omics data. We examined Z-flipons that are conserved between human and mouse genomes after generating whole-genome Z-flipon maps and then validated them by orthogonal approaches based on high resolution chemical mapping of Z-DNA and the transformer algorithm Z-DNABERT. For human and mouse, we revealed similar pattern of transcription factors, chromatin remodelers, and histone marks associated with conserved Z-flipons. We found significant enrichment of Z-flipons in alternative and bidirectional promoters associated with neurogenesis genes. We show that conserved Z-flipons are associated with increased experimentally determined transcription reinitiation rates compared to promoters without Z-flipons, but without affecting elongation or pausing. Our findings support a model where Z-flipons engage Transcription Factor E and impact phenotype by enabling the reset of preinitiation complexes when active, and the suppression of gene expression when engaged by repressive chromatin complexes.

📄 PDF DOI: 10.1038/s41598-024-68439-y

Drug-drug interactions prediction based on deep learning and knowledge graph: A review.

Luo H, Yin W, Wang J, Zhang G, Liang W, Luo J, Yan C · 2024 · iScience · Elsevier · added 2026-04-20

Drug-drug interactions (DDIs) can produce unpredictable pharmacological effects and lead to adverse events that have the potential to cause irreversible damage to the organism. Traditional methods to detect DDIs through biological or pharmacological analysis are time-consuming and expensive, therefore, there is an urgent need to develop computational methods to effectively predict drug-drug interactions. Currently, deep learning and knowledge graph techniques which can effectively extract features of entities have been widely utilized to develop DDI prediction methods. In this research, we aim to systematically review DDI prediction researches applying deep learning and graph knowledge. The available biomedical data and public databases related to drugs are firstly summarized in this review. Then, we discuss the existing drug-drug interactions prediction methods which have utilized deep learning and knowledge graph techniques and group them into three main classes: deep learning-based methods, knowledge graph-based methods, and methods that combine deep learning with knowledge graph. We comprehensively analyze the commonly used drug related data and various DDI prediction methods, and compare these prediction methods on benchmark datasets. Finally, we briefly discuss the challenges related to drug-drug interactions prediction, including asymmetric DDIs prediction and high-order DDI prediction.

📄 PDF DOI: 10.1016/j.isci.2024.109148 📎 SI

CO2 Fixation to Prebiotic Intermediates over Heterogeneous Catalysts.

Youngdong Song, Harun Tüysüz · 2024 · Accounts of Chemical Research · ACS Publications · added 2026-04-20

Conspectus: The study of the origin of life requires a multifaceted approach to understanding where and how life arose on Earth. One of the most compelling hypotheses is the chemosynthetic origin of life at hydrothermal vents, as this condition has been considered viable for early forms of life. The continuous production of H2 and heat by serpentinization generates reductive conditions at hydrothermal vents, in which CO2 can be used to build large biomolecules. Although this involves surface catalysis and an autocatalytic process, in which solid minerals act as catalysts in the conversion of CO2 to metabolically important organic molecules, the systematic investigation of heterogeneous catalysis to comprehend prebiotic chemistry at hydrothermal vents has not been undertaken. In this Account, we discuss geochemical CO2 fixation to metabolic intermediates by synthetic minerals at hydrothermal vents from the perspective of heterogeneous catalysis. Ni and Fe are the most abundant transition metals at hydrothermal vents and occur in the active site of the enzymes carbon monoxide dehydrogenases/acetyl coenzyme A synthases (CODH/ACS). Synthetic free-standing NiFe alloy nanoparticles can convert CO2 to acetyl coenzyme A pathway intermediates such as formate, acetate, and pyruvate. The same alloy can further convert pyruvate to citramalate, which is essential in the biological citramalate pathway. Thermal treatment of Ni3Fe nanoparticles under NH3, which can occur in hydrothermal vents, results in Ni3FeN/Ni3Fe heterostructures. This catalyst has been demonstrated to produce prebiotic formamide and acetamide from CO2 and H2O using Ni3FeN/Ni3Fe as both substrate and catalyst. In the process of serpentinization, Co can be reduced in the vicinity of olivine, a Mg-Fe silicate mineral. This produces CoFe and CoFe2 with serpentine in nature, representing SiO2-supported CoFe alloys.
In mimicking these natural minerals, synthetic SiO2-supported CoFe alloys demonstrate the same liquid products as NiFe alloys, namely, formate, acetate, and pyruvate under mild hydrothermal vent conditions. In contrast to the NiFe system, hydrocarbons up to C6 were detected in the gas phase, which is also present in hydrothermal vents. The addition of alkali and alkaline-earth metals to the catalysts results in enhanced formate concentration, playing a promotional role in CO2 reduction. Finally, Co was loaded onto ordered mesoporous SiO2 after modification with cations to simulate the minerals found in hydrothermal vents. These catalysts were then investigated under diminished H2O concentration, revealing the conversion of CO2 to CO, CH4, methanol, and acetate. Notably, the selectivity to metabolically relevant methanol was enhanced in the presence of cations that could generate and stabilize the methoxy intermediate. Calculation using the machine learning approach revealed the possibility of predicting the selectivity of CO2 fixation when modifying mesoporous SiO2 supports with heterocations. Our research demonstrates that minerals at hydrothermal vents can convert CO2 into metabolites under a variety of prebiotic conditions, potentially paving the way for modern biological CO2 fixation processes.

no PDF DOI: 10.1021/acs.accounts.4c00151 📎 SI

Co Fe ML Ni catalysis drug-delivery review

Drug Mechanism: A bioinformatic update.

Martina Cirinciani, Eleonora Da Pozzo, Maria Letizia Trincavelli, Paolo Milazzo, Claudia Martini · 2024 · Biochemical pharmacology · Elsevier · added 2026-04-20

A drug Mechanism of Action (MoA) is a complex biological phenomenon that describes how a bioactive compound produces a pharmacological effect. The complete knowledge of MoA is fundamental to fully understanding the drug activity. Over the years, many experimental methods have been developed and a huge quantity of data has been produced. Nowadays, considering the increasing omics data availability and the improvement of the accessible computational resources, the study of a drug MoA is conducted by integrating experimental and bioinformatics approaches. The development of new in silico solutions for this type of analysis is continuously ongoing; herein, an updating review on such bioinformatic methods is presented. The methodologies cited are based on multi-omics data integration in biochemical networks and Machine Learning (ML). The multiple types of usable input data and the advantages and disadvantages of each method have been analyzed, with a focus on their applications. Three specific research areas (i.e. cancer drug development, antibiotics discovery, and drug repurposing) have been chosen for their importance in the drug discovery fields in which the study of drug MoA, through novel bioinformatics approaches, is particularly productive.

no PDF DOI: 10.1016/j.bcp.2024.116078

Advances in Protein-Ligand Binding Affinity Prediction via Deep Learning: A Comprehensive Study of Datasets, Data Preprocessing Techniques, and Model Architectures.

Abdelkader GA, Kim JD · 2024 · Current Drug Targets · Bentham Science · added 2026-04-20

Drug discovery is a complex and expensive procedure involving several timely and costly phases through which new potential pharmaceutical compounds must pass to get approved. One of these critical steps is the identification and optimization of lead compounds, which has been made more accessible by the introduction of computational methods, including deep learning (DL) techniques. Diverse DL model architectures have been put forward to learn the vast landscape of interaction between proteins and ligands and predict their affinity, helping in the identification of lead compounds.

📄 PDF DOI: 10.2174/0113894501330963240905083020 📎 SI

GeNNius: an ultrafast drug-target interaction inference method based on graph neural networks.

Veleiro U, de la Fuente J, Serrano G, Pizurica M, Casals M, Pineda-Lucena A, Vicent S, Ochoa I, Gevaert O, Hernaez M · 2024 · Bioinformatics · Oxford University Press · added 2026-04-20

Drug-target interaction (DTI) prediction is a relevant but challenging task in the drug repurposing field. In-silico approaches have drawn particular attention as they can reduce associated costs and time commitment of traditional methodologies. Yet, current state-of-the-art methods present several limitations: existing DTI prediction approaches are computationally expensive, thereby hindering the ability to use large networks and exploit available datasets and, the generalization to unseen datasets of DTI prediction methods remains unexplored, which could potentially improve the development processes of DTI inferring approaches in terms of accuracy and robustness.

📄 PDF DOI: 10.1093/bioinformatics/btad774 📎 SI