👤 Ulrich Schatzschneider

🔍 Search 📋 Browse 🏷️ Tags ❤️ Favourites ➕ Add 🧬 Extraction
5
Articles
articles
Gerd Blanke, Jan Brammer, Djordje Baljozovic +7 more · 2025 · Faraday Discussions · Royal Society of Chemistry · added 2026-04-20
The InChI (International Chemical Identifier) standard stands as a cornerstone in chemical informatics, facilitating the structure-based identification and exchange chemical information about Show more
The InChI (International Chemical Identifier) standard stands as a cornerstone in chemical informatics, facilitating the structure-based identification and exchange chemical information about compounds across various platforms and databases. The InChI as a unique canonical line notation has made chemical structures searchable on the internet at a broad scale. The largest repositories working with InChIs contain more than 1 billion structures. Central to the functionality of the InChI is its codebase, which orchestrates a series of intricate steps to generate unique identifiers for chemical compounds. Up to now, these steps have been sparsely documented and the InChI algorithm had to be seen as a black box. For the new v1.07 release, the code has been analyzed and the major steps documented, more than 3000 bugs and security issues, as well as nearly 60 Google OSS-Fuzz issues have been fixed. New test systems have been implemented that allow users to directly test the code developments. The move to GitHub has not only made the development more transparent but will also enable external contributors to join the further development of the InChI code. Motivation for this modernisation was the urgency to treat molecular inorganic compounds by the InChI in a meaningful way. Until now, no classic string representation fulfills this need of molecular inorganic chemistry. Currently bonds to metal centers are by definition disconnected which makes most inorganic InChIs meaningless at the moment. Herein, we propose new routines to remedy this problem in the representation of molecular inorganic compounds by the InChI. Show less
📄 PDF DOI: 10.1039/D4FD00145A
algorithm development chemical informatics cheminformatics coordination chemistry inorganic chemistry inorganic compounds metal complexes
Victoria V. L. Müller, Peter V. Simpson, Kun Peng +7 more · 2023 · Inorganic Chemistry · ACS Publications · added 2026-04-20
The biological activity of Pd(II) and Pt(II) complexes toward three different cancer cell lines as well as inhibition of selenoenzyme thioredoxin reductase (TrxR) was modulated in an unexpected way by Show more
The biological activity of Pd(II) and Pt(II) complexes toward three different cancer cell lines as well as inhibition of selenoenzyme thioredoxin reductase (TrxR) was modulated in an unexpected way by the introduction of triazolate as a "protective group" to the inner metal coordination sphere using the iClick reaction of [M(N3)(terpy)]PF6 [M = Pd(II) or Pt(II) and terpy = 2,2':6',2″-terpyridine] with an electron-poor alkyne. In a cell proliferation assay using A549, HT-29, and MDA-MB-231 human cancer cell lines, the palladium compound was significantly more potent than the isostructural platinum analogue and exhibited submicromolar activity on the most responsive cell line. This difference was also reflected in the inhibitory efficiency toward TrxR with IC50 values of 0.1 versus 5.4 μM for the Pd(II) and Pt(II) complexes, respectively. UV/Vis kinetic studies revealed that the Pt compound binds to selenocysteine faster than to cysteine [k = (22.9 ± 0.2)·10-3 vs (7.1 ± 0.2)·10-3 s-1]. Selective triazolato ligand exchange of the title compounds with cysteine (Hcys) and selenocysteine (Hsec)─but not histidine (His) and 9-ethylguanine (9EtG)─was confirmed by 1H, 77Se, and 195Pt NMR spectroscopy. Crystal structures of three of the four ligand exchange products were obtained, including [Pt(sec)(terpy)]PF6 as the first metal complex of selenocysteine to be structurally characterized. Show less
no PDF DOI: 10.1021/acs.inorgchem.3c02701
NMR Pd Pt X-ray
Mario Krenn, Qianxiang Ai, Senja Barthel +28 more · 2022 · Patterns (New York, N.Y.) · Elsevier · added 2026-04-20
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of pr Show more
Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool to represent molecular graphs, and the most popular molecular string representation, Smiles, has powered cheminformatics since the late 1980s. However, in the context of AI and ML in chemistry, Smiles has several shortcomings-most pertinently, most combinations of symbols lead to invalid results with no valid chemical interpretation. To overcome this issue, a new language for molecules was introduced in 2020 that guarantees 100% robustness: SELF-referencing embedded string (Selfies). Selfies has since simplified and enabled numerous new applications in chemistry. In this perspective, we look to the future and discuss molecular string representations, along with their respective opportunities and challenges. We propose 16 concrete future projects for robust molecular representations. These involve the extension toward new chemical domains, exciting questions at the interface of AI and robust languages, and interpretability for both humans and machines. We hope that these proposals will inspire several follow-up works exploiting the full potential of molecular string representations for the future of AI in chemistry and materials science. Show less
no PDF DOI: 10.1016/j.patter.2022.100588
ML review
Jan C. Brammer, Gerd Blanke, Claudia Kellner +3 more · 2022 · Journal of Cheminformatics · BioMed Central · added 2026-04-20
AbstractTUCAN is a canonical serialization format that is independent of domain-specific concepts of structure and bonding. The atomic number is the only chemical feature that is used to derive the TU Show more
AbstractTUCAN is a canonical serialization format that is independent of domain-specific concepts of structure and bonding. The atomic number is the only chemical feature that is used to derive the TUCAN format. Other than that, the format is solely based on the molecular topology. Validation is reported on a manually curated test set of molecules as well as a library of non-chemical graphs. The serialization procedure generates a canonical “tuple-style” output which is bidirectional, allowing the TUCAN string to serve as both identifier and descriptor. Use of the Python NetworkX graph library facilitated a compact and easily extensible implementation.Graphical Abstract Show less
📄 PDF DOI: 10.1186/s13321-022-00640-5
graph theory molecular topology
Peter V. Simpson, Claudia Schmidt, Ingo Ott +2 more · 2013 · European Journal of Inorganic Chemistry · Wiley · added 2026-05-01
📄 PDF DOI: 10.1002/ejic.201300820
Biometal