Phan, T.-L., Weinbauer, K., González Laffitte, M. E., Pan, Y., Merkle, D., Andersen, J., Fagerberg, R., Flamm, C., & Stadler, P. F. (2025). SynTemp: Efficient Extraction of Graph-Based Reaction Rules from Large-Scale Reaction Databases. Journal of Chemical Information and Modeling. https://doi.org/10.1021/acs.jcim.4c01795
reaction rules; graph transformations; atom-atom maps; classification of reactions
en
Abstract:
Reaction templates are graphs that represent the reaction center as well as the surrounding context in order to specify salient features of chemical reactions. They are subgraphs of imaginary transition states, which are equivalent to double pushout graph rewriting rules and thus can be applied directly to predict reaction outcomes at the structural formula level. We introduce here SynTemp, a framework designed to extract and hierarchically cluster reaction templates from large-scale reaction data repositories. Rule inference is implemented as a robust graph-theoretic approach, which first computes an atom−atom mapping (AAM) as a consensus over partial predictions from multiple state-of-the-art tools and then augments the raw AAM by mechanistically relevant hydrogen atoms and extracts the reactions center extended by relevant context. SynTemp achieves an exceptional accuracy of 99.5% and a success rate of 71.23% in obtaining AAMs on the chemical reaction dataset. Hierarchical clustering of the extended reaction centers based on topological features results in a library of 311 transformation rules explaining 86% of the reaction dataset.
en
Project title:
Training Alliance for Computational Systems chemistry: Proposal number: 101072930 (European Commission)
-
Project (external):
Federal Ministry of Education and Research of Germany ; Sächsische Staatsministerium für Wissenschaft Kultur und Tourismus