Alumnus talk - Freedom of design in chemical compound space: towards computational design of molecules withesired quantum-mechanical properties
Leonardo Medrano
Department of Physics and Materials Science, University of Luxembourg
Profile at our chair

Dec. 7, 2023, 1 p.m.
This seminar is held online.
Online: https://tinyurl.com/nanoSeminar-GA

Google Scholar


Computer-driven molecular design combines the principles of chemistry, physics, and artificial intelligence to identify novel chemical compounds and materials with desired properties for a specific application. This rational in silico design requires an advanced understanding of both the structure–property/property–property relationships that exist across chemical compound space (CCS) as well as efficient methodologies to define an inverse mapping from properties to 3D molecular structures. In this work, we first analyze these fundamental relationships in the sector of CCS spanned by small molecules using the QM7-X dataset—a comprehensive dataset of 42 quantum-mechanical (QM) properties for ≈4.2 million equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. By characterizing and enumerating progressively more complex manifolds of molecular property space—the corresponding high-dimensional space defined by the properties of each molecule in this sector of CCS—our analysis reveals that one has a substantial degree of flexibility or “freedom of design” when searching for a single molecule with a desired pair of properties or a set of distinct molecules sharing an array of properties. To explore how the insights gained in this analysis can be exploited in the actual design of molecules with desired properties, we then develop a proof-of-concept implementation based on a variational autoencoder (VAE) architecture and demonstrate that it is feasible to parameterize the QM7-X chemical space with a finite set of extensive/intensive QM properties. We illustrate the capabilities of our approach by conditional generation of de novo molecular structures with targeted properties, transition path interpolation for chemical reactions as well as insights into property-structure relationships. These results thus provide a proof-of-concept demonstration aiming to enable the inverse property-to-structure design in diverse chemical spaces. We expect that our work serves as motivation for the development of advanced ML-based tools that will improve the in silico sampling, identification, and design of molecular systems for a specific application.


Brief CV

Leonardo Medrano Sandonas is a senior postdoc at the University of Luxembourg. He has a broad background in computational modelling of organic molecules and materials. He is currently working on combining machine learning methods with quantum/statistical mechanics to develop physics-inspired neural network potentials for the study of drug-protein interactions as well as frameworks for computer-aided molecular design. In 2018, Leonardo got his doctor degree in Mechanical Engineering at the Technical University of Dresden as an IMPRS fellow (International Max Planck Research School). Earlier, he got his bachelor and master’s degree in Physics at the National University of San Marcos in Lima-Peru. In addition to his theoretical investigations, Leonardo actively engages in multi-disciplinary projects with experimental/industrial collaborators to address current challenges in Physics and Chemistry (see Google Scholar page).



Share
Alumnus talk - Freedom of design in chemical compound space: towards computational design of molecules withesired quantum-mechanical properties
Leonardo Medrano
Department of Physics and Materials Science, University of Luxembourg
Profile at our chair

Dec. 7, 2023, 1 p.m.
This seminar is held online.
Online: https://tinyurl.com/nanoSeminar-GA

Google Scholar


Computer-driven molecular design combines the principles of chemistry, physics, and artificial intelligence to identify novel chemical compounds and materials with desired properties for a specific application. This rational in silico design requires an advanced understanding of both the structure–property/property–property relationships that exist across chemical compound space (CCS) as well as efficient methodologies to define an inverse mapping from properties to 3D molecular structures. In this work, we first analyze these fundamental relationships in the sector of CCS spanned by small molecules using the QM7-X dataset—a comprehensive dataset of 42 quantum-mechanical (QM) properties for ≈4.2 million equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. By characterizing and enumerating progressively more complex manifolds of molecular property space—the corresponding high-dimensional space defined by the properties of each molecule in this sector of CCS—our analysis reveals that one has a substantial degree of flexibility or “freedom of design” when searching for a single molecule with a desired pair of properties or a set of distinct molecules sharing an array of properties. To explore how the insights gained in this analysis can be exploited in the actual design of molecules with desired properties, we then develop a proof-of-concept implementation based on a variational autoencoder (VAE) architecture and demonstrate that it is feasible to parameterize the QM7-X chemical space with a finite set of extensive/intensive QM properties. We illustrate the capabilities of our approach by conditional generation of de novo molecular structures with targeted properties, transition path interpolation for chemical reactions as well as insights into property-structure relationships. These results thus provide a proof-of-concept demonstration aiming to enable the inverse property-to-structure design in diverse chemical spaces. We expect that our work serves as motivation for the development of advanced ML-based tools that will improve the in silico sampling, identification, and design of molecular systems for a specific application.


Brief CV

Leonardo Medrano Sandonas is a senior postdoc at the University of Luxembourg. He has a broad background in computational modelling of organic molecules and materials. He is currently working on combining machine learning methods with quantum/statistical mechanics to develop physics-inspired neural network potentials for the study of drug-protein interactions as well as frameworks for computer-aided molecular design. In 2018, Leonardo got his doctor degree in Mechanical Engineering at the Technical University of Dresden as an IMPRS fellow (International Max Planck Research School). Earlier, he got his bachelor and master’s degree in Physics at the National University of San Marcos in Lima-Peru. In addition to his theoretical investigations, Leonardo actively engages in multi-disciplinary projects with experimental/industrial collaborators to address current challenges in Physics and Chemistry (see Google Scholar page).



Share