Investigating the Effect of Coarse-Graining on Chemical Compound Space
dc.contributor.author | Kanekal, Kiran H. | |
dc.date.accessioned | 2021-04-08T11:06:48Z | |
dc.date.available | 2021-04-08T11:06:48Z | |
dc.date.issued | 2021 | |
dc.description.abstract | Chemical structure-property relationships are essential for the development of new materials used in all facets of life. Practically, this process amounts to projecting regions of the chemical compound space (CCS) onto certain descriptors related to the property of interest, allowing the structure-property relationship to be inferred. The challenge in constructing these relationships usually stems from a lack of data, as their accuracy and transferability will depend on how well sampled CCS is with respect to the chosen descriptors. High-throughput screening, in which the properties of compounds are determined in an automated fashion, is one strategy used to overcome this problem. However, for many properties of soft-matter systems, this approach is difficult to implement computationally. This difficulty arises from the large costs associated with adequately sampling complex free energy landscapes at atomistic resolutions using established tools such as molecular dynamics (MD) simulations. Coarse-grained (CG) models, parameterized at lower resolutions compared to their atomistic counterparts, provide a means to circumvent these costs. However, many of these models are constructed specifically to reproduce the properties of a small number of compounds, making it difficult to generalize across CCS. In this work, we demonstrate that the coarse-grained Martini model reduces the size of CCS and can be used in computational high-throughput screening methods to efficiently construct chemical structure-property relationships over wide ranges of CCS. We find that this reduction of CCS is due to a limited number of Martini interaction types, with multiple atomistic chemical fragments mapping to the same CG interaction type. We then investigate the relationship between unsupervised machine learning and coarse-graining, yielding strategies for parameterizing chemically transferable CG models from both a top-down and bottom-up perspective.
We employ these data-driven techniques to parameterize new top-down CG models and quantify their transferability and accuracy as a function of the number of CG interaction types for each model. Finally, we develop a method that uses unsupervised machine learning in combination with the bottom-up multiscale coarse-graining technique to generate chemically transferable CG models with high structural accuracy. We examine the limitations of both top-down and bottom-up approaches and make recommendations for the future development of these methodologies. Overall, our work demonstrates the means by which chemically transferable CG models can be both constructed and utilized to efficiently infer chemical structure-property relationships for materials discovery. | en_GB |
dc.identifier.doi | http://doi.org/10.25358/openscience-5538 | |
dc.identifier.uri | https://openscience.ub.uni-mainz.de/handle/20.500.12030/5542 | |
dc.identifier.urn | urn:nbn:de:hebis:77-openscience-9f93a327-bcb0-45a9-9019-43cf18d01a3f3 | |
dc.language.iso | eng | de |
dc.rights | InC-1.0 | * |
dc.rights.uri | https://rightsstatements.org/vocab/InC/1.0/ | * |
dc.subject.ddc | 004 Informatik | de_DE |
dc.subject.ddc | 004 Data processing | en_GB |
dc.subject.ddc | 500 Naturwissenschaften | de_DE |
dc.subject.ddc | 500 Natural sciences and mathematics | en_GB |
dc.subject.ddc | 530 Physik | de_DE |
dc.subject.ddc | 530 Physics | en_GB |
dc.subject.ddc | 540 Chemie | de_DE |
dc.subject.ddc | 540 Chemistry and allied sciences | en_GB |
dc.subject.ddc | 570 Biowissenschaften | de_DE |
dc.subject.ddc | 570 Life sciences | en_GB |
dc.subject.ddc | 660 Technische Chemie | de_DE |
dc.subject.ddc | 660 Chemical engineering | en_GB |
dc.title | Investigating the Effect of Coarse-Graining on Chemical Compound Space | en_GB |
dc.type | Dissertation | de |
jgu.date.accepted | 2020-12-15 | |
jgu.description.extent | xvi, 229 Seiten | de |
jgu.organisation.department | FB 09 Chemie, Pharmazie u. Geowissensch. | de |
jgu.organisation.name | Johannes Gutenberg-Universität Mainz | |
jgu.organisation.number | 7950 | |
jgu.organisation.place | Mainz | |
jgu.organisation.ror | https://ror.org/023b0x485 | |
jgu.organisation.year | 2020 | |
jgu.rights.accessrights | openAccess | |
jgu.subject.ddccode | 004 | de |
jgu.subject.ddccode | 500 | de |
jgu.subject.ddccode | 530 | de |
jgu.subject.ddccode | 540 | de |
jgu.subject.ddccode | 570 | de |
jgu.subject.ddccode | 660 | de |
jgu.type.dinitype | PhDThesis | en_GB |
jgu.type.resource | Text | de |
jgu.type.version | Original work | de |