Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/27770
Title: Novel Methodology for Improving the Generalization Capability of Chemo-Informatics Deep Learning Models
Authors: Sandjakoska, Ljubinka
Madevska Bogdanova, Ana
Pejov, Ljupcho
Keywords: Deep learning Regularization Chemo-informatics Molecular simulations
Issue Date: Oct-2022
Publisher: Springer Nature Switzerland
Conference: ICT Innovations 2022, Springer, Cham. vol 1740. https://doi.org/10.1007/978-3-031-22792-9_13
Abstract: In the last decade, the research community has implemented various applications of deep learning concepts to solve quite advanced tasks in chemistry, ranging from computational chemistry to materials and drug design and even chemical synthesis problems at both laboratory and industrial – grades. Because of the advantages as a high-performance prediction tool in molecular simulations, deep learning is becoming far more than just a temporary trend. Instead, it is foreseen as a tool that will be essential to employ throughout tackling a range of different issues in chemical sciences in the nearest future. In this paper, we propose a novel methodology for regularization of deep neural networks used in chemo-informatics. The methodology consists of four blocks: Class of initial conditions; Orthogonalization, Activation and Standardization. Three graph-based architectures are developed: deep tensor neural network, directed acyclic graph and convolutional graph model. Graph-based models are more convenient for modeling molecules since the molecules and their features are often naturally represented by graphs. Several experiments are obtained on datasets from MoleculeNet aggregator: QM7, QM8, QM9, ToxCast, Tox21, ClinTox, BBBP and SIDER, for predicting geometric, energetic, electronic and thermodynamic properties on small molecules. The obtained results outperform some of the published references and give directions for further improvement. As a particular example, in one of the architectures, we have reduced mean absolute error by more than 12 times compared to conventional regression models, and more than 3 times in comparison to deep networks where the proposed methodology is not implemented.
URI: http://hdl.handle.net/20.500.12188/27770
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Show full item record

Page view(s)

19
checked on May 17, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.