Figure 6: Experimentally determined solubility values
plotted against solubility values predicted by model.
Similar graphs were obtained for model run on
Dataset-2 (Free Solv) and Dataset-3 (Lipophilicity).
In both the cases, the model FEAT seems to underfit
the data. Nevertheless, various hyper-parameters can
be tweaked and experimented with to generate more
accurate results.
4 CONCLUSIONS AND SCOPE
FOR FUTURE RESEARCH
It is known that physiological properties of the
chemicals can be related to structural properties of the
chemicals by some functions as we call these
relations as QSAR relations. QSAR relations are not
usually linear in nature, they might be very complex
in higher order polynomials, hence we try to take help
of artificial neural networks in achieving this task. As
molecules can be views as graphs present in nature,
the bonds correspond to the edges and atoms
correspond to the nodes in the graph. It makes sense
to use Graph Neural Network to extract structural
information from the molecular graphs and use this
information to predict various properties from it.
The graph neural network based model FEAT was
designed by having five linear layers of Graph
Convolutional Network Layers with a catenated
Global Mean and Max pooling layer with a final
linear layer to predict the property. The presented
model FEAT gave nearly accurate results. In addition,
it promises to be a fast and feasible means to predict
molecular properties in comparison to conventional
methods based off manual experimentation by
chemists. Considering the present state of this
research, it definitely cannot be taken as an alternative
to conventional experimentation; however, it could
potentially serve as a means for the required
predictions in environments or situations that are
resource- and time-constrained. As a scope for future
research in this relatively less researched domain, the
model can be further improved by adding more
message passing layers, experimenting with different
learning rates, changing atomic properties chosen for
atoms, etc.
REFERENCES
Dearden, J.C. (2003). Quantitative structure-property
relationships for prediction of boiling point, vapor
pressure, and melting point. Environmental Toxicology
and Chemistry, vol. 22, no. 8, pp. 1696–1709. Wiley-
Blackwell.
Kipf, T.N., Welling, M. (2017). Semi-Supervised
Classification with Graph Convolutional Networks. In
Proceedings of the 2017 International Conference on
Learning Representations, pp. 1–14. ICLR.
Wang, Y., Li, Y., Song, Y., Rong, X. (2020). The Influence
of the Activation Function in a Convolution Neural
Network Model of Facial Expression Recognition.
Applied Sciences, vol. 10, no. 5, pp. 1897. MDPI.
Wieder, O., Kohlbacher, S., Kuenemann, M., Garon, A.,
Ducrot, P., Seidel, T., Langer, T. (2020). A compact
review of molecular property prediction with graph
neural networks. Drug Discovery Today: Technologies,
vol. 37, pp. 1-12. Elsevier.
Wu, Z., Ramsundar, B., Feinberg, E.N., Gomes, J.
Geniesse, C., Pappu, A.S., Leswing, K., Pande, V.
(2018). MoleculeNet: A Benchmark for Molecular
Machine Learning. Chemical Science, vol. 9, pp. 513–
530. Royal Society of Chemistry.