TMLRpred: A machine learning classification model to distinguish reversible EGFR double mutant inhibitors

Saini, R.; Fatima, S.; Aggarwal, S.M.

dc.contributor.author	Saini, R.
dc.contributor.author	Fatima, S.
dc.contributor.author	Aggarwal, S.M.
dc.date.accessioned	2020-12-04T06:39:53Z
dc.date.available	2020-12-04T06:39:53Z
dc.date.issued	2020-09-01
dc.identifier.issn	17470277
dc.identifier.uri	http://localhost:8080/xmlui/handle/123456789/1056
dc.description.abstract	The EGFR is a clinically important therapeutic drug target in lung cancer. The first-generation tyrosine kinase inhibitors used in clinics are effective against L858R-mutated EGFR. However, relapse of the disease due to the presence of resistant mutation (T790M) makes these inhibitors ineffective. This has necessitated the need to identify new potent EGFR inhibitors against the resistant double mutants. Therefore, various machine learning techniques ((instance-based learner (IBK), naïve Bayesian (NB), sequential minimal optimization (SMO), and random forest (RF)) were employed to develop twelve classification models on three different datasets (high, moderate, and weakly active inhibitors). The models were validated using fivefold cross-validation and independent validation datasets. It was observed that the random forest-based models showed best performance. Also, functional groups, PubChem fingerprints, and substructure of highly active inhibitors were compared to inactive to identify structural features which are important for activity. To promote open-source drug discovery, a tool has been developed, which incorporates the best performing models and allows users to predict the potential of chemical molecules as anti-TMLR inhibitor. It is expected that the machine learning classification models developed in this study will pave way for identifying novel inhibitors against the resistant EGFR double mutants. © 2020 John Wiley & Sons A/S	en_US
dc.language.iso	en_US	en_US
dc.publisher	Blackwell Publishing Ltd	en_US
dc.relation.ispartofseries	Chemical Biology and Drug Design;Vol. 96
dc.subject	classification models	en_US
dc.subject	EGFR	en_US
dc.subject	machine learning	en_US
dc.subject	random forest	en_US
dc.subject	T790M	en_US
dc.title	TMLRpred: A machine learning classification model to distinguish reversible EGFR double mutant inhibitors	en_US
dc.type	Article	en_US