Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR

In this work we report the first attempt to study the effect of activity cliffs over the generalization ability of machine learning (ML) based QSAR classifiers, using as study case a previously reported diverse and noisy dataset focused on drug induced liver injury (DILI) and more than 40 ML classif...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Perez Castillo , Y. (author)
مؤلفون آخرون: Sanchez Rodriguez, A. (author)
التنسيق: article
منشور في: 2015
الوصول للمادة أونلاين:http://dspace.utpl.edu.ec/handle/123456789/18835
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1858999309068926976
author Perez Castillo , Y.
author2 Sanchez Rodriguez, A.
author2_role author
author_facet Perez Castillo , Y.
Sanchez Rodriguez, A.
author_role author
collection Repositorio Universidad Técnica Particular de Loja
dc.creator.none.fl_str_mv Perez Castillo , Y.
Sanchez Rodriguez, A.
dc.date.none.fl_str_mv 2015-10-30
2016-01-01
2017-06-16T22:02:27Z
2017-06-16T22:02:27Z
dc.identifier.none.fl_str_mv 10.2174/1381612822666160509124337
1381-6128
10.2174/1381612822666160509124337
http://dspace.utpl.edu.ec/handle/123456789/18835
dc.language.none.fl_str_mv Inglés
dc.publisher.none.fl_str_mv Current Pharmaceutical Design
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.source.none.fl_str_mv reponame:Repositorio Universidad Técnica Particular de Loja
instname:Universidad Técnica Particular de Loja
instacron:UTPL
dc.title.none.fl_str_mv Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
dc.type.none.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
description In this work we report the first attempt to study the effect of activity cliffs over the generalization ability of machine learning (ML) based QSAR classifiers, using as study case a previously reported diverse and noisy dataset focused on drug induced liver injury (DILI) and more than 40 ML classification algorithms. Here, the hypothesis of structure-activity relationship (SAR) continuity restoration by activity cliffs removal is tested as a potential solution to overcome such limitation. Previously, a parallelism was established between activity cliffs generators (ACGs) and instances that should be misclassified (ISMs), a related concept from the field of machine learning. Based on this concept we comparatively studied the classification performance of multiple machine learning classifiers as well as the consensus classifier derived from predictive classifiers obtained from training sets including or excluding ACGs. The influence of the removal of ACGs from the training set over the virtual screening performance was also studied for the respective consensus classifiers algorithms. In general terms, the removal of the ACGs from the training process slightly decreased the overall accuracy of the ML classifiers and multi-classifiers, improving their sensitivity (the weakest feature of ML classifiers trained with ACGs) but decreasing their specificity. Although these results do not support a positive effect of the removal of ACGs over the classification performance of ML classifiers, the �balancing effect� of ACG removal demonstrated to positively influence the virtual screening performance of multi-classifiers based on valid base ML classifiers. Specially, the early recognition ability was significantly favored after ACGs removal. The results presented and discussed in this work represent the first step towards the application of a remedial solution to the activity cliffs problem in QSAR studies.
eu_rights_str_mv openAccess
format article
id UTPL_99a6ec518f53bfdc37e2760d4f6683fa
identifier_str_mv 10.2174/1381612822666160509124337
1381-6128
instacron_str UTPL
institution UTPL
instname_str Universidad Técnica Particular de Loja
language_invalid_str_mv Inglés
network_acronym_str UTPL
network_name_str Repositorio Universidad Técnica Particular de Loja
oai_identifier_str oai:dspace.utpl.edu.ec:123456789/18835
publishDate 2015
publisher.none.fl_str_mv Current Pharmaceutical Design
reponame_str Repositorio Universidad Técnica Particular de Loja
repository.mail.fl_str_mv .
repository.name.fl_str_mv Repositorio Universidad Técnica Particular de Loja - Universidad Técnica Particular de Loja
repository_id_str 1227
spelling Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSARPerez Castillo , Y.Sanchez Rodriguez, A.In this work we report the first attempt to study the effect of activity cliffs over the generalization ability of machine learning (ML) based QSAR classifiers, using as study case a previously reported diverse and noisy dataset focused on drug induced liver injury (DILI) and more than 40 ML classification algorithms. Here, the hypothesis of structure-activity relationship (SAR) continuity restoration by activity cliffs removal is tested as a potential solution to overcome such limitation. Previously, a parallelism was established between activity cliffs generators (ACGs) and instances that should be misclassified (ISMs), a related concept from the field of machine learning. Based on this concept we comparatively studied the classification performance of multiple machine learning classifiers as well as the consensus classifier derived from predictive classifiers obtained from training sets including or excluding ACGs. The influence of the removal of ACGs from the training set over the virtual screening performance was also studied for the respective consensus classifiers algorithms. In general terms, the removal of the ACGs from the training process slightly decreased the overall accuracy of the ML classifiers and multi-classifiers, improving their sensitivity (the weakest feature of ML classifiers trained with ACGs) but decreasing their specificity. Although these results do not support a positive effect of the removal of ACGs over the classification performance of ML classifiers, the �balancing effect� of ACG removal demonstrated to positively influence the virtual screening performance of multi-classifiers based on valid base ML classifiers. Specially, the early recognition ability was significantly favored after ACGs removal. The results presented and discussed in this work represent the first step towards the application of a remedial solution to the activity cliffs problem in QSAR studies.Current Pharmaceutical Design2017-06-16T22:02:27Z2015-10-302017-06-16T22:02:27Z2016-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article10.2174/13816128226661605091243371381-612810.2174/1381612822666160509124337http://dspace.utpl.edu.ec/handle/123456789/18835Inglésinfo:eu-repo/semantics/openAccessreponame:Repositorio Universidad Técnica Particular de Lojainstname:Universidad Técnica Particular de Lojainstacron:UTPL2017-06-16T22:02:27Zoai:dspace.utpl.edu.ec:123456789/18835Institucionalhttps://dspace.utpl.edu.ec/Institución privadahttps://www.utpl.edu.ec/https://dspace.utpl.edu.ec/oai.Ecuador...opendoar:12272017-06-16T22:02:27Repositorio Universidad Técnica Particular de Loja - Universidad Técnica Particular de Lojafalse
spellingShingle Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
Perez Castillo , Y.
status_str publishedVersion
title Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
title_full Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
title_fullStr Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
title_full_unstemmed Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
title_short Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
title_sort Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR
url http://dspace.utpl.edu.ec/handle/123456789/18835