Channelpedia

PubMed 24493411


Referenced in: none

Automatically associated channels: TASK1



Title: Impact of distance-based metric learning on classification and visualization model performance and structure-activity landscapes.

Authors: Natalia V Kireeva, Svetlana I Ovchinnikova, Sergey L Kuznetsov, Andrey M Kazennov, Aslan Yu Tsivadze

Journal, date & volume: J. Comput. Aided Mol. Des., 2014 Feb , 28, 61-73

PubMed link: http://www.ncbi.nlm.nih.gov/pubmed/24493411


Abstract
This study concerns large margin nearest neighbors classifier and its multi-metric extension as the efficient approaches for metric learning which aimed to learn an appropriate distance/similarity function for considered case studies. In recent years, many studies in data mining and pattern recognition have demonstrated that a learned metric can significantly improve the performance in classification, clustering and retrieval tasks. The paper describes application of the metric learning approach to in silico assessment of chemical liabilities. Chemical liabilities, such as adverse effects and toxicity, play a significant role in drug discovery process, in silico assessment of chemical liabilities is an important step aimed to reduce costs and animal testing by complementing or replacing in vitro and in vivo experiments. Here, to our knowledge for the first time, a distance-based metric learning procedures have been applied for in silico assessment of chemical liabilities, the impact of metric learning on structure-activity landscapes and predictive performance of developed models has been analyzed, the learned metric was used in support vector machines. The metric learning results have been illustrated using linear and non-linear data visualization techniques in order to indicate how the change of metrics affected nearest neighbors relations and descriptor space.