Designing and implementing chemoinformatic approaches in TDR Targets Database: linking genes to chemical compounds in tropical disease causing pathogens

Magariños, María Paula; Overington, John; Carmona, Santiago; Shanmugam, Dhanasekaran; Doyle, Maria; Ralph, Stuart; Crowther, Greg; Hertz-Fowler, Christiane; Nwaka, Solomon; Berriman, Matt; Roos, David; Van Voorhis, Wes; Agüero, Fernán

doi:10.1186/1471-2105-11-S10-O10

Volume 11 Supplement 10

Highlights from the Sixth International Society for Computational Biology (ISCB) Student Council Symposium

Oral presentation
Open access
Published: 07 December 2010

Designing and implementing chemoinformatic approaches in TDR Targets Database: linking genes to chemical compounds in tropical disease causing pathogens

María Paula Magariños¹,
John Overington²,
Santiago Carmona¹,
Dhanasekaran Shanmugam³,
Maria Doyle⁴,
Stuart Ralph⁴,
Greg Crowther⁵,
Christiane Hertz-Fowler⁶,
Solomon Nwaka⁷,
Matt Berriman⁶,
David Roos³,
Wes Van Voorhis⁵ &
…
Fernán Agüero¹

BMC Bioinformatics volume 11, Article number: O10 (2010) Cite this article

2711 Accesses
1 Citations
Metrics details

Background

Information about chemical compounds and their activity against whole organisms or specific molecular targets is available from the literature or from specialized databases. However, there are few resources that effectively integrate such large chemical datasets with genome data and provide a mechanism to link active compounds to potential target genes. Here, we showcase the integration of chemoinformatic tools for querying chemical datasets and linking chemicals to genes in TDR Targets database (tdrtargets.org), a web accessible resource that integrates a wide range of functional genomic datasets from tropical disease pathogens and provides a ranking mechanism for identifying and prioritising novel therapeutic targets [1].

Materials and methods

Chemical datasets were obtained from three different resources: DrugBank, PubChem and StARlite (ChEMBL). A pipeline was developed to calculate a number of properties (molecular weight; number of flexible bonds; polar surface area; H bond donors/acceptors; and predicted octanol/water partition coefficient) and descriptors (InChi, IUPAC's standard and open chemical identifiers; SMILES; and molecular formula) for each molecule, to facilitate querying and linking to other databases. We have also calculated a number of binary fingerprints and molecular statistics to accelerate searches.

Results

A dataset of 504,020 chemicals, enriched in drugs and drug-like compounds, integrated into TDRTargets.org can be queried using: a textual search on molecular descriptors or chemical properties; a substructure search to find molecules containing the query structure; and a similarity search to find similar molecules (using Tanimoto distance) (see Figure 1). In the Starlite database 438,791 compounds are associated with 3,512 known druggable targets, and 2,224 of these could be linked to 3,043 pathogen targets based on sequence similarity. These relationships are available at TDRTargets.org.

Conclusions

A comprehensive collection of chemical data can be queried in various ways, including by chemical properties, structure and descriptors in TDRTargets.org. More importantly, one can also link compounds of interest to novel target genes in tropical disease causing parasitic organisms based on sequence similarity to known targets of these compounds.

References

Agüero F, et al.: Genomic-scale prioritization of drug targets: the TDR Targets database. Nat Rev Drug Discov 2008, 7(11):900–907. 10.1038/nrd2684
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgements

This work was funded by the “Special Programme for Research and Training in Tropical Diseases (UNICEF/UNDP/World Bank/WHO)”. María Paula Magariños is supported by the Fogarty International Center (Grant Number D43TW007888). The content is solely the responsibility of the authors and does not necessarily represent the official views of the Fogarty International Center or the National Institutes of Health.

Author information

Authors and Affiliations

Instituto de Investigaciones Biotecnológicas, Universidad de San Martín, San Martín, Argentina
María Paula Magariños, Santiago Carmona & Fernán Agüero
European Bioinformatics: Institute, EBML Outstation, Hinxton, Cambridge, UK
John Overington
University of Pennsylvania, Philadelphia, PA, USA
Dhanasekaran Shanmugam & David Roos
University of Melbourne, Victoria, Australia
Maria Doyle & Stuart Ralph
University of Washington, Seattle, WA, USA
Greg Crowther & Wes Van Voorhis
Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
Christiane Hertz-Fowler & Matt Berriman
WHO/TDR, Geneva, Switzerland
Solomon Nwaka

Authors

María Paula Magariños
View author publications
You can also search for this author in PubMed Google Scholar
John Overington
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Carmona
View author publications
You can also search for this author in PubMed Google Scholar
Dhanasekaran Shanmugam
View author publications
You can also search for this author in PubMed Google Scholar
Maria Doyle
View author publications
You can also search for this author in PubMed Google Scholar
Stuart Ralph
View author publications
You can also search for this author in PubMed Google Scholar
Greg Crowther
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Hertz-Fowler
View author publications
You can also search for this author in PubMed Google Scholar
Solomon Nwaka
View author publications
You can also search for this author in PubMed Google Scholar
Matt Berriman
View author publications
You can also search for this author in PubMed Google Scholar
David Roos
View author publications
You can also search for this author in PubMed Google Scholar
Wes Van Voorhis
View author publications
You can also search for this author in PubMed Google Scholar
Fernán Agüero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to María Paula Magariños.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Magariños, M.P., Overington, J., Carmona, S. et al. Designing and implementing chemoinformatic approaches in TDR Targets Database: linking genes to chemical compounds in tropical disease causing pathogens. BMC Bioinformatics 11 (Suppl 10), O10 (2010). https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2105-11-S10-O10

Download citation

Published: 07 December 2010
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2105-11-S10-O10

Highlights from the Sixth International Society for Computational Biology (ISCB) Student Council Symposium

Designing and implementing chemoinformatic approaches in TDR Targets Database: linking genes to chemical compounds in tropical disease causing pathogens

Background

Materials and methods

Results

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

Highlights from the Sixth International Society for Computational Biology (ISCB) Student Council Symposium

Designing and implementing chemoinformatic approaches in TDR Targets Database: linking genes to chemical compounds in tropical disease causing pathogens

Background

Materials and methods

Results

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us