Skip to main content
Top
Published in: Malaria Journal 1/2017

Open Access 01-12-2017 | Research

Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes

Authors: Juliana Bernardes, Catherine Vaquero, Alessandra Carbone

Published in: Malaria Journal | Issue 1/2017

Login to get access

Abstract

Background

With the availability of complete genome sequences of both human and non-human Plasmodium parasites, it is now possible to use comparative genomics to look for orthology across Plasmodium species and for species specific genes. This comparative analyses could provide important clues for the development of new strategies to prevent and treat malaria in humans, however, the number of functionally annotated proteins is still low for all Plasmodium species. In the context of genomes that are hard to annotate because of sequence divergence, such as Plasmodium, domain co-occurrence becomes particularly important to trust predictions. In particular, domain architecture prediction can be used to improve the performance of existing annotation methods since homologous proteins might share their architectural context.

Results

Plasmobase is a unique database designed for the comparative study of Plasmodium genomes. Domain architecture reconstruction in Plasmobase relies on DAMA, the state-of-the-art method in architecture prediction, while domain annotation is realised with CLADE, a novel annotation tool based on a multi-source strategy. Plasmobase significantly increases the Pfam domain coverage of all Plasmodium genomes, it proposes new domain architectures as well as new domain families that have never been reported before for these genomes. It proposes a visualization of domain architectures and allows for an easy comparison among architectures within Plasmodium species and with other species, described in UniProt.

Conclusions

Plasmobase is a valuable new resource for domain annotation in Plasmodium genomes. Its graphical presentation of protein sequences, based on domain architectures, will hopefully be of interest for comparative genomic studies. It should help to discover species-specific genes, possibly underlying important phenotypic differences between parasites, and orthologous gene families for deciphering the biology of these complex and important Apicomplexan organisms. In conclusion, Plasmobase is a flexible and rich site where any biologist can find something of his/her own interest.

Availability

Literature
1.
go back to reference Aurrecoechea C, Brestelli J, Brunk B, Dommer J, Fischer S, Gajria B, et al. PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Res. 2009;37:539–43.CrossRef Aurrecoechea C, Brestelli J, Brunk B, Dommer J, Fischer S, Gajria B, et al. PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Res. 2009;37:539–43.CrossRef
2.
go back to reference Mansfield JM, Olivier M. Immune evasion by parasites. In: Kaufmann S, Sher A, Ahmed R, editors. Immunology of infectious diseases. Washington, DC: American Society of Microbiology; 2002. p. 379–92.CrossRef Mansfield JM, Olivier M. Immune evasion by parasites. In: Kaufmann S, Sher A, Ahmed R, editors. Immunology of infectious diseases. Washington, DC: American Society of Microbiology; 2002. p. 379–92.CrossRef
3.
go back to reference Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, et al. Pfam: clans, web tools and services. Nucleic Acids Res. 2005;34:247–51.CrossRef Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, et al. Pfam: clans, web tools and services. Nucleic Acids Res. 2005;34:247–51.CrossRef
4.
go back to reference Schultz J, Milpetz F, Bork P, Ponting CP. Smart, a simple modular architecture research tool: Identification of signaling domains. Proc Natl Acad Sci USA. 1998;95:5857–64.CrossRefPubMedPubMedCentral Schultz J, Milpetz F, Bork P, Ponting CP. Smart, a simple modular architecture research tool: Identification of signaling domains. Proc Natl Acad Sci USA. 1998;95:5857–64.CrossRefPubMedPubMedCentral
5.
go back to reference Sigrist CJ, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, et al. Prosite: a documented database using patterns and profiles as motif descriptors. Brief Bioinform. 2002;3:265–74.CrossRefPubMed Sigrist CJ, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, et al. Prosite: a documented database using patterns and profiles as motif descriptors. Brief Bioinform. 2002;3:265–74.CrossRefPubMed
6.
go back to reference Terrapon N, Gascuel O, Marechal E, Bréhélin L. Detection of new protein domains using co-occurrence: application to Plasmodium falciparum. Bioinformatics. 2009;25:3077–83.CrossRefPubMed Terrapon N, Gascuel O, Marechal E, Bréhélin L. Detection of new protein domains using co-occurrence: application to Plasmodium falciparum. Bioinformatics. 2009;25:3077–83.CrossRefPubMed
7.
go back to reference Ochoa A, Llinás M, Singh M. Using context to improve protein domain identification. BMC Bioinform. 2011;12:90.CrossRef Ochoa A, Llinás M, Singh M. Using context to improve protein domain identification. BMC Bioinform. 2011;12:90.CrossRef
8.
go back to reference Bernardes JS, Vieira FRJ, Zaverucha G, Carbone A. A multi-objective optimization approach accurately resolves protein domain architectures. Bioinformatics. 2016;32:345–53.CrossRefPubMed Bernardes JS, Vieira FRJ, Zaverucha G, Carbone A. A multi-objective optimization approach accurately resolves protein domain architectures. Bioinformatics. 2016;32:345–53.CrossRefPubMed
11.
go back to reference Bernardes JS, Zaverucha G, Catherine V, Carbone A. Improvement in protein domain identification is reached by breaking consensus, with the agreement of many profiles and domain co-occurrence. PLoS Comput Biol. 2016;12:e1005038.CrossRefPubMedPubMedCentral Bernardes JS, Zaverucha G, Catherine V, Carbone A. Improvement in protein domain identification is reached by breaking consensus, with the agreement of many profiles and domain co-occurrence. PLoS Comput Biol. 2016;12:e1005038.CrossRefPubMedPubMedCentral
12.
go back to reference Remmert M, Biegert A, Hauser A, Söding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods. 2011;9:173–5.CrossRefPubMed Remmert M, Biegert A, Hauser A, Söding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods. 2011;9:173–5.CrossRefPubMed
13.
go back to reference Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, et al. Uniprot: the universal protein knowledgebase. Nucleic Acids Res. 2004;32(suppl 1):115–9.CrossRef Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, et al. Uniprot: the universal protein knowledgebase. Nucleic Acids Res. 2004;32(suppl 1):115–9.CrossRef
14.
go back to reference Gene Ontology Consortium. The gene ontology project in 2008. Nucleic Acids Res. 2008;36(suppl 1):440–4. Gene Ontology Consortium. The gene ontology project in 2008. Nucleic Acids Res. 2008;36(suppl 1):440–4.
15.
16.
go back to reference Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S, Lopez R, et al. The interpro protein families database: the classification resource after 15 years. Nucleic Acids Res. 2015;43:D213–21.CrossRefPubMed Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S, Lopez R, et al. The interpro protein families database: the classification resource after 15 years. Nucleic Acids Res. 2015;43:D213–21.CrossRefPubMed
17.
go back to reference Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.CrossRefPubMedPubMedCentral Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.CrossRefPubMedPubMedCentral
19.
go back to reference Yeats C, Redfern O, Orengo C. A fast and automated solution for accurately resolving protein domain architectures. Bioinformatics. 2010;26:745–51.CrossRefPubMed Yeats C, Redfern O, Orengo C. A fast and automated solution for accurately resolving protein domain architectures. Bioinformatics. 2010;26:745–51.CrossRefPubMed
20.
go back to reference Ghouila A, Terrapon N, Gascuel O, Guerfali FZ, Laouini D, Maréchal E, et al. Eupathdomains: the divergent domain database for eukaryotic pathogens. Infect Genet Evol. 2011;11:698–707.CrossRefPubMed Ghouila A, Terrapon N, Gascuel O, Guerfali FZ, Laouini D, Maréchal E, et al. Eupathdomains: the divergent domain database for eukaryotic pathogens. Infect Genet Evol. 2011;11:698–707.CrossRefPubMed
21.
go back to reference Cao J, Kaneko O, Thongkukiatkul A, Tachibana M, Otsuki H, Gao Q, Tsuboi T, Tori M. Rhoptry neck protein ron2 forms a complex with microneme protein AMA1 in Plasmodium falciparum merozoites. Parasitol Int. 2009;58:29–35.CrossRefPubMed Cao J, Kaneko O, Thongkukiatkul A, Tachibana M, Otsuki H, Gao Q, Tsuboi T, Tori M. Rhoptry neck protein ron2 forms a complex with microneme protein AMA1 in Plasmodium falciparum merozoites. Parasitol Int. 2009;58:29–35.CrossRefPubMed
Metadata
Title
Plasmobase: a comparative database of predicted domain architectures for Plasmodium genomes
Authors
Juliana Bernardes
Catherine Vaquero
Alessandra Carbone
Publication date
01-12-2017
Publisher
BioMed Central
Published in
Malaria Journal / Issue 1/2017
Electronic ISSN: 1475-2875
DOI
https://doi.org/10.1186/s12936-017-1887-8

Other articles of this Issue 1/2017

Malaria Journal 1/2017 Go to the issue