TY - JOUR
T1 - CoGenT++: an extensive and extensible data environment for computational genomics
AU - Janssen, Paul
AU - Goldovsky, Leonid
AU - Ahren, Dag
AU - Audit, Benjamin
AU - Cases, Ildefonso
AU - Darzentas, Nikos
AU - Enright, Anton J.
AU - López-Bigas, Nuria
AU - Peregrin-Alvarez, Jose M.
AU - Smith, Mike
AU - Tsoka, Sophia
AU - Kunin, Victor
AU - Ouzounis, Christos
A2 - Borgermans, Paul
A2 - Geerts, Louis
A2 - Benotmane, Rafi
N1 - Score = 10
PY - 2005
Y1 - 2005/10/1
AB - Motivation: CoGenT++ is a data environment for computational research in comparative and functional genomics, designed to address issues of consistency, reproducibility, scalability and accessibility.
Description: CoGenT++ facilitates the re-distribution of all fully sequenced and published genomes, storing information about species, gene names and protein sequences. We describe our scalable implementation of ProXSim, a continually updated all-against-all similarity database, which stores pairwise relationships between all genome sequences. Based on these similarities, derived databases are generated for gene fusions—AllFuse, putative orthologs—OFAM, protein families—TRIBES, phylogenetic profiles—ProfUse and phylogenetic trees. Extensions based on the CoGenT++ environment include disease gene prediction, pattern discovery, automated domain detection, genome annotation and ancestral reconstruction.
Conclusion: CoGenT++ provides a comprehensive environment for computational genomics, accessible primarily for large-scale analyses as well as manual browsing.
Availability: The database and component downloads are accessible at http://cgg.ebi.ac.uk/cogentpp.html
KW - MySQL
KW - CoGenT
KW - genome meta analysis
KW - ProXSim
KW - genome comparison
KW - computational genomics
KW - data mining
UR - http://ecm.sckcen.be/OTCS/llisapi.dll/open/ezp_27244
UR - http://knowledgecentre.sckcen.be/so2/bibref/2874
U2 - 10.1093/bioinformatics/bti579
DO - 10.1093/bioinformatics/bti579
M3 - Article
VL - 21
SP - 3806
EP - 3810
JO - Bioinformatics
JF - Bioinformatics
IS - 19
ER -