Protein knowledge building through comparative genomics and data integration
General info
- Date from - to
- 01 Jan 2005 - 01 Dec 2011
- Project leader(s)
- Dr. Peter M.A. Groenen
- Participant(s)
- Prof. dr. Roland J. Siezen
- Prof. dr. Jack Leunissen
- Prof. dr. Jaap Heringa
- Prof. dr. Alexander E. Gorbalenya
- Theme
- Integrative bioinformatics
Abstract
Currently, the function of only about 50-80% of proteins in each genome is known or predicted. This fraction can be increased by comparative genomics and integration. In this project, we construct a repository of (the most) accurate sequence similarity information from all fully sequenced genomes. These sequence similarities form the basis of a phylogeny-based protein database. Also, enhanced methods of functional annotation based on sequence homology and non-homology methods are being developed. Finally, a data warehouse for enriched protein information, coupled with improved and robust visualization techniques, is being developed. This allows sophisticated data mining and knowledge building in the areas of biomedicine and biotechnology.

