Protein knowledge building through comparative genomics and data integration
General info
- Date from - to
- 01 Jan 2005 - 01 Dec 2011
- Project leader(s)
- Groenen, Peter M.A. Dr.
- Participant(s)
- Leunissen, Jack Prof. dr.
- Gorbalenya, Alexander E. Prof. dr.
- Siezen, Roland J. Prof. dr.
- Heringa, Jaap Prof. dr.
Abstract
Currently, the function of only about 50-80% of the proteins in each genome is known or predicted. This fraction can be increased through comparative genomics and data integration. In this project, a repository is constructed of the most accurate available sequence similarity information from all fully sequenced genomes. These sequence similarities form the basis of a phylogeny-based protein database. In addition, enhanced methods of functional annotation, based on both sequence homology and non-homology approaches, are being developed. Finally, a data warehouse for enriched protein information is developed, coupled with improved and robust visualization techniques. Together, these resources allow sophisticated data mining and knowledge building in the areas of biomedicine and biotechnology.
Link to the end report of this project


