Annotation contributors
The GO Consortium integrates resources from a variety of research groups, from model organism and protein databases to biological research communities actively involved in the development and implementation of the Gene Ontology. Curators, developers, and others from multiple groups work to maintain the GO Knowledgebase. Below are alphabetical lists of our current and past collaborators.
Current groups contributing GO annotations
Name | Description | Funding | Contact |
---|---|---|---|
AgBase | A curated, open-source, web-accessible resource for functional analysis of agricultural plant and animal gene products | AgBase | |
Alzheimer’s Project at the University of Toronto | Curation of genes associated with Alzheimer’s disease that have been significant in previous genome wide association studies | Alzheimer’s Project | |
The Arabidopsis Information Resource (TAIR) | Database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana | The TAIR project is funded by academic institutional, corporate, and individual subscriptions; TAIR is administered by the 501(c)(3) non-profit Phoenix Bioinformatics | TAIR |
Berkeley Bioinformatics Open-source Projects (BBOP) | Development, use, and integration of ontologies into biological data analysis; software development | Subcontract of the main GO grant from the National Human Genome Research Institute (NHGRI) / National Institutes of Health (NIH) (U41) HG002273, and the Director, Office of Science, Office of Basic Energy Sciences (BES) of the United States Department of Energy under Contract No. DE-AC02-05CH11231 | BBOP |
Candida Genome Database (CGD) | CGD provides online access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans and related species | The Candida Genome Database project is funded by the National Institute of Dental & Craniofacial Research at the US National Institutes of Health | CGD |
Community Assessment of Community Annotation with Ontologies | The Community Assessment of Community Annotation with Ontologies (CACAO) is a competition for teams of undergrads around the world to improve the functional annotation of genes; CACAO was developed and is currently run at Texas A&M University | CACAO | |
ComplexPortal | The Complex Portal is a manually curated, encyclopaedic resource of macromolecular complexes from a number of key model organisms. | ComplexPortal | |
Critical Assessment of Functional Annotation | An ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function | CAFA | |
dictyBase | Informatics resource for the social amoeba Dictyostelium discoideum and related species | dictyBase | |
DisProt | The database of intrinsically disordered proteins | DisProt | |
EcoCyc | EcoCyc is a scientific database for the bacterium Escherichia coli K-12 MG1655 | EcoCyc | |
EcoliWiki | Community annotation for Escherichia coli | EcoliWiki | |
Ensembl | Predicted annotation for vertebrates based on orthology | Ensembl contact | |
EnsemblFungi | Predicted annotation for fungi based on orthology | EnsemblFungi contact | |
EnsemblPlants/Gramene | Predicted annotation for plants based on orthology | EnsemblPlants contact | |
FlyBase | A Database of Drosophila Genes & Genomes | National Human Genome Research Institute at the U.S. National Institutes of Health P41 HG000739 and Medical Research Council, UK grant MR/N030117/1 | FlyBase contact |
HUGO Gene Nomenclature Committee | The HGNC is responsible for approving unique symbols and names for human loci, including protein coding genes, ncRNA genes and pseudogenes, to allow unambiguous scientific communication | The work of the HGNC is supported by National Human Genome Research Institute (NHGRI) grant U24HG003345 & Wellcome Trust grant 208349/Z/17/Z | HGNC |
The Human Protein Atlas | A Swedish-based program initiated in 2003 with the aim to map all the human proteins in cells, tissues, and organs using an integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics, and systems biology | HPA | |
Institute for Genome Sciences, University of Maryland | Data and tools for genomics research in a variety of systems; development of ECO, an ontology for describing evidence to support biological assertions; development of an Ontology of Microbial Phenotypes | ECO is funded by the National Science Foundation under Award Number 1458400 | IGS |
IntAct molecular interaction database | High-quality, manually annotated binary protein interactions and curated complexes | NHLBI Proteomics Center Award (HHSN268201000035C), European Commission grant Affinomics (FP7-241481), The Michael J. Fox Foundation for Parkinson’s Research LRRK2 Biology LEAPS, BBSRC MIDAS (BB/L024179/1) | IntAct |
InterPro | InterPro provides functional analysis of proteins by classifying them into families and predicting domains and important sites | InterPro is funded by a grant from BBSRC (BB/L024136/1) and EMBL core funds | InterPro |
JaponicusDB | JaponicusDB provides comprehensive structural and functional annotation, literature curation and access to large-scale data sets for Schizosaccharomyces japonicus | The S. japonicus community maintains JaponicusDB, led by the Francis Crick Institute and King’s College London, with contributions from PomBase (University of Cambridge) | JaponicusDB |
Mouse Genome Informatics (MGI) | Informatics resources for Laboratory Mouse and related Human Biology data | MGI is supported by the National Institutes of Health (NIH) as follows: MGD is supported by National Human Genome Research Institute (NHGRI) U41 HG000330, GXD is supported by Child Health/Human Development grant HD062499, MTB is supported by National Cancer Institute (NCI) grant CA089713, and MouseMine is supported by a subcontract to NHGRI grant HG004834. | MGI |
Norwegian University of Science and Technology, Systems Biology team | Development of procedures and approaches to build high quality knowledge sources (the ‘Knowledge Commons’) for understanding gene regulation processes | NTNU_SB | |
PANTHER | The PANTHER (Protein ANalysis THrough Evolutionary Relationships) knowledgebase supports biomedical and other research by providing comprehensive information about the evolution of protein-coding gene families, particularly protein phylogeny, function and genetic variation impacting that function | National Science Foundation Grant No. 1458808 | PANTHER |
PomBase | PomBase is a comprehensive database for the fission yeast Schizosaccharomyces pombe, providing structural and functional annotation, literature curation and access to large-scale data sets | Wellcome Trust Grant No. 104967/Z/14/Z | PomBase |
Rat Genome Database (RGD) | Database for the rat Rattus norvegicus | RGD is funded by grant HL64541 from the National Heart, Lung, and Blood Institute on behalf of the NIH | RGD |
Reactome | A knowledgebase of biological processes (formerly Genome Knowledgebase) | National Human Genome Research Institute (NHGRI) / National Institutes of Health (NIH) U41 HG00375, the Ontario Research (GL2) fund, the European Bioinformatics Institute, and the European Commission (PSIMEx) | Reactome |
RHEA | Rhea is an expert-curated knowledgebase of chemical and transport reactions of biological interest - and the standard for enzyme and transporter annotation in UniProtKB | Rhea curation and development is currently supported by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation (SERI) | RHEA |
RNACentral | RNAcentral is a free, public resource that offers integrated access to a comprehensive and up-to-date set of non-coding RNA sequences provided by a collaborating group of Expert Databases representing a broad range of organisms and RNA types | The development of RNAcentral is coordinated by European Bioinformatics Institute and is supported by Wellcome | RNACentral |
Saccharomyces Genome Database (SGD) | The Saccharomyces Genome Database (SGD) provides comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae | National Human Genome Research Institute (NHGRI) / National Institutes of Health (NIH) [SGD U41 HG001315] | SGD |
The Synapse Gene Ontology and Annotation Initiative | Interactive knowledgebase that accumulates available research about synapse biology using Gene Ontology | SynGO | |
UniProt-Gene Ontology Annotation (UniProt-GOA) | Manual and electronic annotation of proteins in the UniProt Knowledgebase, carried out by a wide range of curation activities at the European Bioinformatics Institute, the SIB Swiss Institute of Bioinformatics and the Protein Information Resource | National Institutes of Health (NIH) [UniProt U41HG006104; GO U41HG002273]; British Heart Foundation [RG/13/5/30112] Parkinson’s UK [G-1307] and EMBL core funds. | UniProt-GOA |
University College London Functional Gene Annotation | Manual annotation of human proteins and microRNAs involved in cardiovascular and dementia-relevant processes | British Heart Foundation [RG/13/5/30112; SP/07/007/23671], Alzheimer’s Research UK [ARUK-NSG2016-13; NAS2017A-1; NGS2018-003], Parkinson’s UK [G-1307], National Institute for Health Research University College London Hospitals Biomedical Research Centre | UCL |
WormBase | WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of Caenorhabditis elegans and related nematodes | WormBase is supported by grant #U41 HG002223 from the National Human Genome Research Institute at the US National Institutes of Health, the UK Medical Research Council, and the UK Biotechnology and Biological Sciences Research Council. | WormBase |
Xenbase | Xenbase’s mission is to provide the international research community with a comprehensive, integrated and easy to use web based resource that gives access the diverse and rich genomic, expression and functional data available from Xenopus research | Xenbase is supported by NICHD P41 HD064556 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development | Xenbase |
Zebrafish Information Network (ZFIN) | The Zebrafish Model Organism Database is the community resource for genetic and genomic data involving zebrafish Danio rerio | ZFIN is funded by the National Human Genome Research Institute of the National Institutes of Health [HG002659] | ZFIN |
Past GO Consortium contributing groups
These groups have contributed to the GO Consortium in the past:
Name | Description | Funding | Contact | |
---|---|---|---|---|
ASAP | A Systematic Annotation Package for Community Analysis of Genomes | |||
Aspergillus Genome Database (AspGD) | AspGD is an organized collection of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus | AspGD web submission form | ||
AstraZeneca | AstraZeneca is a global, science-led biopharmaceutical business. | |||
GeneDB | Curation for the whole range of organisms sequenced by the Sanger Institute’s Pathogen Genomics group; sunsetted in 2022 | GeneDB | ||
Gramene | A comparative plant genomics resource for model and reference genomes of more than 30 plant genomes. It provides information for genomes, gene models, gene annotations, genome and gene tree alignments, synteny, genetic variation, gene expression, and a knowledgebase for plant pathways (Plant Reactome). | The National Science Foundation (NSF) supported this work through NSF Plant Genome Initiative grant award #0321685 during the years 2004-2007, NSF award #0851652 (REU Bioinformatics and Computational Biology Summer Undergraduate Program) in 2009-2012, and since 2007, through NSF Plant Genome Research Resource grant award #0703908. Current work is being supported by NSF Improving Plant Genome Annotation grant award #1127112 (Gramene - Exploring Function through Comparative Genomics and Network Analysis). More information | Gramene web submission form | |
J Craig Venter Institute | Databases on several bacterial species; formerly known as The Institute for Genomic Research (TIGR). | |||
Microbial ENergy processes Gene Ontology Project (MENGO) | The MENGO project is a multi-institutional collaborative effort that aims to develop new Gene Ontology terms to describe microbial bioenergy related processes | Office of Science (BER), U.S. Department of Energy. | ||
Plant-Associated Microbe Gene Ontology (PAMGO) | Consortium A multi-institutional collaborative effort involving scientists working on plant pathogenic genomes: the bacteria Dickeya dadantii, Pseudomonas syringae pv tomato and Agrobacterium tumefaciens, the fungus Magnaporthe grisea, the oomycetes Phytophthora sojae and Phytophthora ramorum and the nematode Meloidogyne hapla | |||
Pseudomonas Genome Database (PseudoCAP) | A resource for annotations for the Pseudomonas aeruginosa PAO1 reference strain’s genome and comparative analyses of several related Pseudomonas species | Cystic Fibrosis Foundation Therapeutics Inc. | PseudoCAP | |
Renal Gene Ontology Annotation Initiative | European Bioinformatics Institute | Kidney Research UK | ||
SGN | The Sol Genomics Network (SGN) is a clade-oriented database dedicated to the biology of the Solanaceae family. | SGN | ||
SYSCILIA | The European project SYSCILIA is a systems biology approach to dissect cilia function and its disruption in human genetic disease | European Community’s Seventh Framework Programme (FP7/2007-2013) under the Health Cooperation Programme. | SYSCILIA | |
Tetrahymena Genome Database (TGD) | Database of information about the Tetrahymena thermophila genome sequence determined at The Institute for Genomic Research (TIGR) | TGD |