Ontologies

From phenoscape
Revision as of 02:29, 10 November 2012 by Haendel@ohsu.edu (talk | contribs) (Ontology coordination)

The Phenoscape project is coordinating the development and integration of multispecies and single species anatomy ontologies for vertebrates, with taxonomic focus on teleosts, zebrafish, amphibians, Xenopus, amniotes, and mouse. These ontologies are publicly available and being used and extended by the community (view the list). Where possible, we are also reusing external ontologies (e.g., PATO) from the broader biological community for the greatest interoperability across data types. The development of these ontologies follows OBO Foundry principles as much as possible, including reuse of ontology terms (by import or MIREOT) as the preferred mechanism for using shared terms. These anatomy ontologies are available from the OBO CVS-based version-control system, from which they are regularly loaded into the NCBO BioPortal where they can be readily browsed, searched, and visualized. The ontologies are also available for download as .obo files from OBO Foundry and can also be viewed using the OBO-Edit or Protege desktop software.

The existing model organism databases are largely hardcoded to reference identifiers for their own ontologies (e.g., MGI database reads MA term identifiers). In the long term, we will be exploring how to import terms into these ontologies. In the interim, the existing MOD vertebrate ontologies will continue to link to external terms using a cross-referencing strategy to external ontology identifiers (see Interontology links)

Anatomy Ontologies

Ontology coordination

In the past we have held regular project anatomy ontology conference calls to discuss development with the curators of the various anatomy ontologies listed below. The limb/fin branch was the focus of ontology development in year 1. Currently, we coordinate our development calls with the Phenotype RCN vertebrate working group monthly calls. Information about these calls is sent to the obo-anatomy, phenotype-rcn vertebrate google groups, and the phenoscape project listservs. These calls are open to everyone and call in info and notes from them is available at the Vertebrate Working Group Wiki Page.

In addition, the Phenoscape curators have been meeting weekly to have joint ontology editing sessions. Please let us know if you would like to join us, Melissa Haendel has been coordinating these sessions (haendel@ohsu.edu).

Phenoscape-ext

Phenoscape-ext contains the TAO, AAO, and VSAO combined with Uberon. We are currently in the process of reconciling the issues that arose during this merge (August, 2012), so you may see some messiness for a bit. This ontology will be used by Phenoscape for annotation of evolutionary characters. Phenoscape-ext is our first foray into working directly in OWL.

Vertebrate Skeletal Anatomy Ontology (VSAO)

VSAO contains terms representing structures in the skeletal system of vertebrates. It references terms from the Common Anatomy Reference Ontology (CARO), Gene Ontology (GO) Biological Process, Cell Ontology (CL), and the Phenotype and Trait Ontology (PATO). VSAO can be downloaded at the OBO Foundry and browsed at BioPortal.

  • Release version available here: [3]

Note that VSAO as released represents the outcome of the skeletal anatomy workshop held at NESCent and corresponding paper due out in PLOS ONE. Note also that VSAO development moving forward has been merged into the Phenoscape-ext ontology described above.

New skeletal terms for the limb/fin and cranial skeleton have and will continue to be added to the Phenoscape-ext ontology based on work done at the Phenotype RCN Vertebrate Working Group meeting in Boulder, CO (June 1-3, 2011) and from ongoing work.

Amphibian Anatomy Ontology (AAO)

AAO is a multispecies ontology for amphibian anatomy.

  • Download AAO from the OBO Foundry and browse at BioPortal
  • Source edit directory available here. The current version (April 2012) includes terms merged from the Xenopus Anatomy Ontology and updates from David Blackburn and Wasila Dahdul.

NOTE: as per above case with VSAO, AAO has been merged into phenoscape-ext and is no longer under development.

Teleost Anatomy Ontology (TAO)

TAO is a multi-species ontology for teleost fishes that was initialized with terms from the Zebrafish Anatomical Ontology (ZFA). The development of the TAO currently focuses on the skeletal system because it varies significantly across fishes, is well-preserved in fossil specimens, and it is often the focus of morphologically-based evolutionary studies in ichthyology. The development of the TAO is described in Dahdul et al. (2010).

NOTE: as per above case with VSAO, the TAO has been merged into phenoscape-ext and is no longer under development.

Mouse Adult Gross Anatomy (MA)

The Adult Mouse Anatomy (MA) ontology contains terms that represent structures in the postnatal mouse (Mus) and is used to annotate gene expression and phenotypes for the mouse at Mouse Genome Informatics (MGI) and other resources. Development of the MA is described in Hayamizu et al. (2005).

Xenopus Anatomy Ontology (XAO)

XAO contains terms that represent anatomical structures for the model organism Xenopus laevis (African clawed frog). Xenbase uses XAO to annotate Xenopus gene expression and phenotypes.

  • XAO can be downloaded at the OBO Foundry and browsed at the BioPortal.
  • Term tracker is available here: [4]
  • Source directory is available here: [5]

Zebrafish Anatomical Ontology (ZFA)

ZFA is used by ZFIN to annotate mutant phenotypes for the model organsim zebrafish (Danio rerio).

  • Download from the OBO Foundry or browse at the BioPortal.
  • Term requests can be submitted to the ZFA tracker.
  • Edit version is kept internally at ZFIN, but a prerelease edit version is available at [6]

ZFA terms are cross referenced to TAO terms, and these cross references will be maintained and updated as needed.

Taxonomy Ontologies

Vertebrate Taxonomy Ontology (VTO)

The Vertebrate Taxonomy Ontology includes the TTO, ATO, as well as a new Amniote Taxonomy Ontology. An initial release, including TTO and ATO, but using pan-vertebrate resources (see below) rather than a separate AmTO was made in May 2011. This taxonomy covers vertebrates and was built by starting with the NCBI taxonomy for vertebrates and splicing in TTO (except hagfish), ATO, and the IOC taxonomy of living birds. Synonyms from ITIS and Catalog of Life were attached if the primary name matched a name in the existing taxonomy. Subspecies names were added to their parent species as synonyms (not subclasses). The taxonomy is currently in the OBO format used for TTO and ATO, which includes the use of the Taxonomic Rank Vocabulary to tag taxa with specified rank.

  • NCBI This was used to fill in the gaps (non-avian amniotes) in the current proposed VTO. This provides taxonomy for GenBank submissions (including fossil taxa), but does not claim to be an authoritative source (and generally doesn't cover taxa that have not been submitted). It does provide some taxonomic synonyms as well.
  • Paleobiology Database This covers all groups represented in the fossil record - we have implemented a way to incorporate bulk taxonomy downloads in the VTO. We are still somewhat uncertain about how hierarchies are (dynamically) built in this resource.

These can provide links to additional synonyms and resources (e.g., TTO uses fishbase to provide common names and links to their pages). Taxonomic synonyms are particularly useful as aids to data curation, but common names can assist users in browsing the website. Taxonomic resources (above) can be used as sources of names as well.

  • Fishbase - their taxonomy is close to TTO (both based on Catalog of Fishes), but TTO uses it strictly as a name resource.
  • Global Names Index (GNI) - Extensive list of names but no hierarchy.

Teleost Taxonomy Ontology (TTO)

Phenotypes are associated with species using a taxonomy ontology, the Teleost Taxonomy Ontology (TTO) derived from the Catalog of Fishes (see also the representation on BioPortal, which can be navigated on-line). The TTO is updated in concert with Catalog of Fishes updates. Changes to the TTO relative to the latest version generated from a dump from the Catalog of Fishes are documented TTO_Changes.

Amphibian Taxonomy Ontology (ATO)

The ATO is derived from the AmphibiaWeb list, which provides both taxonomy and some synonyms from ITIS and the IUCN redlist.

Taxonomic Rank Vocabulary

During 2010, we released a separate Taxonomic Rank Vocabulary (TAXRANK), and removed all rank terms (e.g., family, genus, etc.) from the taxa within the TTO. Taxa in the TTO specify their ranks as property values via the metadata relation has_rank, but the object of the has_rank links is contained in the TAXRANK vocabulary.

Developing TAXRANK as a vocabulary, rather than an ontology (e.g., by defining an ordering relation between ranks) should facilitate its reuse in other taxonomic ontologies. Developing a cross-authority (e.g., ICZN, ICBN, etc.) ontology of ranks may be possible, but there does not appear to be a compelling need for such an ontology. The TAXRANK vocabulary can be browsed at Bioportal.

Fish Collection Codes Vocabulary

There is a flat vocabulary of fish collections, based on a list used in Catalog of Fishes, though with a few additions listed on the Fish Collection Updates page. The master list, from which the OBO-format ontology is generated is available as a google docs spreadsheet. It has been augmented with links to entries in the Biodiversity Collections Index.

The current release, as used in Phenex is Fish Collection Abbreviations

Additional documentation

Shared Ontologies

These ontologies listed below were initiated by the model organism communities. Phenoscape is actively involved in extending these ontologies.