Difference between revisions of "Terminology"

From phenoscape
(Phenotypes and Characters)
 
(45 intermediate revisions by 3 users not shown)
Line 1: Line 1:
'''Phenotype''': An EQ statement. A phenotype, i.e. EQ statement, corresponds to a part or whole character state (i.e. a character state may have multiple phenotypes).  Semantically, a phenotype is_a PATO:quality AND inheres_in some entity. <font color=red>''- used in all papers''</font>
+
__TOC__
 +
== Phenotypes ==
  
'''Distinct phenotype''': Phenotype that differs from others in entity, quality, related entity (where present), relation, or postcomposition (order of parens) <font color=red>''not used''</font>
+
=== Phenotypes and Characters ===
  
'''Phenotype assertion''': <font color=red>not defined, used in TAO paper only''</font>
+
; Phenotype: An Entity-Quality (EQ) statement. Semantically, a phenotype is_a PATO:quality AND inheres_in some entity. In OWL, a phenotype is a class expression that is equivalent to the intersection of a PATO:quality with all things that inheres_in some entity (a property restriction).
**'Another alternative would take advantage of phenotype annotations to infer a taxonomically variable relationship. In thiscase, a rule could be constructed that would allow a reasoner to infer from the (separately asserted) presence of Weberian apparatus in a taxon (e.g., Otophysi) that vertebra 1 is a Weberian vertebra in that taxon. Because the presence of the Weberian apparatus is a morphological character state and as such a phenotype, this strategy would use <B>phenotype assertions </B> to inject additional and taxon-specific relationships into the anatomy ontology, without the need for an ontology curator to maintain those separately.'
 
  
'''Annotation''': part of title of Phenex paper, not defined; defined in TAO paper: 'One common use of ontologies is the  <B>annotation </B> or “tagging” of objects or observations, such as genetic sequences, gene expression patterns, or whole organism phenotypes, with ontology terms. The convention used by model organism databases to  <B>annotate </B> mutant phenotypes is entity-quality (EQ) syntax in which an entity from an anatomical ontology is combined with a quality or nontaxon specific modifier (Gkoutos et al. 2004; Sprague et al. 2008). The Phenoscape project has adopted and extended EQ to  <B>annotate </B> evolutionary phenotypes, specifically systematic characters for os- tariophysan fishes. '
+
; Attribute: High-level PATO quality term, such as PATO:shape. Attribute terms are superclasses (direct or indirect) of lower-level PATO quality terms.
  
'''Phenotype annotation''': <font color=red>''- used in all papers, not defined!''</font>
+
; Character: Presumably homologous feature of an organism that varies across taxa.
**From SB paper: 'Because the active use of ontologies is relatively new to evolutionary biology in general and systematics in particular, the biological research project currently driving development of TAO is the  <B>phenotype annotation </B> of the systematic literature for teleost fishes by the Phenoscape project.'
 
  
**From SB paper: 'For example, a character state might be “Antorbital, triangular.” This phenotype can be expressed using the TAO term antorbital and the PATO term triangular. Such  <B>phenotype annotations </B> corresponding to systematic characters require the association of the anatomical term with a taxonomic name. Thus, TAO terms are associated with species or higher taxa through annotations to teleost scientific names from the Teleost Taxonomy On- tology (www.phenoscape.org; OBO CVS repository: http://obofoundry.org).'
+
; Character state: Variant of a character; assigned a code in a phylogenetic analysis.
  
**From Phenex paper:'Here, we address this problem at its root by development of a configurable software tool that employs standard ontologies and syntax to create computable  <B>phenotype annotations </B>.'
+
; Composite character state: A character state with more than one phenotype.
  
**'Specialized software has been developed to assist human curators in  <B>annotating </B> the phenotypes of mutant genotypes using EQ syntax...'
+
; Phenotyped character: A character with one or more states assigned to a phenotype.
  
'''Mutant phenotype''':  <font color=red>''- used in SB paper, not defined''</font>
+
=== Phenotype annotations ===
'''Zebrafish phenotype annotations''': - used currently in Phenoscape paper to mean 'Zebrafish phenotypes' or Zebrafish EQ statements'
 
** E.g.: 'Zebrafish phenotype annotations recorded in the ZFIN database (21,476 as of 13 Jan 2011) are associated with genes (3,865) that are mutated or knocked down using morpholinos'
 
  
'''Phenotype-taxon assertion (=Taxon-Phenotype annotations?)''': Phenotype that is assigned to a taxon <font color=red>''- not used in Phenex, TAO, or Curation papers, however, 'Taxon-Phenotype annotation used in Phenex paper''</font>  'From this knowledgebase, similar  <B>taxon–phenotype annotations </B> can be easily discovered by searching higher level anatomical or quality terms.'
+
; Phenotype annotation: (synonym: phenotype assertion) The connection of a phenotype to a taxon or gene.
  
'''Distinct phenotype-taxon assertions''': phenotypes assigned to taxa that are uniqueFor example, the same taxon might be assigned the same phenotype (EQ) in different papers or the same taxon may be assigned the same phenotype (EQ) for alternative character states (when annotating at high level of PATO granularity, e.g.). Thus the number of distinct phenotype assertions is lower than the number of phenotype assertions. <font color=red>''- not used''</font>
+
; Phenotype profile: The union of phenotypes that are asserted for a particular gene or a particular taxonWashington et al. (2009) defined a phenotypic profile or gene phenotype profile as 'the sum-total of the EQ descriptions for an individual genotype'.
  
 +
; Gene phenotype: A phenotype that has been used in a gene phenotype annotation.
  
'''Attribute''': Higher level PATO quality term used to group "value qualities". NOTE: In current Phenoscape paper we define as 'Attributes are relatively high-level nodes in the PATO quality ontology, such as shape, position, etc.'
+
; Gene phenotype annotation: EQ statement associated with a gene.
  
'''Value qualities''': Nodes at a lower level than 'Attribute' in the PATO hierarchy
+
; Taxon phenotype: A phenotype that has been used in a taxon phenotype annotation.  A taxon phenotype may correspond to part or whole of a described character state (i.e. a character state may be decomposed into multiple phenotypes).
  
'''Phenotype Group''': combination of an attribute with some entity, such as basihyal bone shape, tooth count, etc.
+
; Taxon phenotype annotation: EQ statement associated with a taxon.
  
= Taxa: =
+
== Taxa, Taxon Names, and Taxon Summaries ==
  
'''Total Publication Names''': Count of the total number of taxa used in the publications that have annotations. Includes repeated uses of the same publication name in different publications.
+
; Publication (Taxon) Name: The taxon name used in a publication from which we have curated data.
[*side note not for report: should be around 3,400 for 47 publications]
 
  
'''Distinct Publication Names''': Count of the unique number of publication taxa used in the publications that have annotations.  The same publication name used in different publications is counted only once.
+
; Total (Number of) Publication Taxon Names: Count of the total number of taxon names used in the publications that have annotations.  The same publication taxon name used in different publications is counted multiple times.
  
'''Distinct Valid Taxon Names''': Count of unique Valid Taxon names appearing in publications. The same valid taxon name used in different publications is counted only once.  This includes 'publication-specific' names, meaning those referring to Author and Year in parentheses.
+
; Distinct (Number of) Publication Taxon Names: Count of the unique number of taxon names used in the publications that have annotations. The same taxon name used in different publications is counted only once.
  
'''Total Mismatches''': Count of the total number of mismatches between Total Publication Names and Valid Taxon names. Each mismatch, including the same mismatch in different publications, is counted for the total. Also, do not count any 'publication-specific' names, meaning those refering to Author and Year in parentheses.
+
; Incompletely Identified Taxon: A taxon that is identified in a publication less specific than genus and species. Typically, these are of the form "''Genus'' sp. (Author Year)", where (Author Year) cites the publication in which it appears.
  
'''Distinct Mismatches''': Count of the unique mismatches between publication names and valid taxon names. The same publication name used in different publications is counted only once. Also, do not count any 'publication-specific' names, meaning those referring to Author and Year in parentheses.
+
; Valid Taxon Name: The name of a taxon that is currently valid, according to an authoritative naming source (e.g., Catalog of Fishes) that is used in construction of the Taxonomy Ontology under consideration. This may or may not be the same as the Publication Taxon Name.
  
'''Total Publication-Specific Names''': Count of the unique Valid Taxon names that refer to the Author and Year in parentheses.  For example, 'Danio sp. (Smith 1992)' is counted as a publication-specific name. <font color=red>''- used in Curation paper''</font>
+
; Distinct (Number of) Valid Taxon Names: Count of unique Valid Taxon names appearing in publications. The same valid taxon name used in different publications is counted only once.  This includes incompletely identified taxa (which we cite to the publication in which it appears, such as ''Danio'' sp. (Smith 1992)).
  
 +
; Total (Number of) Mismatches: Count of the total number of mismatches between publication taxon names and their corresponding valid taxon names.  Each mismatch, including the same mismatch in different publications, is counted for the total. This count excludes incompletely identified taxa (such as ''Danio'' sp. in Smith 1992).
  
''' Character''':
+
; Distinct (Number of) Mismatches: Count of the unique mismatches between publication names and valid taxon names.  The same mismatche occurring in different publications is counted only once.  This count excludes incompletely identified taxa (such as ''Danio'' sp. in Smith 1992).
  
'''Character state''': variant of a Character, assigned a code in a phylogenetic analysis <font color=red>''- used''</font>
+
; Total (Number of) Publication-Specific Names: Count of the unique valid taxon names that are incompletely identified in (and thus specific to) a publication (such as ''Danio'' sp. in Smith 1992). <font color=red>''- used in Curation paper''</font>
 
 
'''Composite character''': character state with more than one phenotype. <font color=red>''- used in Curation paper, Phenex paper''</font>
 
 
 
'''Phenotyped character''': Character with one or more states assigned to a phenotype <font color=red>''- not used''</font>
 
  
 
[[Category:Curation]]
 
[[Category:Curation]]

Latest revision as of 16:19, 4 June 2013

Phenotypes

Phenotypes and Characters

Phenotype
An Entity-Quality (EQ) statement. Semantically, a phenotype is_a PATO:quality AND inheres_in some entity. In OWL, a phenotype is a class expression that is equivalent to the intersection of a PATO:quality with all things that inheres_in some entity (a property restriction).
Attribute
High-level PATO quality term, such as PATO:shape. Attribute terms are superclasses (direct or indirect) of lower-level PATO quality terms.
Character
Presumably homologous feature of an organism that varies across taxa.
Character state
Variant of a character; assigned a code in a phylogenetic analysis.
Composite character state
A character state with more than one phenotype.
Phenotyped character
A character with one or more states assigned to a phenotype.

Phenotype annotations

Phenotype annotation
(synonym: phenotype assertion) The connection of a phenotype to a taxon or gene.
Phenotype profile
The union of phenotypes that are asserted for a particular gene or a particular taxon. Washington et al. (2009) defined a phenotypic profile or gene phenotype profile as 'the sum-total of the EQ descriptions for an individual genotype'.
Gene phenotype
A phenotype that has been used in a gene phenotype annotation.
Gene phenotype annotation
EQ statement associated with a gene.
Taxon phenotype
A phenotype that has been used in a taxon phenotype annotation. A taxon phenotype may correspond to part or whole of a described character state (i.e. a character state may be decomposed into multiple phenotypes).
Taxon phenotype annotation
EQ statement associated with a taxon.

Taxa, Taxon Names, and Taxon Summaries

Publication (Taxon) Name
The taxon name used in a publication from which we have curated data.
Total (Number of) Publication Taxon Names
Count of the total number of taxon names used in the publications that have annotations. The same publication taxon name used in different publications is counted multiple times.
Distinct (Number of) Publication Taxon Names
Count of the unique number of taxon names used in the publications that have annotations. The same taxon name used in different publications is counted only once.
Incompletely Identified Taxon
A taxon that is identified in a publication less specific than genus and species. Typically, these are of the form "Genus sp. (Author Year)", where (Author Year) cites the publication in which it appears.
Valid Taxon Name
The name of a taxon that is currently valid, according to an authoritative naming source (e.g., Catalog of Fishes) that is used in construction of the Taxonomy Ontology under consideration. This may or may not be the same as the Publication Taxon Name.
Distinct (Number of) Valid Taxon Names
Count of unique Valid Taxon names appearing in publications. The same valid taxon name used in different publications is counted only once. This includes incompletely identified taxa (which we cite to the publication in which it appears, such as Danio sp. (Smith 1992)).
Total (Number of) Mismatches
Count of the total number of mismatches between publication taxon names and their corresponding valid taxon names. Each mismatch, including the same mismatch in different publications, is counted for the total. This count excludes incompletely identified taxa (such as Danio sp. in Smith 1992).
Distinct (Number of) Mismatches
Count of the unique mismatches between publication names and valid taxon names. The same mismatche occurring in different publications is counted only once. This count excludes incompletely identified taxa (such as Danio sp. in Smith 1992).
Total (Number of) Publication-Specific Names
Count of the unique valid taxon names that are incompletely identified in (and thus specific to) a publication (such as Danio sp. in Smith 1992). - used in Curation paper