-
Notifications
You must be signed in to change notification settings - Fork 30
Using uberon for text mining
Authors and contributors:
- Chris Mungall (author)
Date: 2012
Document Type: ontology_usage_article
This article describes how to use uberon synonymy metadata for text mining
Uberon uses the standard 4 OBO synonym scopes:
- EXACT
- BROAD
- NARROW
- RELATED
The standard obo2owl mapping is used here consult the (obo spec)[http://oboformat.org] for details; currently the following annotation properties are used:
- hasExactSynonym
- hasBroadSynonym
- hasNarrowSynonym
- hasRelatedSynonym
The uberon build pipeline ensures that no two classes share the same string as either a label or exact synonym. This helps detect common categories of errors.
Languages are indicated by a lang tag, e.g. '@fr'. Note that this is not yet currently translated in the owl correctly.
The exception is latin, for which a LATIN synonym type is used
The ontology contains a growing list of synonym types or tags, which may be useful for text mining. See the ontology for a full list.
- ABBREVIATION - Acronym or abbreviation
- LATIN - Typically the TA preferred term
- DUBIOUS - the synonym may be contested or midleading
- DEPRECATED - a historic synonym that may be used in older texts but discouraged in modern usage
- SENSU - a term typically used within a certain taxonomic scope
- ...
We aim to eventually have provenance for all synonyms. Currently most of these are xrefs to species anatomy ontologies, but in future more will be PMIDs etc.
In some cases an NCBITaxon ID is used as synonym provenance. This indicates when a term is preferred or used within a particular taxonomic context.
We include a has_relational_adjective annotation property to indicate
what the adjectival form of the noun that describes the structure
is. For example, 'hippocampal' for Ammon's horn
.
In the Uberon bridging axiom ontologies the 'OBO Foundry unique label' property is used to provide a label that is intended to be unique across the whole OBO Foundry. The unique labels are generated automatically be suffixing the ontology-provided label with a qualifying term.
For example, the FMA class for 'heart' has the OBO Foundry unique label 'heart (canonical adult human)' to disambiguate it from 'heart (adult mouse)' in MA or the embryonic heart as represented in EHDAA2.
Uberon is a multi-species anatomy ontology and knowledge base, find out more on the home page