Commit

added annotations and software entries
daniel-jettka committed Feb 1, 2024
1 parent 01cb747 commit 669e1d7
Showing 3 changed files with 38 additions and 14 deletions.
10 changes: 6 additions & 4 deletions data/JTEI/10_2016-19/jtei-10-haaf-source.xml
@@ -211,9 +211,10 @@
target="http://www.deutschestextarchiv.de/doku/software#cab"/></bibl>.</note> as
well as <ref target="http://www.deutschestextarchiv.de/dtaq/about">collaborative text
correction and annotation</ref><note rend="inside.parenthesis">See <bibl><title
- level="a">DTAQ: Kollaborative Qualitätssicherung im Deutschen Textarchiv</title>
- (Collaborative Quality Assurance within the DTA), accessed January 28, 2017, <ptr
- target="http://www.deutschestextarchiv.de/dtaq/about"/></bibl>. On the process of
+ level="a"><ptr type="software" xml:id="R3"
+ target="#dtaq"/><rs type="soft.name" ref="R3">DTAQ: Kollaborative Qualitätssicherung im Deutschen Textarchiv</rs></title>
+ (Collaborative Quality Assurance within the DTA), accessed January 28, 2017, <rs type="url" ref="#R3"><ptr
+ target="http://www.deutschestextarchiv.de/dtaq/about"/></rs></bibl>. On the process of
quality assurance in the DTA, see, for example, <ref target="#haaf13" type="bibl">Haaf,
Wiegand, and Geyken 2013</ref>.</note>) is a matter of supporting scholarly projects
in their usage of the DTA infrastructure, which is part of the DTA’s mission. Second,
@@ -273,7 +274,8 @@
Since June 2014, nine complete volumes with a total of more than 3,500 manuscript pages
have been manually transcribed, annotated in TEI XML, and published via the DTA
infrastructure. Most of these manuscripts were keyed manually by a vendor and published at
- an early stage in the web-based quality assurance platform DTAQ. There, the transcription
+ an early stage in the web-based quality assurance platform <ptr type="software" xml:id="R2"
+ target="#dtaq"/><rs type="soft.name" ref="#R2">DTAQ</rs>. There, the transcription
as well as the annotation of each document was checked and corrected, if necessary; DTAQ
also provided the means to add additional markup, such as the tagging of person names
(<gi>persName</gi>), directly at page level. After the process of quality control has
24 changes: 14 additions & 10 deletions data/JTEI/10_2016-19/jtei-10-romary-source.xml
@@ -645,8 +645,8 @@
available at <ptr target="https://github.com/TEIC/TEI/issues/1512"/>. In our proposal,
the <gi>etym</gi> element has to be made recursive in order to allow the fine-grained
representations we propose here. The corresponding ODD customization, together with
- reference examples, is available on <ptr type="software" xml:id="GitHub"
- target="#GitHub"/><rs type="soft.name" ref="#GitHub">GitHub</rs>.</note> and the
+ reference examples, is available on <ptr type="software" xml:id="R1"
+ target="#GitHub"/><rs type="soft.name" ref="#R1">GitHub</rs>.</note> and the
fact that a change occurred within the contemporary lexicon (as opposed to its parent
language) is indicated by means of <att>xml:lang</att> on the source form.<note>There
may also be cases in which it is unknown whether a given etymological process occurred
@@ -768,8 +768,8 @@
text.<note>The interested reader may ponder here the possibility to also encode
scripts by means of the <att>notation</att> attribute instead of using a cluttering of
language subtags on <att>xml:lang</att>. For more on this issue, see the proposal in
- the TEI <ptr type="software" xml:id="GitHub" target="#GitHub"/><rs type="soft.name"
- ref="#GitHub">GitHub</rs> (<ptr target="https://github.com/TEIC/TEI/issues/1510"
+ the TEI <ptr type="software" xml:id="R2" target="#GitHub"/><rs type="soft.name"
+ ref="#R2">GitHub</rs> (<ptr target="https://github.com/TEIC/TEI/issues/1510"
/>).</note> This is why we have extended the <att>notation</att> attribute to
<gi>orth</gi> in order to allow for better representation of both language
identification and the orthographic content. With this double mechanism, we intend to
@@ -987,7 +987,7 @@
<p>The <gi>date</gi><note>The element <gi>date</gi> as a child of <gi>cit</gi> is another
example which does not adhere to the current TEI standards. We have allowed this
within our ODD document. A feature request proposal will be made on the <ptr
- type="software" xml:id="GitHub" target="#GitHub"/><rs type="soft.name" ref="#GitHub"
+ type="software" xml:id="R3" target="#GitHub"/><rs type="soft.name" ref="#R3"
>GitHub</rs> page and this feature may or may not appear in future versions of the
TEI Guidelines.</note> element is listed within each etymon block; the values of
attributes <att>notBefore</att> and <att>notAfter</att> specify the range of time
@@ -1486,8 +1486,10 @@
extent of knowledge that is truly necessary to create an accurate model of metaphorical
processes. In order to do this, it is necessary to make use of one or more ontologies,
which could be locally defined within a project, and of external linked open data sources
- such as <ref target="http://wiki.dbpedia.org/">DBpedia</ref> and <ref
- target="https://www.wikidata.org/">Wikidata</ref>, or some combination thereof. Within
+ such as <ptr type="software" xml:id="R4"
+ target="#dbpedia"/><rs type="soft.name url" ref="#R4"><ref target="http://wiki.dbpedia.org/">DBpedia</ref></rs> and <ptr type="software" xml:id="R5"
+ target="wikidata"/><rs type="soft.name url" ref="#R5"><ref
+ target="https://www.wikidata.org/">Wikidata</ref></rs>, or some combination thereof. Within
TEI dictionary markup, URIs for existing ontological entries can be referenced in the
<gi>sense</gi>, <gi>usg</gi>, and <gi>ref</gi> elements as the value of the attribute
<att>corresp</att>.</p>
@@ -1496,7 +1498,8 @@
reference to the source entry’s unique identifier (if such an entry exists within the
dataset). In such cases, the etymon pointing to the source entry can be assumed to inherit
the source’s domain and sense information, and this information can be automatically
- extracted with a fairly simple XSLT program; thus the encoders may choose to leave some or
+ extracted with a fairly simple <ptr type="software" xml:id="R6"
+ target="#XSLT"/><rs type="soft.name" ref="#R6">XSLT</rs> program; thus the encoders may choose to leave some or
all of this information out of the etymon section. However, in the case that the dataset
does not actually have entries for the source terms, or the encoder wants to be explicit
in all aspects of the etymology, as mentioned above, the source domain and the
@@ -1556,7 +1559,8 @@
type="metonymy"</tag>) and the etymon (<tag>cit type="etymon"</tag>) the source term’s
URI is referenced in <gi>oRef</gi> and <gi>pRef</gi> as the value of <att>corresp</att>
(<code>@corresp="#animal"</code>).</p>
- <p>In <gi>sense</gi>, the URI corresponding to the DBpedia entry for <q>horse</q> is the
+ <p>In <gi>sense</gi>, the URI corresponding to the <ptr type="software" xml:id="R7"
+ target="#dbpedia"/><rs type="soft.name" ref="#R7">DBpedia</rs> entry for <q>horse</q> is the
value for the attribute <att>corresp</att>. Additionally, the <tag>date
notBefore="…"</tag> element–attribute pairing is used to specify that the term has only
been used for the <q>horse</q> since 1517 at maximum (corresponding to the first Spanish
@@ -2486,7 +2490,7 @@
<p>For the issues regarded as the most fundamentally important to creating a dynamic and
sustainable model for both etymology and general lexicographic markup in TEI, we have
submitted formal requests for changes to the TEI <ptr type="software" xml:id="GitHub"
- target="#GitHub"/><rs type="soft.name" ref="#GitHub">GitHub</rs>, and will continue to
+ target="#R8"/><rs type="soft.name" ref="#R8">GitHub</rs>, and will continue to
submit change requests as needed. While this work represents a large step in the right
direction for those looking for means of representing etymological information, there are
still a number of unresolved issues that will need to be addressed. These remaining issues
18 changes: 18 additions & 0 deletions taxonomy/software-list.xml
@@ -1532,6 +1532,24 @@
and "born digital" writing.</ref>
<note type="category">research</note>
</item>
+ <item xml:id="dtaq">
+ <name>DTAQ: Kollaborative Qualitätssicherung im Deutschen Textarchiv</name>
+ <ref type="URL">http://www.deutschestextarchiv.de/dtaq/about</ref>
+ <ref type="description"/>
+ <note type="category">research</note>
+ </item>
+ <item xml:id="dbpedia">
+ <name>DBpedia</name>
+ <ref type="URL">http://wiki.dbpedia.org/</ref>
+ <ref type="description"/>
+ <note type="category">research</note>
+ </item>
+ <item xml:id="wikidata">
+ <name>Wikidata</name>
+ <ref type="URL">https://www.wikidata.org/</ref>
+ <ref type="description"/>
+ <note type="category">research</note>
+ </item>
</list>
</body>
</text>