diff --git a/data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml b/data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml index 81f9370..ee3499a 100644 --- a/data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml +++ b/data/JTEI/13_2020-22/jtei-cc-ra-parisse-182-source.xml @@ -92,19 +92,32 @@ collect and transcribe spoken language resources, their number is limited and thus corpora need to be interoperable and reusable in order to improve research on themes such as phonology, prosody, interaction, syntax, and textometry. To help researchers reach this - goal, CORLI has designed a pair of tools: TEICORPO to assist in the conversion and use of - spoken language corpora, and TEIMETA for metadata purposes. TEICORPO is based on the - principle of an underlying common format, namely TEI XML as described in its specification - for spoken language use (ISO 2016). This tool enables the conversion of transcriptions - created with alignment software such as CLAN, Transcriber, Praat, or ELAN as well as - common file formats (CSV, XLSX, TXT, or DOCX) and the TEI format, which plays the role of - a lossless pivot format. Backward conversion is possible in many cases, with limitations - inherent in the destination target format. TEICORPO can run the Treetagger part-of-speech - tagger and the Stanford CoreNLP tools on TEI files and can export the resulting files to - textometric tools such as TXM, Le Trameur, or Iramuteq, making it suitable for - spoken language corpora editing as well as for various research purposes.

+ goal, CORLI has designed a pair of tools: TEICORPO to assist in the + conversion and use of spoken language corpora, and TEIMETA for metadata purposes. + TEICORPO is based on the principle of an underlying common format, namely TEI XML + as described in its specification for spoken language use (ISO 2016). This tool enables + the conversion of transcriptions created with alignment software such as CLAN, + Transcriber, Praat, or ELAN as well as common file formats + (CSV, XLSX, TXT, or DOCX) and the TEI format, which plays the role of a lossless pivot + format. Backward conversion is possible in many cases, with limitations inherent in the + destination target format. TEICORPO can run the Treetagger part-of-speech + tagger and the Stanford CoreNLP tools on TEI files and can export the + resulting files to textometric tools such as TXM, Le Trameur, or + Iramuteq, making it suitable for spoken language corpora editing as well as for + various research purposes.

@@ -170,7 +183,8 @@ limited coverage, even if the corpora involved are very large.

- The TEICORPO Approach + The TEICORPO Approach

The goal of the CORLI consortium is to make it easier to deposit, share, and reuse data. With this goal in mind, CORLI has always promoted the use of open public repositories and open formats. Our policy is to advocate for the use of a common single @@ -1427,8 +1441,8 @@ CLARIN. In Selected Papers from the CLARIN Annual Conference 2016, edited by Lars Borin, 113–30. Linköping Electronic Conference Proceedings 136. Linköping, Sweden: LiU Electronic Press. ; . + target="https://ep.liu.se/ecp/article.asp?issue=136&article=009&volume=0"/>; + . Schmidt, Thomas, and Wilfried Schütte. 2010. FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction. In Proceedings of