Skip to content

Detailed syntax of information elements in the SIS

Piotr Banski edited this page Jul 11, 2023 · 9 revisions

(*** See an error, omission, obsolete information below? Let us know by opening a new issue report with one click. Thanks! ***)

This page provides more details on editing information elements in the SIS. Initially, as of July 2023, it restricts itself to info concerning the creation of data deposition format recommendations. If there is interest, the page can also eventually encompass info on preparing documentation on standards.

1. The easy way

First of all, if you choose to clone or fork the entire standards repository, editing XML information will be made easier thanks to the associated document grammars that provide some content completion or warn you about errors. That should work out of the box for any reasonably modern XML editor that recognises XML Schema and Schematron associations.

2. The target

Data deposition format recommendations are hiding in the directory /SIS/clarin/data/recommendations/

3. Digression: content

In the process of preparing format recommendations, some information is completely predefined: these are the data domain names and the recommendation levels. XML Schema supplies them in the form of drop-down selections; otherwise you're down to copy&paste, and in the crucial places, the SIS makes that easier by providing buttons that automatically copy names into the clipboard. That is true of domain names and also the data deposition formats that have been described in the SIS.

Sometimes, the format that a centre recommends (or discourages, etc.) will not (yet) be described by the SIS. A list of such formats, not having their own information pages but nevertheless mentioned by recommendations, is to be found in our Sanity Checker, at the top.

3. General centre/repository information

Use the element <info> for that. Note that that element may bear the @xml:lang attribute to indicate the language of the content. It is expected that, for example, Text+ centres are going to present at least some of their information in German (xml:lang="de"). Where the attribute is not present, its value is defaulted to "en" = English.

Clone this wiki locally