Skip to content

Commit

Permalink
Merge branch 'main' of https://github.com/CenterBLC/N1904
Browse files Browse the repository at this point in the history
  • Loading branch information
tonyjurg committed Oct 9, 2024
2 parents 9c9e600 + 63ec991 commit 741cc20
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion docs/textformats.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,8 @@ The relation between these features in relation to the surface text is shown in

<img src="features/images/details_surface_features.png" width="400" >

## Naming schema for text formating

The text formats in this Text-Fabric database are identified by unique names that reflect their actual formats. These names follow a structured naming schema, consisting of a string of keywords separated by hyphens (-).

```
Expand Down Expand Up @@ -71,4 +73,4 @@ fmt=text-unaccent-plain : Αρχη του ευαγγελιου Ιησου Χρ

## Character encoding

All Greek text in this Text-Fabric dataset is encoded in Unicode. However, there are specific aspects that may require attention when querying, particularly those involving polytonic accents and "pseudo-characters" like the iota subscript. For a detailed discussion on character encoding, please refer to the documentation [here](characterencoding.md#start).
All Greek text in this Text-Fabric dataset is encoded in Unicode. However, there are specific aspects that may require attention when querying, particularly those involving polytonic accents and "pseudo-characters" like the iota subscript. For a detailed discussion on character encoding, please refer to the documentation [here](characterencoding.md#start).

0 comments on commit 741cc20

Please sign in to comment.