Ilfhocail

An automatically generated lexicon of multiword expressions in Irish, collected from a number of Irish lexical resources.

JSON file

The JSON file contains a large collection of 17,592 multiword expressions (MWEs), automatically extracted from Líonra Séimeantach na Gaeilge (The Irish Language Semantic Network), Gluais Tí Pota Focal, and An Sruth.

Part-of-speech (POS) information and English translations are included wherever the source lexical resource provides them. Broad POS tags were added using UDPipe.
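As a rough illustration of working with the data, the sketch below parses the JSON text and counts entries per POS tag. The schema is an assumption, not something this README documents: the code assumes the top level is a list of objects and that each object may carry a `pos` key. Both the key name and the function names are hypothetical and should be adjusted to match the actual file.

```python
import json
from collections import Counter


def parse_entries(json_text):
    """Parse the lexicon from JSON text.

    Assumption: the top-level value is a list of entry objects.
    """
    data = json.loads(json_text)
    if not isinstance(data, list):
        raise ValueError("expected a JSON list of entries; adjust for the real schema")
    return data


def count_by_pos(entries, pos_key="pos"):
    """Count entries per POS tag.

    `pos_key` is a hypothetical field name; entries without it are
    counted under "UNKNOWN".
    """
    return Counter(entry.get(pos_key, "UNKNOWN") for entry in entries)
```

To use it, read the file as UTF-8 text (for example with `pathlib.Path(...).read_text(encoding="utf-8")`), pass the text to `parse_entries`, and pass the result to `count_by_pos`.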

This resource is useful as training data, as part of a pre-processing pipeline for tasks such as parsing, and as a database for linguistic research.
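One way the lexicon could feed a pre-processing step is to mark known MWEs in tokenised text before parsing. The following is a minimal sketch of greedy longest-match lookup, not a method shipped with this repository. It assumes the MWEs are available as plain strings, compares lowercased surface forms only, and ignores Irish initial mutations and inflection, so it will miss many real occurrences. The function names and the example phrases are illustrative and not taken from the data file.

```python
def build_index(mwes):
    """Map each first token to the MWEs (as token tuples) that start with it.

    Candidates are sorted longest first so the matcher prefers longer MWEs.
    """
    index = {}
    for mwe in mwes:
        tokens = tuple(mwe.lower().split())
        if len(tokens) > 1:  # single words are not multiword expressions
            index.setdefault(tokens[0], []).append(tokens)
    for candidates in index.values():
        candidates.sort(key=len, reverse=True)
    return index


def mark_mwes(tokens, index):
    """Greedy left-to-right longest match over a list of tokens.

    Returns (start, end, text) spans, with `end` exclusive.
    """
    lowered = [t.lower() for t in tokens]
    spans = []
    i = 0
    while i < len(lowered):
        for cand in index.get(lowered[i], []):
            if tuple(lowered[i:i + len(cand)]) == cand:
                spans.append((i, i + len(cand), " ".join(tokens[i:i + len(cand)])))
                i += len(cand)
                break
        else:
            # No MWE starts at this token; move on.
            i += 1
    return spans


# Illustrative phrases only; not claimed to be entries in the lexicon.
index = build_index(["go raibh maith agat", "ar ais"])
spans = mark_mwes("Go raibh maith agat , tháinig sé ar ais".split(), index)
```

A parser front end could then merge each span into a single token or attach it as an annotation, depending on what the downstream tool expects.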

Links to resources used

Líonra Séimeantach na Gaeilge: Created by Prof Kevin Scannell

Gluais Tí Pota Focal: Created by Michal Měchura

An Sruth: Created by Rody Gorman