Skip to content

gboye/SWIM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#SWIM A tentative implementation of Small World Inflectional Morphology

##Preparing ###BDL-SyncLexiques

  • Prepare a bdlexique with phonological transcriptions and frequencies
  • Synchronize frequencies from Lexique.org
  • Remove duplicates
  • Transform frequencies into probabilities
  • Save the lexicon in CSV form

###BDL-GoldParadigm

  • Prepare the GOLD paradigms
  • Make complete paradigms in CSV form
    • with overabundance
    • with defectiveness

###MGC-GoldRules

  • Calculate the analogies and the classifications based on the GOLD paradigms
    • with all phonological maps
    • store in pandas form

##Training

  • Get a training sample
  • Choose a sample size
  • Make the corresponding paradigm table
  • Beware of orthographical and L23 duplicates
  • Allow for overabundance
  • Neutralize symbols according to a map
  • Index training samples with a serial and a map reference
  • Calculate the analogies and the classifications based on the training sample

##Testing

  • Calculate the complete paradigms for the verbs in the training sample
  • Make a test sample for verbs excluded from the training sample
  • Calculate the complete paradigms for the verbs in the test sample

##Evaluating

  • Identify the patterns that were not obtained in the training sample
  • Measure the distance between the patterns obtained from the training sample and the complete paradigms
  • Measure the distance between the classification obtained from the training sample and the complete paradigms

About

Small World Inflectional Morphology

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published