Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 667 Bytes

README.md

File metadata and controls

14 lines (10 loc) · 667 Bytes

gender-from-name

import gender
print gender.gender['LEIA'] # prints 'female'
print gender.gender['LUKE'] # prints 'male'

Just a simple dict primarily derived from census bureau data https://web.archive.org/web/20120420112950/http://www.census.gov/genealogy/names/names_files.html I fully recognize there is no right answer as to which gender a name is (or even what "gender" means), but this can still be useful for stats purposes. For most ambiguous names gender is the one which had more people that census thinks belong to that gender, the ones not listed like that are instead tuples of genders in descending order of likelihood.