The work here is based on the corpora:
- abstracts from the journal TAC (Theory and Applications of Categories), from 1995 up to around Dec 2020. We have 755 abstracts. This corpus is described in https://github.com/ToposInstitute/tac-corpus/.
- sentences extracted from the ProofNet project (https://arxiv.org/abs/2302.12433)
- sentences from nLab (https://github.com/ToposInstitute/nlab-corpus and https://github.com/ToposInstitute/nLab2024-corpus)
- sentences from the Chicago notes (kept in a different repository)
For TAC: the file tac-clean.conll contains an automatically annotated version of the corpus, with dependency structures and POS tags, obtained via spacy. The file tac-manual.conllu has an improved version, with manual cleaning of latex, to be described.
The development version of the glossary was on the Definitions in Math (DefsMath) project page.
The project changed to MathGloss and can be seen at https://mathgloss.github.io/MathGloss/
Choosing some free open source books and journals for basic Math:
- https://math.dartmouth.edu/~prob/prob/prob.pdf (Baez rec)
- https://journals.uwyo.edu/index.php/ela/issue/view/271 open source journal for LinAlg?
- AIM textbooks https://textbooks.aimath.org/textbooks/approved-textbooks/
Category Theory Books:
-
Robert Goldblatt’s Topoi (originally published 1979, now a Dover pbk) a particularly helpful entry-point.
-
a step up, Tom Leinster’s short Basic Category Theory (CUP 2014) is indeed basic and is rightly very well-regarded.
-
Emily Riehl’s Category Theory in Context (Dover 2017)
-
Introducing Category Theory, Peter Smith https://www.logicmatters.net/resources/pdfs/SmithCat.pdf
-
Many lecture notes from https://www.logicmatters.net/categories/#lectures and https://www.logicmatters.net/categories/categories-lists/
particularly interesting https://math.berkeley.edu/~gbergman/245/3.2.pdf