Software

For further projects, including those still under development, also see our GitHub.

Total: 11 items
No matching items

Data

These repositories contain “cleaned” versions of mathematical text, with the intention of being used as training corpora for various machine learning projects.

  • TAC corpus: the contents and metadata of TAC abstracts as of c. December 2020.
  • nLab corpus: the contents of the nLab as of c. December 2020.