Jump to content

Multilingual NER Toolkit

From NeoWiki Demo
Revision as of 14:24, 11 May 2026 by NeoWiki (talk | contribs) (Importing NeoWiki demo data)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The Multilingual NER Toolkit is an open-source named-entity recognition project tuned for historical European languages. Started in 2022, it ships models and tooling that handle the spelling variation, archaic vocabulary, and code-switching common in pre-modern texts.

Led by Sorbonne University, it serves as a foundation for downstream work on entity disambiguation and linking across cultural heritage corpora.