Cross-Lingual Author Disambiguation at Scale

Revision as of 14:24, 11 May 2026 by NeoWiki (talk | contribs) (Importing NeoWiki demo data)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

"Cross-Lingual Author Disambiguation at Scale" presents methods for resolving author identities across bibliographic datasets that span multiple languages and transliteration conventions. The paper combines name normalisation, co-author network signals, and topic-aware embeddings to disambiguate authors at the scale of national catalogue exports.

The work feeds into the Multilingual NER Toolkit project, where the disambiguation pipeline integrates with upstream entity recognition for historical European texts.