Document Type

Conference Presentation


Religious Studies

Conference Title

Proceedings of LaTeCH 2018 – The 11th SIGHUM Workshop at COLING2018


International Conference on Computational Linguistics (COLING 2018)


Santa Fe, NM

Conference Dates

August 20-26, 2018

Date of Presentation



We describe a new project that publishes a freely available online dictionary for Coptic. The dictionary offers comprehensive cross-referencing mechanisms: links from entries to an online scanned edition of Crum’s Coptic Dictionary, internal cross-references and etymological information, searchable definitions translated into English, French and German, and linked corpus data that provide frequencies and corpus look-up for headwords and multiword expressions. Headwords are available for linking from external projects through a REST API. We describe the challenges of encoding the dictionary in TEI XML and of implementing the linking mechanisms behind a Web interface that queries frequency information, drawing on NLP tools to recognize inflected forms in context. We evaluate the dictionary’s coverage using digital corpora of Coptic available online.
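
To make the TEI XML encoding mentioned in the abstract more concrete, the following is a minimal sketch of how a dictionary entry with trilingual definitions might be read programmatically. The element layout (`entry`, `form`/`orth`, `sense`/`cit`/`quote`), the `xml:id` value, the sample content, and the function name `parse_entry` are all assumptions made for illustration; they follow common TEI dictionary conventions and are not taken from the project's actual schema.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"
NS = {"tei": TEI_NS}

# Hypothetical entry: structure and content are illustrative only,
# not copied from the dictionary described in the abstract.
SAMPLE = """<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="demo1">
  <form type="lemma"><orth>ⲣⲱⲙⲉ</orth></form>
  <sense>
    <cit type="translation" xml:lang="en"><quote>man, person</quote></cit>
    <cit type="translation" xml:lang="fr"><quote>homme, personne</quote></cit>
    <cit type="translation" xml:lang="de"><quote>Mensch, Person</quote></cit>
  </sense>
</entry>"""


def parse_entry(xml_text):
    """Return the id, headword and per-language definitions of one entry."""
    root = ET.fromstring(xml_text)
    orth = root.find("tei:form/tei:orth", NS)
    definitions = {}
    # Collect one definition per language code found on xml:lang.
    for cit in root.findall("tei:sense/tei:cit[@type='translation']", NS):
        lang = cit.get(f"{{{XML_NS}}}lang")
        quote = cit.find("tei:quote", NS)
        if lang and quote is not None and quote.text:
            definitions[lang] = quote.text.strip()
    return {
        "id": root.get(f"{{{XML_NS}}}id"),
        "headword": orth.text if orth is not None else None,
        "definitions": definitions,
    }


if __name__ == "__main__":
    entry = parse_entry(SAMPLE)
    print(entry["id"], entry["headword"])
    for lang, text in sorted(entry["definitions"].items()):
        print(lang, text)
```

Keying definitions by language code is one simple way to support searching in English, French and German from a single entry; a production system would more likely index these fields in a database than parse XML on each query.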