2020
Other  Open Access

MedLatin1 and MedLatin2: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts

Corbara S, Moreo A, Sebastiani F, Tavoni M

MedLatin  MedValla  Corpus  Dataset  Medieval Latin  Dante  Authorship Verification 

We present and make available MedLatin1 and MedLatin2, two datasets of medieval Latin texts to be used in research on computational authorship analysis. MedLatin1 and MedLatin2 consist of 294 and 30 curated texts, respectively, labelled by author, with MedLatin1 texts being of an epistolary nature and MedLatin2 texts consisting of literary comments and treatises about various subjects. As such, these two datasets lend themselves to supporting research in authorship analysis tasks, such as authorship attribution, authorship verification, or same-author verification.



Back to previous page
BibTeX entry
@misc{oai:it.cnr:prodotti:438795,
	title = {MedLatin1 and MedLatin2: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts},
	author = {Corbara S and Moreo A and Sebastiani F and Tavoni M},
	year = {2020}
}