2020
Report  Open Access

MedLatin1 and MedLatin2: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts

Corbara S., Moreo A., Sebastiani F., Tavoni M.

MedLatin, MedValla, Corpus, Dataset, Medieval Latin, Dante, Authorship Verification

We present and make available MedLatin1 and MedLatin2, two datasets of medieval Latin texts to be used in research on computational authorship analysis. MedLatin1 and MedLatin2 consist of 294 and 30 curated texts, respectively, labelled by author, with MedLatin1 texts being of an epistolary nature and MedLatin2 texts consisting of literary comments and treatises about various subjects. As such, these two datasets lend themselves to supporting research in authorship analysis tasks, such as authorship attribution, authorship verification, or same-author verification.

Source: Research report, 2020



BibTeX entry
@techreport{oai:it.cnr:prodotti:438795,
	title = {MedLatin1 and MedLatin2: Two Datasets for the Computational Authorship Analysis of Medieval Latin Texts},
	author = {Corbara S. and Moreo A. and Sebastiani F. and Tavoni M.},
	institution = {Research report, 2020},
	year = {2020}
}

Also available from arxiv.org (Open Access)