2022
Report  Open Access

OpenAIRE Research Graph deduplication workflow

La Bruzzo S. F., Artini M., Atzori C., Bardi A., Baglioni M., De Bonis M., Mannocci A., Manghi P., Pavone G.

Deduplication  Research Graph  OpenAiRe 

The OpenAIRE aggregation workflow can collect metadata records from different providers about the same scholarly work. Each metadata record can carry different information because, for example, some providers are not aware of links to projects, keywords, or other details. Another typical case is when OpenAIRE collects one metadata record from a repository about a pre-print and another from a journal about the published article. To provide correct statistics, OpenAIRE must identify those cases and "merge" the two metadata records so that the scholarly work is counted only once in the statistics OpenAIRE produces. This technical Report describes the Deduplication workflow and technique adopted to deduplicate the OpenAIRE Graph.

Source: ISTI Technical Report, ISTI-2022-TR/032, 2022


Metrics



Back to previous page
BibTeX entry
@techreport{oai:it.cnr:prodotti:478873,
	title = {OpenAIRE  Research Graph deduplication workflow},
	author = {La Bruzzo S. F. and Artini M. and Atzori C. and Bardi A. and Baglioni M. and De Bonis M. and Mannocci A. and Manghi P. and Pavone G.},
	doi = {10.32079/isti-tr-2022/032},
	institution = {ISTI Technical Report, ISTI-2022-TR/032, 2022},
	year = {2022}
}

OpenAIRE-Connect
OpenAIRE - CONNECTing scientific results in support of Open Science

OpenAIRE Nexus
OpenAIRE-Nexus Scholarly Communication Services for EOSC users


OpenAIRE