La Bruzzo S. F., Artini M., Atzori C., Bardi A., Baglioni M., De Bonis M., Mannocci A., Manghi P., Pavone G.
Deduplication Research Graph OpenAiRe
The OpenAIRE aggregation workflow can collect metadata records from different providers about the same scholarly work. Each metadata record can carry different information because, for example, some providers are not aware of links to projects, keywords, or other details. Another typical case is when OpenAIRE collects one metadata record from a repository about a pre-print and another from a journal about the published article. To provide correct statistics, OpenAIRE must identify those cases and "merge" the two metadata records so that the scholarly work is counted only once in the statistics OpenAIRE produces. This technical Report describes the Deduplication workflow and technique adopted to deduplicate the OpenAIRE Graph.
Source: ISTI Technical Report, ISTI-2022-TR/032, 2022
@techreport{oai:it.cnr:prodotti:478873, title = {OpenAIRE Research Graph deduplication workflow}, author = {La Bruzzo S. F. and Artini M. and Atzori C. and Bardi A. and Baglioni M. and De Bonis M. and Mannocci A. and Manghi P. and Pavone G.}, doi = {10.32079/isti-tr-2022/032}, institution = {ISTI Technical Report, ISTI-2022-TR/032, 2022}, year = {2022} }
Artini, Michele
0000-0002-4406-428X
Atzori, Claudio
0000-0001-9613-6639
Bardi, Alessia
0000-0002-1112-1292
De Bonis, Michele
0000-0003-2347-6012
La Bruzzo, Sandro Fabrizio
0000-0003-2855-1245
Manghi, Paolo
0000-0001-7291-3210
Mannocci, Andrea
0000-0002-5193-7851
Pavone, Gina
0000-0003-0087-2151
OpenAIRE-Connect
OpenAIRE - CONNECTing scientific results in support of Open Science
OpenAIRE Nexus
OpenAIRE-Nexus Scholarly Communication Services for EOSC users