La Bruzzo S. F., Artini M., Atzori C., Bardi A., Baglioni M., De Bonis M., Mannocci A., Manghi P., Pavone G.
Deduplication Research Graph OpenAiRe
The OpenAIRE aggregation workflow can collect metadata records from different providers about the same scholarly work. Each metadata record can carry different information because, for example, some providers are not aware of links to projects, keywords, or other details. Another typical case is when OpenAIRE collects one metadata record from a repository about a pre-print and another from a journal about the published article. To provide correct statistics, OpenAIRE must identify those cases and "merge" the two metadata records so that the scholarly work is counted only once in the statistics OpenAIRE produces. This technical Report describes the Deduplication workflow and technique adopted to deduplicate the OpenAIRE Graph.
Source: ISTI Technical Report, ISTI-2022-TR/032, 2022
@techreport{oai:it.cnr:prodotti:478873, title = {OpenAIRE Research Graph deduplication workflow}, author = {La Bruzzo S. F. and Artini M. and Atzori C. and Bardi A. and Baglioni M. and De Bonis M. and Mannocci A. and Manghi P. and Pavone G.}, doi = {10.32079/isti-tr-2022/032}, institution = {ISTI Technical Report, ISTI-2022-TR/032, 2022}, year = {2022} }
Artini, Michele0000-0002-4406-428X
Atzori, Claudio0000-0001-9613-6639
Bardi, Alessia0000-0002-1112-1292
De Bonis, Michele0000-0003-2347-6012
La Bruzzo, Sandro Fabrizio0000-0003-2855-1245
Manghi, Paolo0000-0001-7291-3210
Mannocci, Andrea0000-0002-5193-7851
Pavone, Gina0000-0003-0087-2151
OpenAIRE-Connect
OpenAIRE - CONNECTing scientific results in support of Open Science
OpenAIRE Nexus
OpenAIRE-Nexus Scholarly Communication Services for EOSC users