82 result(s)
Page Size: 10, 20, 50
Export: bibtex, xml, json, csv
Order by:

CNR Author operator: and / or
more
Typology operator: and / or
Language operator: and / or
Date operator: and / or
more
Rights operator: and / or
2021 Dataset Unknown

OpenAIRE research graph: dumps for research communities and initiatives
Manghi P., Atzori C., Bardi A., Baglioni M., Schirrwagen J., Dimitropoulos H., La Bruzzo S., Foufoulas I., Lohden A., Backer A., Mannocci A., Horst M., Czerniak A., Kiatropoulou K., Kokogiannaki A., De Bonis M., Artini M., Ottonello E., Lempesis A., Ioannidis A., Summan F.
This dataset contains dumps of the OpenAIRE Research Graph containing metadata records relevant for the research communities and initiatives collaborating with OpenAIRE. Each dataset is a tar file containing gzip files with one json per line. Each json is compliant to the schema available at DOI: 10.5281/zenodo.3974226DOI: 10.5281/zenodo.3974604
Project(s): RISIS 2 via OpenAIRE, BE OPEN via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: CNR ExploRA


2021 Dataset Unknown

OpenAIRE Covid-19 publications, datasets, software and projects metadata
Bardi A., Kuchma I., Pavone G., Artini M., Atzori C., Backer A., Baglioni M., Czerniak A., De Bonis M., Dimitropoulos H., Foufoulas I., Horst M., Iatropoulou K., Jacewicz P., Kokogiannaki A., La Bruzzo S., Lazzeri E., Lohden A., Manghi P., Mannocci A., Manola N., Ottonello E., Schirrwagen J.
This dump provides access to the metadata records of publications, research data, software and projects that may be relevant to the Corona Virus Disease (COVID-19) fight. The dump contains records of the OpenAIRE COVID-19 Gateway (https://covid-19.openaire.eu/), identified via full-text mining and inference techniques applied to the OpenAIRE Research Graph (https://explore.openaire.eu/). The Graph is one of the largest Open Access collections of metadata records and links between publications, datasets, software, projects, funders, and organizations, aggregating 12,000+ scientific data sources world-wide, among which the Covid-19 data sources Zenodo COVID-19 Community, WHO (World Health Organization), BIP! FInder for COVID-19, Protein Data Bank, Dimensions, scienceOpen, and RSNA. The dump consists of a gzip file containing one json per line. Each json is compliant to the schema available at https://doi.org/10.5281/zenodo.3974226DOI: 10.5281/zenodo.3980490
Project(s): OpenAIRE-Advance via OpenAIRE

See at: CNR ExploRA


2020 Journal article Open Access OPEN

Entity deduplication in big data graphs for scholarly communication
Manghi P., Atzori C., De Bonis M., Bardi A.
Purpose: Several online services offer functionalities to access information from "big research graphs" (e.g. Google Scholar, OpenAIRE, Microsoft Academic Graph), which correlate scholarly/scientific communication entities such as publications, authors, datasets, organizations, projects, funders, etc. Depending on the target users, access can vary from search and browse content to the consumption of statistics for monitoring and provision of feedback. Such graphs are populated over time as aggregations of multiple sources and therefore suffer from major entity-duplication problems. Although deduplication of graphs is a known and actual problem, existing solutions are dedicated to specific scenarios, operate on flat collections, local topology-drive challenges and cannot therefore be re-used in other contexts. Design/methodology/approach: This work presents GDup, an integrated, scalable, general-purpose system that can be customized to address deduplication over arbitrary large information graphs. The paper presents its high-level architecture, its implementation as a service used within the OpenAIRE infrastructure system and reports numbers of real-case experiments. Findings: GDup provides the functionalities required to deliver a fully-fledged entity deduplication workflow over a generic input graph. The system offers out-of-the-box Ground Truth management, acquisition of feedback from data curators and algorithms for identifying and merging duplicates, to obtain an output disambiguated graph. Originality/value: To our knowledge GDup is the only system in the literature that offers an integrated and general-purpose solution for the deduplication graphs, while targeting big data scalability issues. GDup is today one of the key modules of the OpenAIRE infrastructure production system, which monitors Open Science trends on behalf of the European Commission, National funders and institutions.Source: Data technologies and applications 54 (2020): 409–435. doi:10.1108/DTA-09-2019-0163
DOI: 10.1108/dta-09-2019-0163
Project(s): OpenAIRE2020 via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access | www.emerald.com Open Access | Data Technologies and Applications Restricted | Data Technologies and Applications Restricted | Data Technologies and Applications Restricted | Data Technologies and Applications Restricted


2020 Report Open Access OPEN

SoBigData e- Infrastructure Common Facilities 1
Assante M., Bardi A., Fernandez E., Manzi A., Pagano P.
This deliverable reports the design principles and software architectures characterising the release and development of the SoBigData e-Infrastructure common facilities, namely the social mining computational engine, the online coding and workflow design frameworks, and the online science monitoring dashboard. This report is the first of two versions of the document, each of which describing the design associated with a specific version of the infrastructure to be made available at M12 (December 2020) and at the end of the third year (SoBigData e-Infrastructure common facilities 2, December 2022). Specifically, the deliverable focuses on the design principles and reference architectures included in the first release of the SoBigData e-Infrastructure common facilities at M12. The deliverable consists of six sections. Section 1 briefly introduces the role of this deliverable for the development and delivery of the SoBigData e-Infrastructure common facilities. Section 2 describes the SoBigData e-infrastructure logical architecture contextualising the common facilities and how they relate with the rest. Section 3, section 4 and section 5 document the first release of the e-Infrastructure common facilities included in this report and available at M12, reporting the design principles and reference architectures of the released solutions. Specifically, section 3 describes the social mining computational engine, Section 4 presents the online coding and workflow design frameworks - which includes the RStudio and the Jupyter Notebooks via JupyterHub - and Section 5 reports the online science monitoring dashboard. Finally, section 6 concludes the report illustrating the whole Release Management process and its components for continuous integration.Source: Project report, SoBigData, D9.4, 2020
Project(s): SoBigData-PlusPlus via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2020 Report Open Access OPEN

ARIADNEPlus - Initial report on JRA activities
Meghini C., Bardi A., Pagano P., Theodoridou M., Finn J.
This deliverable describes the progress of JRA activities (WP 12-16) carried out during the first year of the ARIADNEplus project, as requested by task 12.6. It offers an integrated overview of all the technological developments of the project in the first 18 months of work, pointing out any delay or relevant issues.Source: Project report, ARIADNEPlus, D12.1, 2020
Project(s): ARIADNEplus via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2020 Report Open Access OPEN

RISIS 2 - Opening of Open Data VRE
Assante M., Baglioni M., Bardi A., Mangiacrapa F., Pagano P.
This report is a short document accompanying the Deliverable "D4.2 Opening of Open Data VRE", which is of type: "Websites, patents filling, etc.". It describes the design of the software components involved in the Open Data VRE, deployed to serve the current needs of the RISIS community.Source: Project report, RISIS 2, D4.2, 2020
Project(s): RISIS 2 via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2020 Contribution to conference Open Access OPEN

Open Publishing Fest - Overcoming discoverbaility crisis: The OpenAIRE COVID-19 Gateway
Bardi A.
The OpenAIRE COVID-19 Gateway presented at the session "Overcoming the discoverability crisis" of the Open Publishing Fest 2020 hosted by Open Knowledge Maps and ReFigure.Source: Open Publishing Fest - Overcoming discoverability crisis, 27/05/2020
DOI: 10.5281/zenodo.3860218
DOI: 10.5281/zenodo.3860219
Project(s): OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access | zenodo.org Open Access


2020 Journal article Open Access OPEN

Data sources and persistent identifiers in the Open Science Research Graph of OpenAIRE
Schirrwagen J., Bardi A., Czerniak A., Loehden A., Rettberg N., Mertens M., Manghi P.
In this article, we give an overview of the data source typologies used in OpenAIRE and provide an outline on the role of persistent identifiers in the aggregation, curation and provision workflows that lead to the generation of the Research Graph in OpenAIRE.Source: International journal of digital curation 1 (2020). doi:10.2218/ijdc.v15i1.722
DOI: 10.2218/ijdc.v15i1.722
Project(s): OpenAIRE-Advance via OpenAIRE

See at: International Journal of Digital Curation Open Access | International Journal of Digital Curation Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | International Journal of Digital Curation Open Access | International Journal of Digital Curation Open Access


2019 Contribution to conference Open Access OPEN

Open Science as-a-service for research communities and content providers
Bardi A.
Open Science is a set of practices of science according to which research activities and the research products they generate should be openly available, under terms that enable their findability, accessibility, re-use and re-distribution. The main effects of the implementation of Open Science principles is to enable responsible, reproducible and transparently assessable research. For an effective implementation of Open Science principles, a behavioral change in interested stakeholders and new tools for publishing in the scholarly communication ecosystem are required. Open Science publishing calls for the publishing of all types of research artefacts, beyond scientific literature. Today, the scholarly communication ecosystem lacks of tools and research community practices on Open Science publishing. To fill this gap and support a smooth transition towards Open Science, the OpenAIRE initiative is offering two novel services for research communities and content providers (e.g., institutional repositories, data repositories). The final goal is to support the cultural and technological shift towards the Open Science paradigm, from which all the different stakeholders in the research domain and of the society at large can benefit.Source: International Conference on the Economics of Grids, Clouds, Systems, and Services, Pisa, Italy, 18-20 September 2018
DOI: 10.1007/978-3-030-13342-9_1
Project(s): OpenAIRE-Connect via OpenAIRE

See at: link.springer.com Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | ZENODO Open Access | zenodo.org Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | link.springer.com Restricted | link.springer.com Restricted | rd.springer.com Restricted


2019 Conference article Open Access OPEN

Foundations of a framework for peer-reviewing the research flow
Bardi A., Casarosa V., Manghi P.
Traditionally, peer-review focuses on the evaluation of scientific publications, literature products that describe the research process and its final results in natural language. The adoption of ICT technologies in support of science introduces new opportunities to support transparent evaluation, thanks to the possibility of sharing research products, even inputs, intermediate and negative results, repetition and reproduction of the research activities conducted in a digital laboratory. Such innovative shift also sets the condition for novel peer review methodologies, as well as scientific reward policies, where scientific results can be transparently and objectively assessed via machine-assisted processes. This paper presents the foundations of a framework for the representation of a peer-reviewable research flow for a given discipline of science. Such a framework may become the scaffolding enabling the development of tools for supporting ongoing peer review of research flows. Such tools could be "hooked", in real time, to the underlying digital laboratory, where scientists are carrying out their research flow, and they would abstract over the complexity of the research activity and offer user-friendly dashboards.Source: IRCDL 2019 - 15th Italian Research Conference on Digital Libraries, pp. 195–208, Pisa, Italy, 31 January - 01 February 2019
DOI: 10.1007/978-3-030-11226-4_16
DOI: 10.5281/zenodo.2554859
DOI: 10.5281/zenodo.2554858
DOI: 10.5281/zenodo.1493152
DOI: 10.5281/zenodo.1493151
Project(s): OpenUP via OpenAIRE

See at: ISTI Repository Open Access | ZENODO Open Access | zenodo.org Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | link.springer.com Restricted | link.springer.com Restricted | link.springer.com Restricted | CNR ExploRA Restricted | rd.springer.com Restricted


2019 Report Open Access OPEN

The OpenAIRE research graph: third-party publishing APIs
Atzori C., Baglioni M., Bardi A., Manghi P., La Bruzzo S., De Bonis M., Dell'Amico A., Artini M., Mannocci A., Ottonello E.
This work describes the specification of the OpenAIRE publishing APIs that support third-party services at publishing metadata about interlinked and packaged research products into the OpenAIRE Research Graph, in respect of the OpenAIRE interoperability guidelines (https://guidelines.openaire.eu). Research products generated by researchers using services of research infrastructures are today manually published by researchers in a repository external to their research infrastructure. This phase is often considered an extra burden, because researchers have to fill in metadata forms with information that is already available in the scope of the services they used. By using the OpenAIRE publishing APIs, services of research infrastructures can implement an on-demand publishing workflow for any type of research products to support their researchers at improving the FAIRness of their research products and relief them from the tedious step of finding a suitable repository and manually depositing the products in it.Source: ISTI Technical reports, 2019

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2019 Conference article Open Access OPEN

The OpenAIRE Research Community Dashboard: On Blending Scientific Workflows and Scientific Publishing
Baglioni M., Bardi A., Kokogiannaki A., Manghi P., Iatropoulou K., Principe P., Vieira A., Nielsen L. H., Dimitropoulos H., Foufoulas I., Manola N., Atzori C., La Bruzzo S., Lazzeri E., Artini M., De Bonis M., Dell'Amico A.
Despite the hype, the effective implementation of Open Science is hindered by several cultural and technical barriers. Researchers embraced digital science, use "digital laboratories" (e.g. research infrastructures, thematic services) to conduct their research and publish research data, but practices and tools are still far from achieving the expectations of transparency and reproducibility of Open Science. The places where science is performed and the places where science is published are still regarded as different realms. Publishing is still a post-experimental, tedious, manual process, too often limited to articles, in some contexts semantically linked to datasets, rarely to software, generally disregarding digital representations of experiments. In this work we present the OpenAIRE Research Community Dashboard (RCD), designed to overcome some of these barriers for a given research community, minimizing the technical efforts and without renouncing any of the community services or practices. The RCD flanks digital laboratories of research communities with scholarly communication tools for discovering and publishing interlinked scientific products such as literature, datasets, and software. The benefits of the RCD are show-cased by means of two real-case scenarios: the European Marine Science community and the European Plate Observing System (EPOS) research infrastructure.Source: 23rd International Conference on Theory and Practice of Digital Libraries, TPDL, pp. 56–69, Oslo, Norway, September 9-12, 2019
DOI: 10.1007/978-3-030-30760-8_5
DOI: 10.5281/zenodo.3467104
DOI: 10.5281/zenodo.3467103
Project(s): OpenAIRE-Connect via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | repositorium.sdum.uminho.pt Open Access | ZENODO Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | link.springer.com Restricted | link.springer.com Restricted | CNR ExploRA Restricted | rd.springer.com Restricted


2019 Dataset Unknown

OpenAIRE Research Graph Dump
Manghi P., Atzori C., Bardi A., Schirrwagen J., Dimitropoulos H., La Bruzzo S., Foufoulas I., Loehden A., Baecker A., Mannocci A., Horst M., Baglioni M., Czerniak A., Kiatropoulou K., Kokogiannaki A., De Bonis M., Artini M., Ottonello E., Lempesis A., Nielsen L. H., Ioannidis A., Bigarella C., Summan F.
The OpenAIRE Research Graph is one of the largest open scholarly record collections worldwide, key in fostering Open Science and establishing its practices in the daily research activities. Conceived as a public and transparent good, populated out of data sources trusted by scientists, the Graph aims at bringing discovery, monitoring, and assessment of science back in the hands of the scientific community. Imagine a vast collection of research products all linked together, contextualised and openly available. For the past ten years OpenAIRE has been working to gather this valuable record. OpenAIRE is pleased to announce the beta release of its Research Graph, a massive collection of metadata and links between scientific products such as articles, datasets, software, and other research products, entities like organisations, funders, funding streams, projects, communities, and data sources. As of today, the OpenAIRE Research Graph aggregates around 450Mi metadata records with links collecting from 10,000 data sources trusted by scientists, including repositories registered in OpenDOAR, Open Access journals registered in DOAJ, Crossref, Unpaywall, ORCID and Microsoft Academic Graph. After cleaning, deduplication, and fine-grained classification processes, they narrow down to ~100Mi publications, ~8Mi datasets, ~200K software research products, 8Mi other products linked together with semantic relations. More than 10Mi full-texts of Open Access publications are mined by algorithms to enrich metadata records with additional properties and links among research products, funders, projects, communities, and organizations. Thanks to the mining algorithm, the graph is completed with 480Mi semantic relations. The OpenAIRE Research graph is available via our BETA Explore Portal and you can download it from Zenodo.DOI: 10.5281/zenodo.3516918
Project(s): OpenAIRE-Advance via OpenAIRE

See at: CNR ExploRA


2019 Report Open Access OPEN

The OpenAIRE Research Graph Data Model
Manghi P., Bardi A., Atzori C., Baglioni M., Manola N., Schirrwagen J., Principe P.
The purpose of the European OpenAIRE infrastructure is to facilitate, foster, support, and monitor Open Science scholarly communication in Europe. The infrastructure has been operational for almost a decade and successful in linking people, ideas and resources in support of the free flow, access, sharing, and re-use of research outcomes. To this aim it offers dissemination and training on Open Access and Open Science, facilitates exchange of knowledge, and operates the technical services required to facilitate and monitor Open Science publishing trends and research impact across geographic and discipline boundaries. OpenAIRE services populate a research graph whose objects are scientific results, organizations, funders, communities, organizations, and data sources. In this article we describe the data model, inspired by several existing metadata standards.Source: ISTI Technical reports, 2019
DOI: 10.5281/zenodo.2643199
DOI: 10.5281/zenodo.2643198
Project(s): OPENAIREPLUS via OpenAIRE, OPENAIRE via OpenAIRE, OpenAIRE2020 via OpenAIRE, OpenAIRE-Connect via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access | ZENODO Open Access


2019 Report Restricted

PARTHENOS - Deliverable 6.6 - PARTHENOS Cloud Infrastructure
Pagano P., Assante M., Frosini L., Manghi P., Bardi A., Sinibaldi F., Cirillo R., Panichi G.
"D6.6 PARTHENOS cloud infrastructure" is the revised and final version of "D6.1 PARTHENOS cloud infrastructure". This deliverable reports the PARTHENOS e-infrastructure architecture: the hardware and the services. Hardware is organized as a dynamic cloud of virtual machines, supporting computation and storage, while the services are organized into e-infrastructure middleware, storage, and end user services.Source: Project report, PARTHENOS, Deliverable D6.6, pp.1–63, 2019
Project(s): PARTHENOS via OpenAIRE

See at: data.d4science.net Restricted | CNR ExploRA Restricted


2019 Journal article Open Access OPEN

OpenAIRE: Advancing open science
Manghi P., Artini M., Atzori C., Baglioni M., Bardi A., La Bruzzo S., De Bonis M., Dimitropoulos H., Foufoulas I., Iatropoulou K., Manola N., Martziou S., Principe P.
OpenAIRE, the point of reference for Open Access in Europe, is now addressing the problem of enabling the Open Science paradigm. To this aim it will provide services to: (i) overcome the limits of today's scientific communication landscape, by allowing research communities and the relative e-infrastructures to fully publish, interlink, package and reuse their research artefacts (e.g. literature, data, and software) and their funding grants within the European and global ecosystem as supported/promoted by OpenAIRE, (ii) enable end-users (e.g. researchers, funder officers) to search and consult a rich and up-to-date knowledge graph of research results and (iii) enable scientific and educational information repositories and publishers to subscribe and be notified of changes in the OpenAIRE knowledge graph. These combined actions will bring long-term and immediate benefits to research communities, research organisations, repository managers, and funders by affecting the way research results are disseminated and reused. On the one hand, publishing the interlinked and packaged research literature, data and software via OpenAIRE drives research communities to an Open Science transition in a consistent and interoperable fashion. On the other hand, the resulting infrastructure concretely enables the construction of Open Science oriented services, supporting practices such as machine-assisted research reproducibility and evaluation.Source: The Grey journal (Print) 15 (2019): 141–146.
Project(s): OpenAIRE-Connect via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Restricted


2018 Conference article Open Access OPEN

The European project OpenUP: OPENing UP new methods, indicators and tools for peer review, impact measurement and dissemination of research results
Bardi A., Casarosa V., Manghi P.
Open Access and Open Scholarship are substantially changing the way scholarly artefacts are evaluated, published and assessed, while the introduction of new technologies and media in scientific workflows has changed the ``how and to whom'' science is communicated, and how stakeholders interact with the scientific community. OpenUP addresses key aspects and challenges of the currently transforming science landscape. Its main objectives are to: (i) identify and determine new mechanisms, processes and tools for the peer-review of all types of research results (publications, data, software, processes, etc.); (ii) explore, identify and classify innovative dissemination mechanisms with an outreach aim towards businesses and industry, education, and society as a whole; (iii) analyse and identify a set of novel indicators that assess the impact of research results and correlate them to channels of dissemination.Source: IRCDL 2018 - 14th Italian Research Conference on Digital Libraries, pp. 240–248, Udine, 25-26 January 2018
DOI: 10.1007/978-3-319-73165-0_24
Project(s): OpenUP via OpenAIRE

See at: ISTI Repository Open Access | ZENODO Open Access | zenodo.org Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | link.springer.com Restricted | link.springer.com Restricted | CNR ExploRA Restricted


2018 Conference article Open Access OPEN

GDup: De-duplication of Scholarly Communication Big Graphs
Atzori C., Manghi P., Bardi A.
Today, several online services offer functionalities to access information from big scholarly communication graphs, which interlink entities such as publications, authors, datasets, organizations, etc. Such graphs are often populated over time as aggregations of multiple sources and therefore suffer from entity duplication problems. Although deduplication of graphs is a known and actual problem, solutions tend to be dedicated and address a few of the underlying challenges. In this paper, we propose the GDup system, an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup supports practitioners with the functionalities needed to realize a fully-fledged entity deduplication workflow over a generic input graph, inclusive of Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph. GDup is today one of the core components of the OpenAIRE infrastructure production system, monitoring Open Science trends on behalf of the European Commission.Source: 2018 IEEE/ACM 5th International Conference on Big Data Computing Applications and Technologies (BDCAT), pp. 142–151, Zurigo, 17-20/12/2018
DOI: 10.1109/bdcat.2018.00025
Project(s): OpenAIRE2020 via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | ISTI Repository Open Access | ZENODO Open Access | zenodo.org Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | ieeexplore.ieee.org Restricted | CNR ExploRA Restricted | xplorestaging.ieee.org Restricted


2018 Journal article Open Access OPEN

An aggregation framework for digital humanities infrastructures: The PARTHENOS experience
Frosini L., Bardi A., Manghi P., Pagano P.
Digital Humanities Infrastructures (DHIs) are research infrastructures supporting researchers in the field of humanities by providing ICT tools and facilities for performing their studies and investigation activities. A DHI typically serves either researchers of one specific sector of humanities (e.g. history, archaeology) or focused research groups working on specific research topics (e.g. studies on the holocaust, on a specific manuscript), with little or no re-use of tools, services and data that could be shared and successfully adopted to answer research questions of different research disciplines. This fragmentation often represents a barrier to inter-disciplinary research collaborations. We present a technical framework for the federation of DHIs where tools, data, services, and knowledge available from each DHI are shared in an integrated environment where researchers can collaborate on specific research topics by creating customized Virtual Research Environments.Source: SCIRES-IT (Roma) 8 (2018): 33–50. doi:10.2423/i22394303v8n1p33
DOI: 10.2423/i22394303v8n1p33
Project(s): PARTHENOS via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access | www.sciresit.it Open Access


2018 Conference article Open Access OPEN

De-duplicating the OpenAIRE scholarly communication big graph
Atzori C., Manghi P., Bardi A.
The OpenAIRE infrastructure populates a scholarly communication big graph interlinking metadata objects of publications, datasets, software, organizations, funders, and projects. In order to de-duplicate this graph, OpenAIRE has developed GDup, an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup offers functionalities to realize a hilly-fledged entity deduplication workflow over a generic input graph, inclusive of Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph.Source: e-science 2018 - 14th IEEE International Conference on e-Science (e-Science), pp. 372–373, Amsterdam, the Netherlands, 29 October - 01 November 2018
DOI: 10.1109/escience.2018.00104
DOI: 10.5281/zenodo.1489139
DOI: 10.5281/zenodo.1489140
Project(s): OpenAIRE2020 via OpenAIRE, OpenAIRE-Advance via OpenAIRE

See at: ISTI Repository Open Access | ZENODO Open Access | zenodo.org Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | ieeexplore.ieee.org Restricted | CNR ExploRA Restricted | xplorestaging.ieee.org Restricted