55 result(s)
Page Size: 10, 20, 50
Export: bibtex, xml, json, csv
Order by:

CNR Author operator: and / or
more
Typology operator: and / or
Language operator: and / or
Date operator: and / or
more
Rights operator: and / or
not yet published Conference article Open Access OPEN
Exploring scientometrics with the OpenAIRE Graph: introducing the OpenAIRE Beginner's Kit
Mannocci A., Baglioni M.
The OpenAIRE Graph is an extensive resource housing diverse information onresearch products, including literature, datasets, and software, alongsideresearch projects and other scholarly outputs and context. It stands as acornerstone among contemporary research information databases, offeringinvaluable insights for scientometric investigations. Despite its wealth ofdata, its sheer size may initially appear daunting, potentially hindering itswidespread adoption. To address this challenge, this paper introduces theOpenAIRE Beginner's Kit, a user-friendly solution providing access to a subsetof the OpenAIRE Graph within a sandboxed environment coupled with a Jupyternotebook for analysis. The OpenAIRE Beginner's Kit is meticulously designed todemocratise research and data exploration, offering accessibility from standarddesktop and laptop setups. Within this paper, we provide a brief overview ofthe included dataset and offer guidance on leveraging the kit through aselection of illustrative queries tailored to address common scientometricinquiries.

See at: arxiv.org Open Access | CNR IRIS Open Access | CNR IRIS Restricted


2025 Other Open Access OPEN
Open Research Information: an ethical necessity. OpenAIRE's role in advancing Open Science
Pavone G., Baglioni M., Bardi A., Amodeo S.
Research integrity demands transparency not only in scientific outputs but also in the metadata that describes them. This principle requires that research information is collected through transparent methods and remains open and available for verification, enabling evidence-based monitoring and public decision-making. OpenAIRE maintains a “scholarly knowledge graph” of interconnected research metadata and provides a catalogue of services that support Open Science by facilitating knowledge discovery and research assessment.Project(s): SciLake via OpenAIRE

See at: CNR IRIS Open Access | www.ethics.cnr.it Open Access | CNR IRIS Restricted


2025 Journal article Open Access OPEN
Towards the interoperability of scholarly repository registries
Baglioni M., Pavone G., Mannocci A., Manghi P.
The enactment of Open Science relies on scholarly repositories that make research products findable and accessible, while scholarly repository registries maintain authoritative metadata and persistent identifiers (PIDs) to help researchers and infrastructure providers discover and access needed repositories. However, the proliferation of repositories targeting different research products (e.g., publications, data, and software) or serving specific disciplines has led to the creation of multiple registries whose scope is not mutually exclusive. Such a fragmented landscape poses significant concerns regarding authoritativeness, disambiguation, and coverage for scholarly communication service and infrastructure providers who consume content from these registries. These providers must either limit their focus to a single registry or manage complex data fusion strategies to integrate diverse repository profiles from various sources. While favouring the existence of a plurality of registries, this paper advocates for their interoperability, which is essential to eliminate the aforementioned barriers and enable their full, unambiguous utilisation. We analyse the data models of four prominent registries—FAIRsharing, re3data, OpenDOAR, and ROAR—and classify their properties and overlap. We provide a crosswalk between their data models and suggest a common data model shared across the examined registries to pave the way toward interoperability. As a means of validation, we include a coverage evaluation of the proposed data model.The paper adopts a pragmatic approach towards scholarly registry interoperability and suggests a common metadata model to foster the exchange of information across these platforms. The purpose of the paper is to serve as a cornerstone, initiating and engaging the community in discussions surrounding the interoperability of scholarly repository registries.Source: INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, vol. 26 (issue 1)
DOI: 10.1007/s00799-025-00414-y
Project(s): EOSC Future via OpenAIRE, OpenAIRE Nexus via OpenAIRE
Metrics:


See at: International Journal on Digital Libraries Open Access | CNR IRIS Open Access | link.springer.com Open Access | Software Heritage Restricted | Software Heritage Restricted | GitHub Restricted | GitHub Restricted | CNR IRIS Restricted


2025 Other Restricted
InfraScience research activity report 2024
Angioni S., Artini M., Assante M., Atzori C., Baglioni M., Bardi A., Bosio C., Bove P., Calanducci A., Candela L., Casini G., Castelli D., Cirillo R., Coro G., De Bonis M., Debole F., Dell'Amico A., Frosini L., Ibrahim Ahmed, La Bruzzo S., Lelii L., Manghi P., Mangiacrapa F., Mangione D., Mannocci A., Molinaro E., Oliviero A., Pagano P., Panichi G., Teresa M. T., Pavone G., Peccerillo B., Piccioli T., Procaccini M., Straccia U., Vannini G. L., Versienti L.
InfraScience is a research group within the Institute of Information Science and Technologies (ISTI) of the National Research Council of Italy (CNR), based in Pisa. This activity report outlines the group's research achievements and initiatives throughout 2024. InfraScience focused its efforts on key challenges in the areas of Data Infrastructures, e-Science, and Intelligent Systems, maintaining a strong synergy between research and development and a firm commitment to open science principles. In 2024, the group played a leading role in the development and evolution of two major Open Science infrastructures: D4Science and OpenAIRE. InfraScience researchers contributed significantly to the scientific community through the publication of peer-reviewed papers, active participation in EU-funded research projects, organization of international conferences and training activities, and engagement in various working groups and task forces. This report highlights these contributions and underscores the group's ongoing dedication to advancing open, collaborative, and impactful science.DOI: 10.32079/isti-ar-2025/001
Metrics:


See at: CNR IRIS Restricted | CNR IRIS Restricted | CNR IRIS Restricted


2024 Conference article Open Access OPEN
The ARIADNEplus Knowledge Base: a Linked Open Data set for archaeological research
Bardi A., Baglioni M., Artini M., Mannocci A., Pavone G.
The ARIADNE infrastructure provides tools and services for researchers to address archaeological grand challenges that require discovery and analysis of information scattered across different thematic and geographically distributed sources. The ARIADNEplus Knowledge Base (KB) is an archaeological Linked Open Data set modelled according to the ARIADNE ontology, based on CIDOC-CRM, and provided by an international network of organisations leaders in different domains of archaeological sciences. In February 2024, the ARIADNEplus KB features about 4 million archaeological resources. Thanks to the ARIADNE infrastructure, data providers increased the level of fairness of their resources and contributed to a unique asset for the archaeology research community, the European Open Science Cloud and society at large.Source: CEUR WORKSHOP PROCEEDINGS, vol. 3741, pp. 91-100. Viallasimius, Italy, 23-26/06/2024
Project(s): ARIADNEplus via OpenAIRE, ATRIUM via OpenAIRE

See at: ceur-ws.org Open Access | CNR IRIS Open Access | CNR IRIS Restricted


2024 Journal article Open Access OPEN
Il ruolo di OPENAIRE nel promuovere la scienza aperta
Pavone G., Baglioni M., Bardi A., Amodeo S.
L’integrità nella ricerca necessita di trasparenza non solo nel lavoro scientifico, ma anche nei metadati che ne descrivono i risultati. Per questo si parla di “Open Research Information” in riferimento all’insieme delle informazioni aperte sulla ricerca, tra cui i metadati che descrivono i vari prodotti scientifici, i dati bibliografici, le informazioni su finanziamenti, organizzazioni, affiliazioni e così via. Queste informazioni devono essere raccolte con metodi trasparenti, aperte e disponibili per la verifica, consentendo un monitoraggio basato su evidenze e decisioni pubbliche informate. OpenAIRE mantiene un “grafo della conoscenza scientifica”1 composto di metadati di ricerca interconnessi e fornisce un catalogo di servizi a supporto della Open Science2, facilitando ad esempio l’esplorazione dei contenuti scientifici e processi di valutazione basati su dati trasparenti.Source: BIOETICA, pp. 397-400
Project(s): SciLake via OpenAIRE

See at: CNR IRIS Open Access | CNR IRIS Restricted | CNR IRIS Restricted


2024 Dataset Open Access OPEN
OpenAIRE Graph Dataset v8.0.0 (July 2024)
Manghi P., Atzori C., Bardi A., Baglioni M., Dimitropoulos H., La Bruzzo S., Foufoulas I., Mannocci A., Horst M., Iatropoulou K., Kokogiannaki A., De Bonis M., Artini M., Lempesis A., Ioannidis A., Manola N., Principe P., Vergoulis T., Chatzopoulos S.
The OpenAIRE Graph is a large and rich collection of open and linked scholarly records from trusted data sources, such as journals, repositories, and registries. It aims to foster Open Science practices and enable the scientific community to discover, monitor, and evaluate science. The Graph is cleaned, deduplicated, enriched, and full-text mined to generate statistics and insights. The Graph is accessible via various services, such as OpenAIRE MONITOR, EXPLORE, ScholeXplorer (Scholix API for the retrieval of literature-data links), search APIs and snapshots in json format updated every six months. The Graph data are openly available with CC-BY license for third-parties to reuse and create added value services. The documentation is available at: https://graph.openaire.euDOI: 10.5281/zenodo.12819872
Project(s): FAIRCORE4EOSC via OpenAIRE, SciLake via OpenAIRE, EOSC Beyond via OpenAIRE, GraspOS via OpenAIRE, OSTrails via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | zenodo.org Open Access | CNR IRIS Restricted


2023 Conference article Open Access OPEN
(Semi)automated disambiguation of scholarly repositories
Baglioni M, Mannocci A, Pavone G, De Bonis M, Manghi P
The full exploitation of scholarly repositories is pivotal in modern Open Science, and scholarly repository registries are kingpins in enabling researchers and research infrastructures to list and search for suitable repositories. However, since multiple registries exist, repository managers are keen on registering multiple times the repositories they manage to maximise their traction and visibility across different research communities, disciplines, and applications. These multiple registrations ultimately lead to information fragmentation and redundancy on the one hand and, on the other, force registries' users to juggle multiple registries, profiles and identifiers describing the same repository. Such problems are known to registries, which claim equivalence between repository profiles whenever possible by cross-referencing their identifiers across different registries. However, as we will see, this "claim set" is far from complete and, therefore, many replicas slip under the radar, possibly creating problems downstream. In this work, we combine such claims to create duplicate sets and extend them with the results of an automated clustering algorithm run over repository metadata descriptions. Then we manually validate our results to produce an "as accurate as possible" de-duplicated dataset of scholarly repositories.Source: CEUR WORKSHOP PROCEEDINGS, pp. 47-59. Bari, Italy, 23-24/02/2023
Project(s): OpenAIRE Nexus via OpenAIRE

See at: ceur-ws.org Open Access | CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2023 Other Open Access OPEN
OpenAIRE, comunità e servizi per praticare la scienza aperta
Pavone G, Atzori C, Baglioni M, Bardi A, Manghi P, Castelli D
Per praticare la ricerca secondo i principi dell'Open Science sono al contempo necessarie tecnologie - con infrastrutture che consentano e facilitino la collaborazione e lo scambio massivo di informazioni su scala internazionale - e competenze che permettano di massimizzarne uso e risultati. In altre parole occorrono servizi, scambio di competenze e formazione. Su queste direttrici si concentra il lavoro di OpenAIRE (Open Access Infrastructure for Research in Europe), l'infrastruttura europea per la Scienza Aperta che offre servizi tecnologici e una rete europea di scambio e sinergia per favorire la scienza aperta. Avviata come progetto europeo nel 2009 per il monitoraggio dell'Open Access, nel corso degli anni l'iniziativa è stata rifinanziata e il suo ambito di interesse esteso a tutte le componenti dell'Open Science. Nel 2018 si è costituita come organizzazione senza scopo di lucro per garantire una struttura permanente a supporto delle politiche nazionali ed europee per l'Open Science. Il network di OpenAIRE conta oltre 40 membri tra centri di ricerca, università, fondazioni ed enti gestori di servizi distribuiti in tutta Europa. Come comunità di pratica, OpenAIRE ha la missione di costituire e gestire un'infrastruttura che supporti una comunicazione scientifica aperta e sostenibile, fornendo i servizi, le risorse e il coordinamento di iniziative ed esperti necessari per implementare un ambiente comune europeo per la scienza aperta. Per realizzare questa visione, OpenAIRE offre servizi tecnologici, di training e di supporto, coprendo l'intero ciclo di vita della ricerca (la lista completa dei servizi è consultabile su catalogue.openaire.eu). I servizi tecnologici spaziano dalla gestione dei dati al discovery, dalla gestione di riviste al monitoraggio dei risultati della ricerca e dell'adozione di pratiche Open Science. Inoltre la rete internazionale dei NOAD (National Open Access Desk: openaire.eu/contact-noads) promuove la scienza aperta fornendo assistenza e formazione a vari livelli. L'obiettivo è abilitare i vari attori coinvolti nell'attività scientifica nelle pratiche dell'open science e dell'open access organizzando workshop nazionali e training dedicati. I NOADs inoltre forniscono consulenza esperta sulle infrastrutture che supportano i flussi di lavoro per la scienza aperta, nonché per la definizione di politiche per la sua implementazione, quali stesura e aggiornamento di policies istituzionali, individuazione degli obblighi normativi, di adempimenti relativi ai finanziamenti o di strumenti per il Data Management Plan (DMP). Il CNR, in particolare il suo istituto ISTI, in qualità di centro di sviluppo e innovazione tecnologica dell'infrastruttura e di gestore del NOAD Italiano, opera in accordo con la missione di OpenAIRE contribuendo in modo significativo alle sue attività e agli organismi di governo. L'ente offre dunque le sue competenze per garantire il mantenimento, l'operatività e l'innovazione dell'infrastruttura partecipando in iniziative e progetti che contribuiscono alla sostenibilità e all'innovazione dei servizi di questa infrastruttura. Come NOAD, offre formazione e supporto per affrontare problematiche quali la definizione di DMP, il rispetto dei principi "FAIR" per la gestione dei dati, e la stesura di politiche istituzionali. Le attività sono portate avanti in collaborazione con i NOAD in altri paesi europei in modo da massimizzare l'integrazione di soluzioni e politiche a livello europeo.

See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2023 Other Open Access OPEN
OpenAIRE Graph: una risorsa aperta per la scienza aperta
Atzori C, Bardi A, Baglioni M, Manghi P
OpenAIRE Graph (OAG) is a knowledge graph that aggregates information (metadata, relationships) about different entities in the research world, such as publications, datasets, software, funded projects, repositories, and organisations. These entities are interconnected through semantic relationships, such as citations, supplements, similarity, and participation in projects. OAG is an open resource that can be used by funders, organisations, researchers, research communities, and publishers to gain a better understanding of the research landscape and dynamics at various levels, both local and global. As an open and freely accessible resource, produced in accordance with the fundamental values of Open Science as outlined in the UNESCO Recommendation on Open Science, OAG overcomes the use of proprietary data sources, supporting the reform of research assessment, researchers, and organisations as envisaged by the Coalition for Advancing Research Assessment (CoARA). OAG is built from bibliographic records obtained from well-known sources such as Crossref, open access journals registered in DOAJ (Directory of Open Access Journals), ORCID, Microsoft Academic Graph, Datacite, as well as from over 1,000 institutional repositories. The metadata of research products contained in the graph are disambiguated and enriched through full-text and data mining processes, making OAG usable for a variety of purposes, including: Research discovery Research assessment Analysis and/or prediction of research collaborations Support for research policy decision-making OAG is a freely accessible resource: search and discovery features are available through the explore.openaire.eu portal, programmatic integration is available through the HTTP Search API, the complete dataset, as well as other datasets that offer specialised views, are available on Zenodo. The monitor.openaire.eu portal hosts several dashboards dedicated to research organisations and funders that include the results of statistical, bibliometrics, and indicator analyses. Additional information is available at https://graph.openaire.eu, where the data models to which the datasets conform, API documentation, as well as the methodological approach used to build and process OAG are described. OAG can play a significant role in research assessment by providing a more comprehensive and accurate view of research output and impact. By aggregating data from a variety of sources, OAG can provide a more holistic picture of a researcher's or organization's research activities. This can help to identify areas of strength and weakness, as well as potential areas for collaboration. OAG can also be used to track the impact of research over time. By tracking citations, downloads, and other forms of engagement, OAG can help to measure the influence of research and the impact it has on society. This information can be used to inform research funding decisions, as well as to promote the dissemination of research findings. In addition to its quantitative measures, OAG can also provide qualitative insights into research. By analyzing the relationships between different research products, OAG can help to identify emerging trends and areas of collaboration. This information can be used to support research policy development and to promote the cross-fertilization of ideas. In conclusion, OAG is a powerful tool that has the potential to revolutionise the way research is assessed. By providing a more comprehensive and accurate view of research output and impact, OAG can help to make research assessment more fair, transparent, and equitable.

See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2023 Other Open Access OPEN
Community building con OpenAIRE CONNECT
Bardi A, Baglioni M
Le comunità di ricerca, le reti universitarie, le infrastrutture di ricerca mirano a massimizzare il loro impatto sulla ricerca e sulla società e a dotare i loro ricercatori di strumenti comuni, politiche e linee guida condivise per migliorare la qualità della ricerca. Tuttavia, spesso non è facile ottenere la visibilità che meritano nei confronti degli enti finanziatori o del personale di ricerca. Analizzando il panorama attuale è possibile identificare un insieme di attività strategiche: 1. Ampia diffusione di tutte le attività e dei risultati dei ricercatori sia all'interno che al di fuori della propria comunità; 2. Monitoraggio dei risultati della ricerca della comunità; 3. Promozione e monitoraggio dell'adozione delle pratiche di Open Science (ad es. dati FAIR e pubblicazione in Open Access); 4. Monitoraggio dell'aderenza alle politiche condivise e alle best practices del dominio; 5. Centralizzazione della fornitura di servizi condivisi per ridurre i costi e raggiungere un maggior numero di utenti (ad es. per programmi di formazione rivolti a responsabili della ricerca, amministratori, ricercatori, studenti). Queste attività non sono semplici da realizzare in modo sostenibile. Spesso, il monitoraggio dei risultati della ricerca viene fatto manualmente, richiedendo molto sforzo per comunicare con ogni membro della comunità (sia persone che organizzazioni), garantire la qualità e armonizzare i dati raccolti in modo che possano essere diffusi e/o analizzati. Un altro problema comune è monitorare l'adozione delle pratiche di pubblicazione Open Science dei ricercatori, identificare le lacune e preparare tutorial e formazione per supportarli. OpenAIRE, un'infrastruttura di comunicazione scientifica impegnata nella promozione dell'Open Science, sta collaborando con diverse alleanze di università (ad es. Aurora, EUT+, EUTOPIA, FIT FORTHEM), infrastrutture di ricerca (ad es. EMBRC, IPERION-HS, DARIAH) e comunità specifiche del dominio (ad es. scienze marine, neuroinformatica) per affrontare queste sfide. Dal punto di vista tecnico, OpenAIRE opera il servizio CONNECT (https://connect.openaire.eu), attraverso il quale una comunità può avere un gateway personalizzabile dove scoprire tutti i prodotti della ricerca della comunità tramite un unico punto di accesso e servizi per facilitare l'adozione e il monitoraggio delle pratiche di Open Science. Dal punto di vista della formazione, le collaborazioni ci danno l'opportunità di arricchire e scambiare materiale formativo, competenze e impostare una strategia di disseminazione congiunta per migliorare ulteriormente la visibilità all'interno delle comunità, della rete OpenAIRE e oltre. La demo presenterà uno dei gateway pubblici per mostrare tutte le funzionalità integrate disponibili agli utenti, fra cui: cercare i prodotti della ricerca, collegarli tra loro e con i progetti che li hanno finanziati, cercare repository Open Access per depositare qualsiasi tipo di prodotto della ricerca, l'integrazione con il servizio ORCID. Presenterà anche la dashboard di amministrazione che può essere utilizzata dai curatori della comunità per configurare il gateway in termini di contenuti e aspetto.

See at: CNR IRIS Open Access | zenodo.org Open Access | CNR IRIS Restricted


2023 Other Open Access OPEN
InfraScience research activity report 2023
Artini M., Assante M., Atzori C., Baglioni M., Bardi A., Bosio C., Bove P., Calanducci A., Candela L., Casini G., Castelli D., Cirillo R., Coro G., De Bonis M., Debole F., Dell'Amico A., Frosini L., Ibrahim A. S. T., La Bruzzo S., Lelii L., Manghi P., Mangiacrapa F., Mangione D., Mannocci A., Molinaro E., Pagano P., Panichi G., Paratore M. T., Pavone G., Piccioli T., Sinibaldi F., Straccia U., Vannini G. L.
InfraScience is a research group of the National Research Council of Italy - Institute of Information Science and Technologies (CNR - ISTI) based in Pisa, Italy. This report documents the research activity performed by this group in 2023 to highlight the major results. In particular, the InfraScience group engaged in research challenges characterising Data Infrastructures, e-Science, and Intelligent Systems. The group activity is pursued by closely connecting research and development and by promoting and supporting open science. In fact, the group is leading the development of two large scale infrastructures for Open Science, i.e. D4Science and OpenAIRE. During 2023 InfraScience members contributed to the publishing of several papers, to the research and development activities of several research projects (primarily funded by EU), to the organization of conferences and training events, to several working groups and task forces.DOI: 10.32079/isti-ar-2023/002
Project(s): Blue Cloud via OpenAIRE, EOSC Future via OpenAIRE, TAILOR via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | CNR IRIS Restricted


2023 Contribution to conference Open Access OPEN
OpenAIRE CONNECT for research alliances
Malaguarnera G, Bardi A, Baglioni M, Kokogiannaki A
Research alliances like university networks or associations gather their members, with common or complementary backgrounds, to maximize their impact on research and society and empower their researchers with common tools, shared policies and guidelines to improve the quality of the research. Via the alliance, members and their affiliated researchers have more collaboration and funding opportunities. However, often it is not easy for an alliance to gain the deserved visibility towards funding organizations or the research staff. By analyzing the current landscape of research alliances it is possible to identify a set of strategic activities: - Wide dissemination of all activities and researchers' results within and beyond the alliance itself; - Tracking the research outputs of the members, especially those resulted from a collaboration among the members of the alliance - Promote Open Access and other Open Science practices (e.g. data sharing and Open Access publishing) to foster a more free circulation of knowledge within and beyond the researchers of the alliance, increase collaboration opportunities, and use it an accelerator towards the Sustainable Development Goals (SDGs) as suggested by UNESCO - Tracking the adherence to shared policies, domain best practices (for thematic alliances) and Open Science practices - Find sustainable common solutions for services shared among the members, reducing the costs and reaching a higher number of users (e.g. for training programs targeting research managers, administrators, researchers, students) Addressing those activities in a sustainable way is in some cases not straightforward. Often, the tracking of research outcome is done manually, requiring a lot of effort for communicating with the single members, ensuring the quality and harmonizing the collected data so that it can be disseminated and/or analyzed. Another common problem is tracking the uptake of Open Science publishing practices of the researchers, identifying gaps and preparing tutorials and training to help them. OpenAIRE, a scholarly communication infrastructure committed to the promotion of Open Science, is collaborating with several research alliances (Aurora, EUT+, EUTOPIA, FIT FORTHEM) to address those challenges. From the technical point of view, OpenAIRE provides to each alliance a customizable gateway where all research products of the members can be discovered via a single entry point and services to ease the adoption and tracking of Open Science practices (see https://connect.openaire.eu for the list of exiting gateways and more information). From the training point of view, the collaborations give us the opportunity to enrich and exchange training material, expertise, and set up a joint dissemination strategy to further improve the visibility within the alliances, the OpenAIRE network, and beyond.DOI: 10.5281/zenodo.8300745
DOI: 10.5281/zenodo.8300744
Project(s): FAIRCORE4EOSC via OpenAIRE, EOSC Future via OpenAIRE, OpenAIRE Nexus via OpenAIRE
Metrics:


See at: ZENODO Open Access | ZENODO Open Access | CNR IRIS Open Access | ISTI Repository Open Access | zenodo.org Open Access | CNR IRIS Restricted


2022 Other Open Access OPEN
InfraScience research activity report 2021
Artini M, Assante M, Atzori C, Baglioni M, Bardi A, Bove P, Candela L, Casini G, Castelli D, Cirillo R, Coro G, De Bonis M, Debole F, Dell'Amico A, Frosini L, La Bruzzo S, Lazzeri E, Lelii L, Manghi P, Mangiacrapa F, Mangione D, Mannocci A, Ottonello E, Pagano P, Panichi G, Pavone G, Piccioli T, Sinibaldi F, Straccia U
InfraScience is a research group of the National Research Council of Italy - Institute of Information Science and Technologies (CNR - ISTI) based in Pisa, Italy. This report documents the research activity performed by this group in 2021 to highlight the major results. In particular, the InfraScience group confronted with research challenges characterising Data Infrastructures, eScience, and Intelligent Systems. The group activity is pursued by closely connecting research and development and by promoting and supporting open science. In fact, the group is leading the development of two large scale infrastructures for Open Science, i.e. D4Science and OpenAIRE. During 2021 InfraScience members contributed to the publishing of 25 papers, to the research and development activities of 18 research projects (15 funded by EU), to the organization of conferences and training events, to several working groups and task forces.DOI: 10.32079/isti-ar-2022/001
Project(s): ARIADNEplus via OpenAIRE, Blue Cloud via OpenAIRE, PerformFISH via OpenAIRE, EOSC-Pillar via OpenAIRE, DESIRA via OpenAIRE, EOSC Future via OpenAIRE, EOSCsecretariat.eu via OpenAIRE, EcoScope via OpenAIRE, RISIS 2 via OpenAIRE, OpenAIRE-Advance via OpenAIRE, OpenAIRE Nexus via OpenAIRE, SoBigData-PlusPlus via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2022 Conference article Open Access OPEN
"Knock Knock! Who's There?" A study on scholarly repositories' availability
Mannocci A, Baglioni M, Manghi P
Scholarly repositories are the cornerstone of modern open science, and their availability is vital for enacting its practices. To this end, scholarly registries such as FAIRsharing, re3data, OpenDOAR and ROAR give them presence and visibility across different research communities, disciplines, and applications by assigning an identifier and persisting their profiles with summary metadata. Alas, like any other resource available on the Web, scholarly repositories, be they tailored for literature, software or data, are quite dynamic and can be frequently changed, moved, merged or discontinued. Therefore, their references are prone to link rot over time, and their availability often boils down to whether the homepage URLs indicated in authoritative repository profiles within scholarly registries respond or not. For this study, we harvested the content of four prominent scholarly registries and resolved over 13 thousand unique repository URLs. By performing a quantitative analysis on such an extensive collection of repositories, this paper aims to provide a global snapshot of their availability, which bewilderingly is far from granted.DOI: 10.1007/978-3-031-16802-4_26
Project(s): OpenAIRE Nexus via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | link.springer.com Open Access | ISTI Repository Open Access | CNR IRIS Restricted | CNR IRIS Restricted | CNR IRIS Restricted


2022 Software Metadata Only Access
dnet-dedup framework
Artini M., Atzori C., Bardi A., Baglioni M., De Bonis M., Dell'Amico A., La Bruzzo S. F., Mannocci A., Manghi P.
The GDup Software enables an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup supports practitioners with the functionalities needed to realize a fully-fledged entity deduplication workflow over a generic input graph, including Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph. GDup is today one of the core components of the OpenAIRE infrastructure production system, monitoring Open Science trends on behalf of the European Commission.Project(s): OpenAIRE-Advance via OpenAIRE, OpenAIRE Nexus via OpenAIRE

See at: github.com Restricted | CNR IRIS Restricted


2022 Other Open Access OPEN
Data model description of the OpenAIRE Research Graph
La Bruzzo Sf, Artini M, Atzori C, Bardi A, Baglioni M, De Bonis M, Mannocci A, Manghi P, Pavone G
The OpenAIRE Graph (formerly known as the OpenAIRE Research Graph) is one of the largest open scholarly record collections worldwide, key to fostering Open Science and establishing its practices in daily research activities. Conceived as a public and transparent good, populated out of data sources trusted by scientists, the Graph aims at bringing discovery, monitoring, and assessment of science back into the hands of the scientific community. Imagine a vast collection of research products all linked together, contextualized, and openly available. For the past years, OpenAIRE has been working to gather this valuable record. It is a massive collection of metadata and links between scientific products such as articles, datasets, software, and other research products, entities like organizations, funders, funding streams, projects, communities, and data sources. This technical Report describes the public data model adopted by the OpenAIRE Graph.DOI: 10.32079/isti-tr-2022/031
Metrics:


See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2022 Other Open Access OPEN
OpenAIRE Research Graph: aggregation workflow
La Bruzzo Sf, Artini M, Atzori C, Bardi A, Baglioni M, De Bonis M, Dell'Amico A, Mannocci A, Manghi P, Pavone G
The OpenAIRE Graph (formerly the OpenAIRE Research Graph) is one of the largest open scholarly record collections worldwide. It is key in fostering Open Science and establishing its practices in daily research activities. Conceived as a public and transparent good, populated out of data sources trusted by scientists, the Graph aims at bringing discovery, monitoring, and assessment of science back into the hands of the scientific community. OpenAIRE collects metadata records from more than 70K scholarly communication sources worldwide, including Open Access institutional repositories, data archives, and journals. All the metadata records (i.e., descriptions of research products) are put together in a data lake with records from Crossref, Unpaywall, ORCID, ROR, and information about projects provided by national and international funders. This technical Report describes the main Aggregation Workflow to orchestrate the data aggregation and the implemented mapping from some of the main datasources into the OpenAIRE research graph data model.DOI: 10.32079/isti-tr-2022/033
Project(s): OpenAIRE-Advance via OpenAIRE, OpenAIRE Nexus via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2022 Other Open Access OPEN
OpenAIRE Research Graph deduplication workflow
La Bruzzo Sf, Artini M, Atzori C, Bardi A, Baglioni M, De Bonis M, Mannocci A, Manghi P, Pavone G
The OpenAIRE aggregation workflow can collect metadata records from different providers about the same scholarly work. Each metadata record can carry different information because, for example, some providers are not aware of links to projects, keywords, or other details. Another typical case is when OpenAIRE collects one metadata record from a repository about a pre-print and another from a journal about the published article. To provide correct statistics, OpenAIRE must identify those cases and "merge" the two metadata records so that the scholarly work is counted only once in the statistics OpenAIRE produces. This technical Report describes the Deduplication workflow and technique adopted to deduplicate the OpenAIRE Graph.DOI: 10.32079/isti-tr-2022/032
Project(s): OpenAIRE-Connect via OpenAIRE, OpenAIRE Nexus via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted


2022 Other Open Access OPEN
InfraScience research activity report 2022
Artini M, Assante M, Atzori C, Baglioni M, Bardi A, Bove P, Candela L, Casini G, Castelli D, Cirillo R, Coro G, De Bonis M, Debole F, Dell'Amico A, Frosini L, La Bruzzo S, Lelii L, Manghi P, Mangiacrapa F, Mangione D, Mannocci A, Ottonello E, Pagano P, Panichi G, Pavone G, Piccioli T, Sinibaldi F, Straccia U, Zoppi F
InfraScience is a research group of the National Research Council of Italy - Institute of Information Science and Technologies (CNR - ISTI) based in Pisa, Italy. This report documents the research activity performed by this group in 2022 to highlight the major results. In particular, the InfraScience group confronted with research challenges characterising Data Infrastructures, e-Science, and Intelligent Systems. The group activity is pursued by closely connecting research and development and by promoting and supporting open science. In fact, the group is leading the development of two large scale infrastructures for Open Science, i.e. D4Science and OpenAIRE. During 2022 InfraScience members contributed to the publishing of several papers, to the research and development activities of 18 research projects (15 funded by EU), to the organization of conferences and training events, to several working groups and task forces.DOI: 10.32079/isti-ar-2022/004
Project(s): ARIADNEplus via OpenAIRE, Blue Cloud via OpenAIRE, EOSC-Pillar via OpenAIRE, DESIRA via OpenAIRE, EOSC Future via OpenAIRE, RISIS 2 via OpenAIRE, TAILOR via OpenAIRE, SoBigData-PlusPlus via OpenAIRE
Metrics:


See at: CNR IRIS Open Access | ISTI Repository Open Access | CNR IRIS Restricted