16 result(s)
Page Size: 10, 20, 50
Export: bibtex, xml, json, csv
Order by:

CNR Author operator: and / or
more
Typology operator: and / or
Language operator: and / or
Date operator: and / or
Rights operator: and / or
2021 Journal article Open Access OPEN

A workflow language for research e-infrastructures
Candela L., Grossi V., Manghi P., Trasarti R.
Research e-infrastructures are "systems of systems," patchworks of resources such as tools and services, which change over time to address the evolving needs of the scientific process. In such environments, researchers carry out their scientific process in terms of sequences of actions that mainly include invocation of web services, user interaction with web applications, user download and use of shared software libraries/tools. The resulting workflows are intended to generate new research products (articles, datasets, methods, etc.) out of existing ones. Sharing a digital and executable representation of such workflows with other scientists would enforce Open Science publishing principles of "reproducibility of science" and "transparent assessment of science." This work presents HyWare, a language and execution platform capable of representing scientific processes in highly heterogeneous research e-infrastructures in terms of so-called hybrid workflows. Hybrid workflows can express sequences of "manually executable actions," i.e., formal descriptions guiding users to repeat a reasoning, protocol or manual procedure, and "machine-executable actions," i.e., encoding of the automated execution of one (or more) web services. An HyWare execution platform enables scientists to (i) create and share workflows out of a given action set (as defined by the users to match e-infrastructure needs) and (ii) execute hybrid workflows making sure input/output of the actions flow properly across manual and automated actions. The HyWare language and platform can be implemented as an extension of well-known workflow languages and platforms.Source: International Journal of Data Science and Analytics (Online) (2021). doi:10.1007/s41060-020-00237-x
DOI: 10.1007/s41060-020-00237-x
Project(s): SoBigData via OpenAIRE

See at: link.springer.com Open Access | International Journal of Data Science and Analytics Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | International Journal of Data Science and Analytics Restricted | International Journal of Data Science and Analytics Restricted


2021 Contribution to journal Open Access OPEN

Introduction to the special issue on social mining and big data ecosystem for open, responsible data science
Pappalardo L., Grossi V., Pedreschi D.
Source: International Journal of Data Science and Analytics (Online) (2021). doi:10.1007/s41060-021-00253-5
DOI: 10.1007/s41060-021-00253-5
Project(s): SoBigData via OpenAIRE, SoBigData-PlusPlus via OpenAIRE

See at: link.springer.com Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | International Journal of Data Science and Analytics Restricted | International Journal of Data Science and Analytics Restricted


2021 Conference article Open Access OPEN

Data Science Workflows for the Cloud/Edge Computing Continuum
Grossi V., Trasarti R., Dazzi P.
Research infrastructures play a crucial role in the development of data science. In fact, the conjunction of data, infrastructures and analytical methods enable multidisciplinary scientists and innovators to extract knowledge and to make the knowledge and experiments reusable by the scientific community, innovators providing an im- pact on science and society. Resources such as data and methods, help domain and data scientists to transform research in an innovation question into a responsible data-driven analytical process. On the other hand, Edge computing is a new computing paradigm that is spreading and developing at an incredible pace. Edge computing is based on the assumption that for certain applications is beneficial to bring the computation as closer as possible to data or end-users. This paper introduces an approach for writing data science workflows targeting research infrastructures that encompass resources located at the edge of the network.Source: FRAME'21 - 1st Workshop on Flexible Resource and Application Management on the Edge, Virtual Event, Sweden, 25/06/2021
DOI: 10.1145/3452369.3463820
Project(s): SoBigData-PlusPlus via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2021 Journal article Open Access OPEN

Data Science: a game changer for science and innovation
Grossi V., Giannotti F., Pedreschi D., Manghi P., Pagano P., Assante M.
This paper shows data science's potential for disruptive innovation in science, industry, policy, and people's lives. We present how data science impacts science and society at large in the coming years, including ethical problems in managing human behavior data and considering the quantitative expectations of data science economic impact. We introduce concepts such as open science and e-infrastructure as useful tools for supporting ethical data science and training new generations of data scientists. Finally, this work outlines SoBigData Research Infrastructure as an easy-to-access platform for executing complex data science processes. The services proposed by SoBigData are aimed at using data science to understand the complexity of our contemporary, globally interconnected society.Source: International Journal of Data Science and Analytics (Print) 11 (2021): 263–278. doi:10.1007/s41060-020-00240-2
DOI: 10.1007/s41060-020-00240-2

See at: link.springer.com Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | International Journal of Data Science and Analytics Restricted | International Journal of Data Science and Analytics Restricted


2020 Report Open Access OPEN

SoBigData_training activities-planning material and reports 2
Wright J., Braghieri M., Rapisarda B.
Deliverable D4.3 provides an overview of training activities that have taken place from M36 to M52. It also includes a full overview of the training materials integration within the SoBigData Research Infrastructure. This final deliverable aims to provide a more detailed look into the various activities that have taken place since September 2018 and in particular to provide comprehensive reports on the events that have addressed gender and diversity issues. The whole project has sought to encourage more females into Computer Science and has therefore tailored some events to appeal to females in particular. SoBigData has also provided travel grants to assist female scientists in attending international events. With regards to the training materials, we have sought to make these a more effective resource by concentrating on four areas of improvement: Content Harmonization, Muti-Channel Content, Interactivity and Audience Targeting. These enhancements to the SoBigData Research Infrastructure will be covered in detail in Section 4. Discussions are also taking place with regards to the possibility of webinars, tutorials and video lectures to provide more options for the user as well as developing online assessment and exercises. The possibility of more targeted training paths is also being explored to provide a more tailored content for the user - whether they be a student, professional or the general public; the objective being to promote the dissemination of SoBigData training materials as widely as possible.Source: Project report, SoBigData, Deliverable D4.3, 2020
Project(s): SoBigData via OpenAIRE

See at: CNR ExploRA Open Access


2019 Report Open Access OPEN

SoBigData - Sustainability Plan
Grossi V., Rapisarda B.
This report provides a detailed sustainability plan for the pan-European SoBigData Research Infrastructure. The purpose of this document is to describe sustainability activities, actions and events that have been undertaken in order to provide some guidelines about SoBigData's governance and long-term sustainability. Our description involves a detailed overview of the main steps in order to evaluate the impact of sustainability-related actions which have been undertaken in the SoBigData project.Source: Project report, SoBigData, Deliverable D3.7, 2019
Project(s): SoBigData via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2018 Conference article Open Access OPEN

SoBigData: social mining big data ecosystem
Giannotti F., Trasarti R., Bontcheva K., Grossi V.
One of the most pressing and fascinating challenges scientists face today, is understanding the complexity of our globally interconnected society. The big data arising from the digital breadcrumbs of human activities has the potential of providing a powerful social microscope, which can help us understand many complex and hidden socio-economic phenomena. Such challenge requires high-level analytics, modeling and reasoning across all the social dimensions above. There is a need to harness these opportunities for scientific advancement and for the social good, compared to the currently prevalent exploitation of big data for commercial purposes or, worse, social control and surveillance. The main obstacle to this accomplishment, besides the scarcity of data scientists, is the lack of a large-scale open ecosystem where big data and social mining research can be carried out. The SoBigData Research Infrastructure (RI) provides an integrated ecosystem for ethic-sensitive scientific discoveries and advanced applications of social data mining on the various dimensions of social life as recorded by "big data". The research community uses the SoBigData facilities as a "secure digital wind-tunnel" for large-scale social data analysis and simulation experiments. SoBigData promotes repeatable and open science and supports data science research projects by providing: (i) an ever-growing, distributed data ecosystem for procurement, access and curation and management of big social data, to underpin social data mining research within an ethic-sensitive context; (ii) an ever-growing, distributed platform of interoperable, social data mining methods and associated skills: tools, methodologies and services for mining, analysing, and visualising complex and massive datasets, harnessing the techno-legal barriers to the ethically safe deployment of big data for social mining; (iii) an ecosystem where protection of personal information and the respect for fundamental human rights can coexist with a safe use of the same information for scientific purposes of broad and central societal interest. SoBigData has a dedicated ethical and legal board, which is implementing a legal and ethical framework.Source: 27th International World Wide Web, WWW 2018, pp. 437–438, Lyon, France, 23-27/04/2018
DOI: 10.1145/3184558.3186205
Project(s): SoBigData via OpenAIRE

See at: dl.acm.org Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | academic.microsoft.com Restricted | dblp.uni-trier.de Restricted | dl.acm.org Restricted | dl.acm.org Restricted


2017 Journal article Open Access OPEN

HyWare: a HYbrid Workflow lAnguage for Research E-infrastructures
Candela L., Giannotti F., Grossi V., Manghi P., Trasarti R.
Research e-infrastructures are "systems of systems", patchworks of tools, services and data sources, evolving over time to address the needs of the scientific process. Accordingly, in such environments, researchers implement their scientific processes by means of workflows made of a variety of actions, including for example usage of web services, download and execution of shared software libraries or tools, or local and manual manipulation of data. Although scientists may benefit from sharing their scientific process, the heterogeneity underpinning e-infrastructures hinders their ability to represent, share and eventually reproduce such workflows. This work presents HyWare, a language for representing scientific process in highly-heterogeneous e-infrastructures in terms of so-called hybrid workflows. HyWare lays in between "business process modeling languages", which offer a formal and high-level description of a reasoning, protocol, or procedure, and "workflow execution languages", which enable the fully automated execution of a sequence of computational steps via dedicated engines.Source: D-Lib magazine 23 (2017): 8–11. doi:10.1045/january2017-candela
DOI: 10.1045/january2017-candela
Project(s): SoBigData via OpenAIRE

See at: D-Lib Magazine Open Access | D-Lib Magazine Open Access | D-Lib Magazine Open Access | D-Lib Magazine Open Access | ISTI Repository Open Access | CNR ExploRA Open Access | OpenAIRE Open Access | D-Lib Magazine Open Access


2017 Report Closed Access

SoBigData - Periodic dissemination and impact report and plan for following year 1
Grossi V., Rapisarda B., Falchi C.
This first periodic report includes the description of dissemination and impact activities undertaken and scientific papers published, as well as the planning for the second period. Furthermore, It describes the efforts to be made to reach as wide an audience as possible, and the multiple strategies employed by the consortium. This deliverable covers a time period starting from December 2015 (M4) to February 2017 (M18).Source: Project report, SoBigData, Deliverable D3.4, 2017
Project(s): SoBigData via OpenAIRE

See at: CNR ExploRA Restricted


2017 Report Closed Access

SoBigData - VA e-Infrastructure service provision and operation report 1
Trasarti R., Pagano P., Falchi C., Grossi V., Rapisarda B.
The deliverable present the status of the SoBigData platform as an evolving e-infrastructure where the partners are continuosly adding new contents and improving the presentation of them. The virtual research enviroments (VREs) already integrated will be described and monitored with a set of KPIs describing the number of access, the experiments done and the social network activities related to them. Moreover a description of the VREs which are not yet public but are under an internal review phase will be described in order to understand how the consortium is prooceding in integrating resources to the e-infrastructure. An important note is the fact that this deliverable does not contain the assessment from the Advisory Board as described in the DOW, this due the fact that the board is not yet formed. Anyway the SoBigData Platform is begin used by the partner for inserting resources only in the last 6 months and therefore it is in stage where the content vary greatly (in the last month the resources in the catalogue doubled).Source: Project report, SoBigData, Deliverable D7.1, 2017
Project(s): SoBigData via OpenAIRE

See at: CNR ExploRA Restricted


2016 Report Open Access OPEN

SoBigData - Data processing workflow specification language
Candela L., Giannotti F., Grossi V., Manghi P., Trasarti R.
This document contains a general overview of the workflow language definition status.Source: Project report, SoBigData, Deliverable D10.11, 2016
Project(s): SoBigData via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA Open Access


2015 Report Restricted

SoBigData - Fact sheets aimed at different stakeholders
Grossi V., Rapisarda B., Romano V.
This document reports a first stakeholder analysis conducted by the SoBigData consortium among its partners. During the first three months of the project, a set of shared documents has been defined and filled in order to provide an initial stakeholder analysis. On the one hand, the aim was to discover and take a census about the stakeholders already involved in the consortium. On the other hand, the goal was to identify a set of potential stakeholders starting from the activities provided by the partners. The aim is to understand who are the stakeholders already involved and how their needs are addressed and to identify a set of potential stakeholders starting from the activities provided by the partners. This analysis is required for writing a set of initial fact sheets. Currently, for this first deliverable, three fact sheet has been created: the first one is designed for the economy/business application field, the second one is conceived for computer science while the last one is targeted to social science application field. These fact sheets are published on the project web-site for downloading by users and others. Moreover, the fact sheets serve as hand-outs at events such as workshops and conferences. The fact sheets will be produced and updated continuously throughout the project.Source: Project report, SoBigData, Deliverable D3.2, 2015

See at: CNR ExploRA Restricted


2015 Report Restricted

SoBigData - Project web site, project presentation, and social media presence
Grossi V., Rapisarda B., Romano V.
This report provides a full description of the main dissemination channels of the SoBigData project. On the one hand, we have classical media, such as web site while on the other hand we have a massive presence on social media, such as Twitter and Facebook. In this perspective, we can state that the project includes a wide range of dissemination channels from a website to a strong social media presence, including media and specialized journal. The project also has a wikitype environment for internal communication among the project partners. Furthermore, a project portal and management platform have been built to assist project planning and monitoring, and to allow discussion and exchange of draft documents between partners. The aim of this deliverable is to provide an overview of all dissemination channels. This document also describe the project logo and the project presentation. The project web site, presentation, and social media presence will be updated continuously throughout the project lifespan.Source: Project report, SoBigData, Deliverable D3.1, 2015

See at: CNR ExploRA Restricted


2015 Report Restricted

SoBigData - Data management report
Grossi V., Romano V., Trasarti R.
This deliverable describes a web content that provides an ongoing and up to date wiki containing the description of the datasets available in the consortium. The description includes statistics, metadata, sharing policies and archiving technologies as well as the preservation provisions and lifespan. For doing that a set of relevant metadata has been defined in order to provide an homogeneous view of the datasets. The defined set of metadata will be useful also for making the datasets available into the RI. In this perspective, this deliverable represents a first definition of the metadata for describing a dataset that will be available into the RI. Furthermore, this document presents the web form to insert a description of a new dataset, and the wiki page containing the list of the datasets available among the partners,. The proposed wiki page shows a set of relevant information, such as the name of dataset, the accessibility policy, the reference partner. Finally, this document provides a first census of the datasets available in the consortium.Source: Project report, SoBigData, Deliverable D8.1, 2015

See at: CNR ExploRA Restricted


2008 Conference article Restricted

Ontological Support For Association Rule Mining
Bellandi A., Furletti B., Grossi V., Romei A.
This paper describes some improvements of our previous work that realizes an integrated framework for extracting constraint ­based multi­level association rules with an ontology support. The ontology is not the repository of the data, but it models the application domain describing the meta­data. Furthermore, it permits to focus the analysis only on a subset of data and to express multi­ level constraints on them. In this context, we report some theoretical notion already introduced and a detailed description of the recent improvements: the introduction of the object properties in the framework, and the implementation of an user interface.Source: 26th IASTED International Conference on Artificial Intelligence and Applications, 2008, pp. 110–115, Innsbruck, Austria, 11-13 February 2008

See at: dl.acm.org Restricted | CNR ExploRA Restricted


2007 Conference article Restricted

Ontology-Driven Association Rule Extraction: A Case Study
Bellandi A., Furletti B., Grossi V., Romei A.
This paper proposes an integrated framework for extracting Constraint-based Multi-level Association Rules with an ontology support. The system permits the definition of a set of domain-specific constraints on a specific domain ontology, and to query the ontology for filtering the instances used in the association rule mining process. This method can improve the quality of the extracted associations rules in terms of relevance and understandability.Source: C&O:RR-2007 - International Workshop on Contexts and Ontologies: Representation and Reasoning, pp. 10–19, Roskilde University, Denmark, 21 August 2007
Project(s): MUSING

See at: CNR ExploRA Restricted