2020
Journal article
Open Access
An ethico-legal framework for social data science
Forgó N, Hänold S, Van Den Hoven J, Krügel T, Lishchuk I, Mahieu R, Monreale A, Pedreschi D, Pratesi F, Van Putten D
This paper presents a framework for research infrastructures enabling ethically sensitive and legally compliant data science in Europe. Our goal is to describe how to design and implement an open platform for big data social science, including, in particular, personal data. To this end, we discuss a number of infrastructural, organizational and methodological principles to be developed for a concrete implementation. These include not only tools and methodologies that systematically enable both the empirical evaluation of privacy risk and data transformations using privacy-preserving approaches, but also the development of training materials (a massive open online course) and organizational instruments based on legal and ethical principles. This paper provides, by way of example, the implementation that was adopted within the context of the SoBigData Research Infrastructure.
Source: INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, vol. 11, pp. 377-390
DOI: 10.1007/s41060-020-00211-7
Project(s): SoBigData, SoBigData-PlusPlus
See at:
Vrije Universiteit Brussel Research Portal
| CNR IRIS
| ISTI Repository
| NARCIS
2022
Book
Open Access
IAIL 2022 - Imagining the AI Landscape after the AI Act
Dushi D, Naretto F, Panigutti C, Pratesi F
We summarize the first Workshop on Imagining the AI Landscape after the AI Act (IAIL 2022), co-located with the 1st International Conference on Hybrid Human-Artificial Intelligence (HHAI 2022), held on June 13, 2022 in Amsterdam, Netherlands.
Source: CEUR WORKSHOP PROCEEDINGS, vol. 3221
Project(s): CoHuBiCoL, TAILOR, HumanE-AI-Net, SoBigData-PlusPlus
See at:
ceur-ws.org
| CNR IRIS
| ISTI Repository
2013
Conference article
Restricted
Privacy-aware distributed mobility data analytics
Pratesi F, Monreale A, Wang H, Rinzivillo S, Pedreschi D, Andrienko G, Andrienko N
We propose an approach to preserve privacy in analytical processing within a distributed setting, and tackle the problem of obtaining aggregated information about vehicle traffic in a city from movement data collected by individual vehicles and shipped to a central server. Movement data are sensitive because they may describe typical movement behaviors and therefore be used for re-identification of individuals in a database. We provide a privacy-preserving framework for movement data aggregation based on trajectory generalization in a distributed environment. The proposed solution, based on the differential privacy model and on sketching techniques for efficient data compression, provides a formal data protection safeguard. Using real-life data, we demonstrate the effectiveness of our approach also in terms of data utility preserved by the data transformation.
Project(s): LIFT
See at:
CNR IRIS
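Illustrative note: the entry above combines differential privacy with traffic-count aggregation; the following is a minimal Python sketch of that idea, assuming per-cell vehicle counts perturbed with Laplace noise before release. The grid layout, epsilon value and sensitivity are hypothetical and not taken from the paper.

import numpy as np

def dp_traffic_counts(cell_counts, epsilon=1.0, sensitivity=1.0):
    # Add Laplace noise with scale sensitivity/epsilon to each cell count,
    # then round and clip so the released counts remain non-negative integers.
    noise = np.random.laplace(loc=0.0, scale=sensitivity / epsilon, size=len(cell_counts))
    return np.clip(np.round(cell_counts + noise), 0, None)

# Hypothetical true counts for five cells of a city tessellation.
true_counts = np.array([120, 35, 0, 7, 560])
print(dp_traffic_counts(true_counts, epsilon=0.5))

Smaller epsilon means stronger protection but noisier counts; the paper's actual construction also relies on trajectory generalization and sketching, which this toy snippet does not cover.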
2013
Contribution to book
Restricted
Privacy-preserving Distributed Movement Data Aggregation
Monreale A, Wang Wh, Pratesi F, Rinzivillo S, Pedreschi D, Andrienko G, Andrienko N
We propose a novel approach to privacy-preserving analytical processing within a distributed setting, and tackle the problem of obtaining aggregated information about vehicle traffic in a city from movement data collected by individual vehicles and shipped to a central server. Movement data are sensitive because people's whereabouts have the potential to reveal intimate personal traits, such as religious or sexual preferences, and may allow re-identification of individuals in a database. We provide a privacy-preserving framework for movement data aggregation based on trajectory generalization in a distributed environment. The proposed solution, based on the differential privacy model and on sketching techniques for efficient data compression, provides a formal data protection safeguard. Using real-life data, we demonstrate the effectiveness of our approach also in terms of data utility preserved by the data transformation.
Source: LECTURE NOTES IN GEOINFORMATION AND CARTOGRAPHY, pp. 225-245
DOI: 10.1007/978-3-319-00615-4_13
Project(s): DATA SIM
See at:
doi.org
| CNR IRIS
| link.springer.com
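Illustrative note: the entry above mentions sketching techniques for efficient data compression; below is a minimal sketch, assuming a Count-Min summary of visited-cell counts built locally by each vehicle. Width, depth and the hashing scheme are illustrative assumptions, not the chapter's exact construction.

import numpy as np

class CountMinSketch:
    # Fixed-size summary of a stream of items; estimates counts with bounded
    # overestimation while using far less space than exact per-item counters.
    def __init__(self, width=256, depth=4, seed=7):
        self.width, self.depth = width, depth
        rng = np.random.default_rng(seed)
        self.salts = rng.integers(1, 2**31 - 1, size=depth)
        self.table = np.zeros((depth, width), dtype=np.int64)

    def _index(self, item, row):
        return hash((int(self.salts[row]), item)) % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.table[row, self._index(item, row)] += count

    def estimate(self, item):
        return min(self.table[row, self._index(item, row)] for row in range(self.depth))

# Hypothetical usage: a vehicle summarises the grid cells it visited.
cms = CountMinSketch()
for cell in ["c12", "c12", "c40", "c07", "c12"]:
    cms.add(cell)
print(cms.estimate("c12"))  # at least 3; equal to 3 unless hash collisions occur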
2013
Other
Restricted
Differential privacy in distributed mobility analytics
Monreale A, Wang Wh, Pratesi F, Rinzivillo S, Pedreschi D, Andrienko G, Andrienko N
Movement data are sensitive, because people's whereabouts may allow re-identification of individuals in a de-identified database and thus can potentially reveal intimate personal traits, such as religious or sexual preferences. In this paper, we focus on a distributed setting in which movement data from individual vehicles are collected and aggregated by a centralized station. We propose a novel approach to privacy-preserving analytical processing within such a distributed setting, and tackle the problem of obtaining aggregated traffic information while preventing privacy leakage from data collection and aggregation. We study and analyze three different solutions based on the differential privacy model and on sketching techniques for efficient data compression. Each solution achieves a different trade-off between privacy protection and utility of the transformed data. Using real-life data, we demonstrate the effectiveness of our approaches in terms of data utility preserved by the data transformation, thus bringing empirical evidence to the fact that the "privacy-by-design" paradigm in big data analytics has the potential of delivering high data protection combined with high quality even in massively distributed techno-social systems.
See at:
CNR IRIS
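Illustrative note: the entry above compares solutions by their privacy/utility trade-off; the short sketch below, assuming the Laplace mechanism with sensitivity 1, shows how the expected absolute error per released count grows as epsilon shrinks. The values are purely illustrative.

# For Laplace noise with scale b = sensitivity / epsilon, E[|noise|] = b,
# so halving epsilon (stronger privacy) doubles the expected per-count error.
sensitivity = 1.0
for epsilon in (2.0, 1.0, 0.5, 0.1):
    expected_abs_error = sensitivity / epsilon
    print(f"epsilon={epsilon:>4}: expected |error| ~ {expected_abs_error:.1f} vehicles per cell")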
2018
Conference article
Open Access
Privacy Preserving Multidimensional Profiling
Pratesi F, Monreale A, Giannotti F, Pedreschi D
Recently, big data has become central in the analysis of human behavior and the development of innovative services. In particular, a new class of services is emerging that takes advantage of different sources of data in order to consider the multiple aspects of human beings. Unfortunately, these data can lead to re-identification problems and other privacy leaks, as widely reported in both the scientific literature and the media. The risk is even more pressing if multiple sources of data are linked together, since a potential adversary could know information related to each dataset. For this reason, it is necessary to accurately evaluate and mitigate the individual privacy risk before releasing personal data. In this paper, we propose a methodology for the first task, i.e., assessing privacy risk, in a multidimensional scenario, defining some possible privacy attacks and simulating them using real-world datasets.
Source: LECTURE NOTES OF THE INSTITUTE FOR COMPUTER SCIENCES, SOCIAL INFORMATICS AND TELECOMMUNICATIONS ENGINEERING, pp. 142-152. Pisa, Italy, 29-30/11/2017
DOI: 10.1007/978-3-319-76111-4_15
Project(s): SoBigData
See at:
CNR IRIS
| link.springer.com
| ISTI Repository
| Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
| Archivio istituzionale della Ricerca - Scuola Normale Superiore
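Illustrative note: the entry above proposes simulating privacy attacks to assess risk; below is a minimal sketch in which the empirical re-identification risk of a target is 1 over the number of records compatible with the adversary's background knowledge. The dataset, dimensions and values are hypothetical.

# Hypothetical multidimensional profiles: (home city, mobility profile, retail profile).
records = [
    ("Pisa", "weekday_commuter", "supermarket"),
    ("Pisa", "weekday_commuter", "bookshop"),
    ("Pisa", "night_owl", "supermarket"),
    ("Livorno", "weekday_commuter", "supermarket"),
]

def reidentification_risk(records, background_knowledge):
    # Risk = 1 / number of records matching everything the adversary knows.
    matches = sum(
        all(rec[i] == value for i, value in background_knowledge.items())
        for rec in records
    )
    return 0.0 if matches == 0 else 1.0 / matches

# Adversary knows the target lives in Pisa and is a weekday commuter.
print(reidentification_risk(records, {0: "Pisa", 1: "weekday_commuter"}))  # 0.5

Linking more dimensions shrinks the set of compatible records and therefore raises the risk, which is exactly why the multidimensional setting needs a dedicated assessment.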
2022
Journal article
Open Access
Where do migrants and natives belong in a community: a Twitter case study and privacy risk analysis
Kim J, Pratesi F, Rossetti G, Sîrbu A, Giannotti F
Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied the behaviours and social networks of these users. International migration studies have also benefited from this social media platform to improve migration statistics. Although diverse types of social networks have been studied so far on Twitter, the social networks of migrants and natives have not been studied before. This paper aims to fill this gap by studying the characteristics and behaviours of migrants and natives on Twitter. To do so, we perform a general assessment of features, including profiles and tweets, and an extensive network analysis. We find that migrants have more followers than friends. They have also tweeted more, even though both groups have similar account ages. More interestingly, the assortativity scores show that users tend to connect based on nationality more than on country of residence, and this is more the case for migrants than for natives. Furthermore, both natives and migrants tend to connect mostly with natives. The homophilic behaviours of users are also well reflected in the communities that we detected. Our additional privacy risk analysis shows that Twitter data can be used without exposing sensitive information about the users, minimising the risk of re-identification while respecting the GDPR.
Source: SOCIAL NETWORK ANALYSIS AND MINING, vol. 13 (issue 15)
DOI: 10.1007/s13278-022-01017-0
Project(s): SoBigData-PlusPlus
See at:
CNR IRIS
| link.springer.com
| ISTI Repository
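Illustrative note: the entry above reports assortativity by nationality and country of residence; the following sketch, assuming networkx is available, shows how attribute assortativity can be computed on a toy follower graph. The graph, attribute name and values are hypothetical.

import networkx as nx

# Hypothetical follower graph with a nationality attribute per user.
G = nx.Graph()
G.add_nodes_from([
    (1, {"nationality": "IT"}), (2, {"nationality": "IT"}),
    (3, {"nationality": "FR"}), (4, {"nationality": "FR"}),
    (5, {"nationality": "IT"}),
])
G.add_edges_from([(1, 2), (1, 5), (2, 5), (2, 3), (3, 4)])

# Values near +1 indicate homophily (ties mostly within the same nationality);
# values near -1 indicate ties mostly across nationalities.
print(nx.attribute_assortativity_coefficient(G, "nationality"))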
2022
Contribution to book
Open Access
Ethics in smart information systems
Pratesi F, Trasarti R, Giannotti F
This chapter analyses some of the ethical implications of recent developments in artificial intelligence (AI), data mining, machine learning and robotics. In particular, we start by summarising the more consolidated issues and solutions related to privacy in data management systems, moving towards the novel concept of explainability. The chapter reviews the development of the right to privacy and the right to explanation, culminating in the General Data Protection Regulation. However, the new kinds of big data (such as internet logs or GPS tracking) require a different approach to managing privacy requirements. Several solutions have been developed and are reviewed here. Our view is that data protection must generally be considered from the beginning, as novel AI solutions are developed, using the Privacy-by-Design paradigm. This involves a shift in perspective away from remedying problems towards preventing them instead. We conclude by covering the main requirements necessary to achieve a trustworthy scenario, as advised also by the European Commission. A step towards Trustworthy AI was achieved in the Ethics Guidelines for Trustworthy Artificial Intelligence produced by an expert group for the European Commission. The key elements of these guidelines are reviewed in this chapter. To ensure European independence and leadership, we must invest wisely by bundling, connecting and opening our AI resources, while also keeping in mind ethical priorities such as transparency and fairness.
DOI: 10.51952/9781447363972.ch009
DOI: 10.56687/9781447363972-012
DOI: 10.2307/j.ctv2tbwqd5.14
Project(s): TAILOR, PRO-RES, SoBigData-PlusPlus
See at:
bristoluniversitypressdigital.com
| doi.org
| doi.org
| CNR IRIS
| ISTI Repository
| doi.org
2024
Conference article
Open Access
Operationalizing the fundamental rights impact assessment for AI systems: the FRIA project
Savella R., Pratesi F., Trasarti R., Gatt L., Gaeta M. C., Caggiano I. A., Aulino L., Troisi E., Izzo L.
This paper presents the FRIA Project, a multidisciplinary research study which connects the legal and ethical aspects related to the impact of Artificial Intelligence systems on fundamental rights with the technical issues that arise in creating an automated tool for the Fundamental Rights Impact Assessment, which is the ultimate objective of this work.
Project(s): SoBigData-PlusPlus
See at:
CNR IRIS
| ital-ia2024.it
2013
Other
Restricted
Privacy-by-design in big data analytics and social mining
Monreale A, Rinzivillo S, Pratesi F, Giannotti F, Pedreschi D
Privacy is an ever-growing concern in our society: the lack of reliable privacy safeguards in many current services and devices means that their adoption is often more limited than expected. Moreover, people feel reluctant to provide true personal data unless it is absolutely necessary. Thus, privacy is becoming a fundamental aspect to take into account when one wants to use, publish and analyze data involving sensitive information. Unfortunately, it is increasingly hard to transform the data in a way that protects sensitive information: we live in the era of big data, characterized by unprecedented opportunities to sense, store and analyze social data describing human activities in great detail and resolution. As a result, privacy preservation simply cannot be accomplished by de-identification alone. In this paper, we propose the privacy-by-design paradigm to develop technological frameworks for countering the threats of undesirable, unlawful effects of privacy violation, without obstructing the knowledge discovery opportunities of social mining and big data analytical technologies. Our main idea is to inscribe privacy protection into the knowledge discovery technology by design, so that the analysis incorporates the relevant privacy requirements from the start.
See at:
CNR IRIS
2016
Other
Restricted
PRISQUIT: a system for assessing privacy risk versus quality in data sharing
Pratesi F, Monreale A, Trasarti R, Giannotti F, Pedreschi D, Yanagihara T
Data describing human activities are an important source of knowledge, useful for understanding individual and collective behavior and for developing a wide range of user services. Unfortunately, this kind of data is sensitive, because people's whereabouts may allow re-identification of individuals in a de-identified database. Therefore, Data Providers, before sharing those data, must apply some form of anonymization to lower the privacy risks, but they must also be aware of, and able to control, the resulting data quality, since these two factors are often a trade-off. In this paper we propose PRISQUIT (Privacy RISk versus QUalITy), a system enabling a privacy-aware ecosystem for sharing personal data. It is based on a methodology for assessing both the empirical (not theoretical) privacy risk associated with the users represented in the data, and the data quality guaranteed when only users not at risk are retained. Our proposal supports the Data Provider in the exploration of a repertoire of possible data transformations, with the aim of selecting one specific transformation that yields an adequate trade-off between data quality and privacy risk. We study the practical effectiveness of our proposal over three data formats underlying many services, defined on real mobility data: presence data, trajectory data and road segment data.
Project(s): SoBigData
See at:
CNR IRIS
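Illustrative note: the entry above describes exploring data transformations to balance privacy risk and data quality; below is a toy sketch of that exploration, assuming precomputed per-user risks and a simple quality score per candidate transformation. Transformation names, risk values and the quality measure are illustrative assumptions, not PRISQUIT's actual components.

# Hypothetical per-user re-identification risks after each candidate transformation,
# paired with a quality score (1.0 = original data, lower = more distortion).
candidates = {
    "raw trajectories":   {"risks": [1.0, 0.5, 1.0, 0.3], "quality": 1.00},
    "500 m grid, hourly": {"risks": [0.5, 0.2, 0.5, 0.1], "quality": 0.85},
    "2 km grid, daily":   {"risks": [0.1, 0.05, 0.1, 0.05], "quality": 0.60},
}

RISK_THRESHOLD = 0.2  # users with risk above this value are considered at risk

for name, candidate in candidates.items():
    at_risk = sum(r > RISK_THRESHOLD for r in candidate["risks"]) / len(candidate["risks"])
    print(f"{name:<20} at-risk users: {at_risk:.0%}  quality: {candidate['quality']:.2f}")

The Data Provider would pick the coarsest transformation whose residual risk is acceptable while its quality is still adequate for the intended analysis.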
2017
Journal article
Open Access
A data mining approach to assess privacy risk in human mobility data
Pellungrini R, Pappalardo L, Pratesi F, Monreale A
Human mobility data are an important proxy to understand human mobility dynamics, develop analytical services, and design mathematical models for simulation and what-if analysis. Unfortunately, mobility data are very sensitive since they may enable the re-identification of individuals in a database. Existing frameworks for privacy risk assessment provide data providers with tools to control and mitigate privacy risks, but they suffer from two main shortcomings: (i) they have a high computational complexity; (ii) the privacy risk must be recomputed every time new data records become available and for every selection of individuals, geographic areas, or time windows. In this article, we propose a fast and flexible approach to estimate privacy risk in human mobility data. The idea is to train classifiers to capture the relation between individual mobility patterns and the level of privacy risk of individuals. We show the effectiveness of our approach through an extensive experiment on real-world GPS data in two urban areas and investigate the relations between human mobility patterns and the privacy risk of individuals.
Source: ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY (PRINT), vol. 9 (issue 3), pp. 31:1-31:27
DOI: 10.1145/3106774
Project(s): SoBigData
See at:
ACM Transactions on Intelligent Systems and Technology
| doi.acm.org
| Archivio della Ricerca - Università di Pisa
| CNR IRIS
| ISTI Repository
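Illustrative note: the entry above trains classifiers to predict privacy risk from mobility features so that the expensive attack simulation need not be rerun for each new selection; the sketch below, assuming scikit-learn is available, shows that idea on hypothetical features and labels (they are not the paper's features or data).

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Hypothetical per-user mobility features: [n_visits, n_distinct_locations,
# radius_of_gyration_km, location_entropy]; label 1 = high re-identification risk.
X = np.array([
    [120, 40, 12.3, 3.1],
    [ 15,  3,  1.2, 0.8],
    [300, 75, 25.0, 4.0],
    [ 20,  5,  2.0, 1.1],
    [ 90, 30,  9.5, 2.7],
    [ 10,  2,  0.9, 0.5],
])
y = np.array([1, 0, 1, 0, 1, 0])

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.33, stratify=y, random_state=0
)
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))

Once trained on risks computed for one snapshot, such a classifier can score new users or new selections almost instantly instead of re-running the attack simulation.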