161 result(s)
Page Size: 10, 20, 50
Export: bibtex, xml, json, csv
Order by:

CNR Author operator: and / or
more
Typology operator: and / or
Language operator: and / or
Date operator: and / or
more
Rights operator: and / or
2005 Conference article Unknown
Speeding-up hierarchical agglomerative clustering in presence of expensive metrics
Nanni M.
In several contexts and domains, hierarchical agglomerative clustering (HAC) offers best-quality results, but at the price of a high complexity which reduces the size of datasets which can be handled. In some contexts, in particular, computing distances between objects is the most expensive task. In this paper we propose a pruning heuristics aimed at improving performances in these cases, which is well integrated in all the phases of the HAC process and can be applied to two HAC variants: single-linkage and complete-linkage. After describing the method, we provide some theoretical evidence of its pruning power, followed by an empirical study of its effectiveness over different data domains, with a special focus on dimensionality issues.Source: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 378–387, Hanoi, Vietnam, May 2005

See at: CNR ExploRA


2007 Contribution to book Open Access OPEN
Extracting trees of quantitative serial episodes
Nanni M., Rigotti C.
Among the family of the local patterns, episodes are commonly used when mining a single or multiple sequences of discrete events. An episode reflects a qualitative relation is-followed-by over event types, and the refinement of episodes to incorporate quantitative temporal information is still an on going research, with many application opportunities. In this paper, focusing on serial episodes, we design such a refinement called quantitative episodes and give a corresponding extraction algorithm. The three most salient features of these quantitative episodes are: (1) their ability to characterize main groups of homogeneous behaviors among the occurrences, according to the duration of the is-followed-by steps, and providing quantitative bounds of these durations organized in a tree structure; (2) the possibility to extract them in a complete way; and (3) to perform such extractions at the cost of a limited overhead with respect to the extraction of standard episodes.Source: Knowledge Discovery in Inductive Databases, edited by Sa?o D?eroski, Jan Struyf, pp. 170–188, 2007
DOI: 10.1007/978-3-540-75549-4_11
Metrics:


See at: liris.cnrs.fr Open Access | doi.org Restricted | Hyper Article en Ligne Restricted | link.springer.com Restricted | CNR ExploRA


2010 Journal article Open Access OPEN
Anonymization of moving objects databases by clustering and perturbation
Abul O., Bonchi F., Nanni M.
Preserving individual privacy when publishing data is a problem that is receiving increasing attention. Thanks to its simplicity the concept of k-anonymity, introduced by Samarati and Sweeney [1], established itself as one fundamental principle for privacy preserving data publishing. According to the k-anonymity principle, each release of data must be such that each individual is indistinguishable from at least k − 1 other individuals. In this article we tackle the problem of anonymization of moving objects databases. We propose a novel concept of k-anonymity based on co-localization, that exploits the inherent uncertainty of the moving object's whereabouts. Due to sampling and imprecision of the positioning systems (e.g., GPS) , the trajectory of a moving object is no longer a polyline in a three-dimensional space, instead it is a cylindrical volume, where its radius delta represents the possible location imprecision: we know that the trajectory of the moving object is within this cylinder, but we do not know exactly where. If another object moves within the same cylinder they are indistinguishable from each other. This leads to the definition of (k, delta)-anonymity for moving objects databases. We first characterize the (k, delta)-anonymity problem, then we recall NWA (Never Walk Alone), a method that we introduced in [2] based on clustering and spatial perturbation. Starting from a discussion on the limits of NWA we develop a novel clustering method that, being based on EDR distance [3], has the important feature of being time-tolerant. As a consequence it perturbs trajectories both in space and time. The novel method, named W4M(Wait for Me), is empirically shown to produce higher quality anonymization than NWA, at the price of higher computational requirements. Therefore, in order to make W4M scalable to large datasets, we introduce two variants based on a novel (and computationally cheaper) time-tolerant distance function, and on chunking. All the variants of W4M are empirically evaluated in terms of data quality and efficiency, and thoroughly compared to their predecessor NWA. Data quality is assessed both by means of objective measures of information distortion, and by more usability oriented measure, i.e., by comparing the results of (i) spatio-temporal range queries and (ii) frequent pattern mining, executed on the original database and on the (k,delta)-anonymized one. Experimental results over both real-world and synthetic mobility data confirm that, for a wide range of values of d and k, the relative distortion introduced by our anonymization methods is kept low. Moreover, the techniques introduced to make W4M scalable to large datasets, achieve their goal without giving up data quality in the anonymization process.Source: Information systems (Oxf.) 35 (2010): 884–910. doi:10.1016/j.is.2010.05.003
DOI: 10.1016/j.is.2010.05.003
Project(s): Veri Yayınlamada Hassas Bilgi Gizleme via OpenAIRE
Metrics:


See at: Aperta - TÜBİTAK Açık Arşivi Open Access | Information Systems Restricted | www.sciencedirect.com Restricted | CNR ExploRA


2010 Journal article Restricted
Dealing with interaction for complex systems modelling and prediction
Quattrociocchi W., Latorre D., Lodi E., Nanni M.
The increasing complexity of problems in the context of system modelling is leading to a new epistemological approach able to provide a representation which allows from one hand, to model complex phenomena with the support of mathematical and computational instruments, and on the other hand able to capture the global system description. In this paper is presented a methodology for complex dynamical systems modelling which is an extension of the supervised learning paradigm. The theoretical aspects of our methodology are introduced and then two different and heterogeneous case studies are presented.Source: International journal of artificial life research (Online) 1 (2010): 1–11. doi:10.4018/jalr.2010102101
DOI: 10.4018/jalr.2010102101
Metrics:


See at: International Journal of Artificial Life Research Restricted | www.igi-global.com Restricted | CNR ExploRA


2006 Contribution to conference Open Access OPEN
Quantitative episode trees
Nanni M., Rigotti C.
Among the family of the local patterns, episodes are com- monly used when mining a single or multiple sequences of discrete events. An episode re°ects a qualitative relation is-followed-by over event types, and the re¯nement of episodes to incorporate quantitative temporal in- formation is still an on going research, with many application opportu- nities. In this paper, focusing on serial episodes, we design such a re¯ne- ment called quantitative episodes and give a corresponding extraction algorithm. The three most salient features of these quantitative episodes are: (1) their ability to characterize main groups of homogeneous behav- iors among the occurrences, according to the duration of the is-followed- by steps, and providing quantitative bounds of these durations organized in a tree structure; (2) the possibility to extract them in a complete way; and (3) to perform such extractions at the cost of a limited overhead with respect to the extraction of standard episodes.Source: Workshop on Knowledge Discovery in Inductive Databases. KDID'06, Berlin, Germany, 18/09/2006

See at: ISTI Repository Open Access | CNR ExploRA


2006 Software Unknown
MiSTA v2.1
Mirco Nanni
Algoritmo di estrazione di pattern sequenziali con annotazioni temporali (tempi tipici di transizione) basato sull'integrazione stretta di metodi prefix-projection-based per pattern sequenziali e metodi di stima di densità basati su kernel.

See at: CNR ExploRA


2006 Software Unknown
TF-OPTICS: Time-focused density based clustering of trajectories
Margherita D'Auria, Mirco Nanni
Algoritmo di clustering density-based di traiettorie, con ricerca automatica dell'intervallo ottimale su cui focalizzare l'analisi.

See at: CNR ExploRA


2005 Report Open Access OPEN
Hierarchical agglomerative clustering in presence of expensive metrics (Extended Tech. Rep.)
Nanni M.
In several contexts and domains, hierarchical agglomerative clustering (HAC) offers best-quality results, but at the price of a high complexity which reduces the size of datasets which can be handled. In some contexts, in particular, computing distances between objects is the most expensive task. In all such situations the standard approach to HAC, which first computes all object-to-object distances and then performs the real clustering process, quickly yields high computational costs and large running times. One of the key means for containing such problem naturally lies in methods that can save a significant portion of distance computations, resulting in a smaller complexity. In this paper we propose a pruning heuristics well integrated in all the phases of the HAC process, developed for two HAC variants: single-linkage and complete-linkage. After describing the method, we provide some theoretical evidence of its pruning power, followed by an empirical study of its effectiveness over different data domains, with a special focus on dimensionality issues.Source: ISTI Technical reports, 2005

See at: ISTI Repository Open Access | CNR ExploRA


2006 Journal article Open Access OPEN
Time-focused clustering of trajectories of moving objects
Nanni M., Pedreschi D.
Spatio-temporal, geo-referenced datasets are growing rapidly, and will be more in the near future, due to both technological and social/commercial reasons. From the data mining viewpoint, spatio-temporal trajectory data introduce new dimensions and, correspondingly, novel issues in performing the analysis tasks. In this paper, we consider the clustering problem applied to the trajectory data domain. In particular, we propose an adaptation of a density-based clustering algorithm to trajectory data based on a simple notion of distance between trajectories. Then, a set of experiments on synthesized data is performed in order to test the algorithm and to compare it with other standard clustering approaches. Finally, a new approach to the trajectory clustering problem, called temporal focussing, is sketched, having the aim of exploiting the intrinsic semantics of the temporal dimension to improve the quality of trajectory clustering.Source: Journal of intelligent information systems 27 (2006): 267–289. doi:10.1007/s10844-006-9953-7
DOI: 10.1007/s10844-006-9953-7
Metrics:


See at: ISTI Repository Open Access | Journal of Intelligent Information Systems Restricted | www.springerlink.com Restricted | CNR ExploRA


2013 Contribution to book Unknown
Mobility data mining
Nanni M.
Source: Mobility Data - Modeling, Management, and Understanding, edited by Chiara Renso, Stefano Spaccapietra, Esteban Zimányi, pp. 105–124, 2013

See at: CNR ExploRA


2019 Journal article Open Access OPEN
Car telematics big data analytics for insurance and innovative mobility services
Longhi L., Nanni M.
Car telematics is a large and growing business sector aiming to collect mobility-related data (mainly private and commercial vehicles) and to develop services of various nature both for individual citizens and other companies. Such services and applications include information systems to support car insurances, info-mobility services, ad hoc studies for planning purposes, etc. In this work we report and discuss some of the key challenges that a car telematics pilot application is facing within the EU project "Track and Know". The paper introduces the overall context, the main business goals identified as potentially beneficial of big data solutions and the type of data sources that such applications can rely on (in particular, those available within the project for experimental studies), then discusses initial results of the solutions developed so far and ongoing lines of research. In particular, the discussion will focus on the most relevant applications identified for the project purposes, namely new services for car insurance, electric vehicles mobility and car- and ride-sharing.Source: Journal of ambient intelligence & humanized computing (Print) 11 (2019): 3989–3999. doi:10.1007/s12652-019-01632-4
DOI: 10.1007/s12652-019-01632-4
Project(s): Track and Know via OpenAIRE
Metrics:


See at: ISTI Repository Open Access | Journal of Ambient Intelligence and Humanized Computing Restricted | link.springer.com Restricted | CNR ExploRA


2016 Contribution to journal Open Access OPEN
Guest editors' introduction to the EcmlPkdd 2016 journal track special issue of Machine Learning
Gartner T., Nanni M., Passerini A., Robardet C.
Source: Data mining and knowledge discovery 30 (2016): 995–997. doi:10.1007/s10618-016-0476-8
DOI: 10.1007/s10618-016-0476-8
DOI: 10.1007/s10994-016-5587-3
Metrics:


See at: link.springer.com Open Access | Data Mining and Knowledge Discovery Open Access | Machine Learning Open Access | Data Mining and Knowledge Discovery Restricted | Machine Learning Restricted | CNR ExploRA


2016 Contribution to book Open Access OPEN
Partition-based clustering using constraint optimization
Grossi V., Guns T., Monreale A., Nanni M.
Partition-based clustering is the task of partitioning a dataset in a number of groups of examples, such that examples in each group are similar to each other. Many criteria for what constitutes a good clustering have been identified in the literature; furthermore, the use of additional constraints to find more useful clusterings has been proposed. In this chapter, it will be shown that most of these clustering tasks can be formalized using optimization criteria and constraints. We demonstrate how a range of clustering tasks can be modelled in generic constraint programming languages with these constraints and optimization criteria. Using the constraint-based modeling approach we also relate the DBSCAN method for density-based clustering to the label propagation technique for community discovery.Source: Data Mining and Constraint Programming. Foundations of a Cross-Disciplinary Approach, edited by Christian Bessiere, Luc De Raedt, Lars Kotthoff, Siegfried Nijssen, Barry O'Sullivan, Dino Pedreschi, pp. 282–299, 2016
DOI: 10.1007/978-3-319-50137-6_11
Metrics:


See at: lirias.kuleuven.be Open Access | Lirias Open Access | ISTI Repository Open Access | Vrije Universiteit Brussel Research Portal Restricted | doi.org Restricted | link.springer.com Restricted | researchportal.vub.be Restricted | CNR ExploRA


2016 Contribution to journal Open Access OPEN
Guest editors' introduction to the EcmlPkdd 2016 journal track special issue of Machine Learning
Gartner T., Nanni M., Passerini A., Robardet C.
Source: Machine learning 104 (2016): 149–150. doi:10.1007/s10994-016-5587-3
DOI: 10.1007/s10994-016-5587-3
DOI: 10.1007/s10618-016-0476-8
Metrics:


See at: link.springer.com Open Access | Data Mining and Knowledge Discovery Open Access | Machine Learning Open Access | ISTI Repository Open Access | Data Mining and Knowledge Discovery Restricted | Machine Learning Restricted | CNR ExploRA


2018 Conference article Closed Access
Advancements in mobility data analysis
Nanni M.
Some recent advancements in the area of Mobility Data Analysis are discussed, a field in which data mining and machine learning methods are applied to infer descriptive patterns and predictive models from digital traces of (human) movement.Source: 1st Italian Conference on Traffic Mining applied to Police Activities, TRAP 2017, pp. 11–16, Rome, Italy, 25-26/10/2017
DOI: 10.1007/978-3-319-75608-0_2
Metrics:


See at: doi.org Restricted | link.springer.com Restricted | CNR ExploRA


2020 Journal article Open Access OPEN
Ranking places in attributed temporal urban mobility networks
Nanni M., Tortosa L., Vicent J. F., Yeghikyan G.
Drawing on the recent advances in complex network theory, urban mobility flow patterns, typically encoded as origin-destination (OD) matrices, can be represented as weighted directed graphs, with nodes denoting city locations and weighted edges the number of trips between them. Such a graph can further be augmented by node attributes denoting the various socio-economic characteristics at a particular location in the city. In this paper, we study the spatio-temporal characteristics of "hotspots"of different types of socio-economic activities as characterized by recently developed attribute-augmented network centrality measures within the urban OD network. The workflow of the proposed paper comprises the construction of temporal OD networks using two custom data sets on urban mobility in Rome and London, the addition of socio-economic activity attributes to the OD network nodes, the computation of network centrality measures, the identification of "hotspots"and, finally, the visualization and analysis of measures of their spatio-temporal heterogeneity. Our results show structural similarities and distinctions between the spatial patterns of different types of human activity in the two cities. Our approach produces simple indicators thus opening up opportunities for practitioners to develop tools for real-time monitoring and visualization of interactions between mobility and economic activity in cities.Source: PloS one 15 (2020). doi:10.1371/journal.pone.0239319
DOI: 10.1371/journal.pone.0239319
Project(s): Track and Know via OpenAIRE, Track and Know via OpenAIRE
Metrics:


See at: PLoS ONE Open Access | PLoS ONE Open Access | PLoS ONE Open Access | Recolector de Ciencia Abierta, RECOLECTA Open Access | PLoS ONE Open Access | journals.plos.org Open Access | ISTI Repository Open Access | CNR ExploRA


2020 Conference article Open Access OPEN
Crash prediction and risk assessment with individual mobility networks
Guidotti R., Nanni M.
The massive and increasing availability of mobility data enables the study and the prediction of human mobility behavior and activities at various levels. In this paper, we address the problem of building a data-driven model for predicting car drivers' risk of experiencing a crash in the long-Term future, for instance, in the next four weeks. Since the raw mobility data, although potentially large, typically lacks any explicit semantics or clear structure to help understanding and predicting such rare and difficult-To-grasp events, our work proposes to build concise representations of individual mobility, that highlight mobility habits, driving behaviors and other factors deemed relevant for assessing the propensity to be involved in car accidents. The suggested approach is mainly based on a network representation of users' mobility, called Individual Mobility Networks, jointly with the analysis of descriptive features of the user's driving behavior related to driving style (e.g., accelerations) and characteristics of the mobility in the neighborhood visited by the user. The paper presents a large experimentation over a real dataset, showing comparative performances against baselines and competitors, and a study of some typical risk factors in the areas under analysis through the adoption of state-of-Art model explanation techniques. Preliminary results show the effectiveness and usability of the proposed predictive approach.Source: MDM 2020 - 21st IEEE International Conference on Mobile Data Management, pp. 89–98, Online conference, 30/06/2020 - 03/07/2020
DOI: 10.1109/mdm48529.2020.00030
Project(s): Track and Know via OpenAIRE, Track and Know via OpenAIRE
Metrics:


See at: ISTI Repository Open Access | doi.org Restricted | ieeexplore.ieee.org Restricted | CNR ExploRA


2020 Conference article Open Access OPEN
Discovering tourist attractions of cities using Flickr and OpenStreetMap Data
Vaziri F., Nanni M., Matwin S., Pedreschi D.
Tourism is a growing industry which needs accurate management and planning. Photography and tourism are inseparable; Photographs play the role of tourists' footprints during their visit to a touristic city. Nowadays, the large deployment of mobile devices and digital cameras has led to a massive increase in the volume of records of where people have been and when they were there. In this paper, we introduce a new method to automatically discover the touristic attractions of every single city with the use of two open-source platforms, Flickr and OpenStreetMap. We applied techniques to convert raw metadata of geotagged photos downloaded from Flickr to information about popular Points of Interest with the help of additional information retrieved from OpenStreetMap.Source: Advances in Tourism, Technology and Smart Systems, pp. 231–241, 5/12/2019
DOI: 10.1007/978-981-15-2024-2_21
Metrics:


See at: ISTI Repository Open Access | doi.org Restricted | link.springer.com Restricted | CNR ExploRA


2020 Conference article Open Access OPEN
Learning mobility flows from urban features with spatial interaction models and neural networks
Yeghikyan G., Opolka F. L., Nanni M., Lepri B., Lio P.
A fundamental problem of interest to policy makers, urban planners, and other stakeholders involved in urban development is assessing the impact of planning and construction activities on mobility flows. This is a challenging task due to the different spatial, temporal, social, and economic factors influencing urban mobility flows. These flows, along with the influencing factors, can be modelled as attributed graphs with both node and edge features characterising locations in a city and the various types of relationships between them. In this paper, we address the problem of assessing origin-destination (OD) car flows between a location of interest and every other location in a city, given their features and the structural characteristics of the graph. We propose three neural network architectures, including graph neural networks (GNN), and conduct a systematic comparison between the proposed methods and state-of-the-art spatial interaction models, their modifications, and machine learning approaches. The objective of the paper is to address the practical problem of estimating potential flow between an urban project location and other locations in the city, where the features of the project location are known in advance. We evaluate the performance of the models on a regression task using a custom data set of attributed car OD flows in London. We also visualise the model performance by showing the spatial distribution of flow residuals across London.Source: SMARTCOMP 2020 - IEEE International Conference on Smart Computing, pp. 57–64, Bologna, Italy, September 14-17, 2020
DOI: 10.1109/smartcomp50058.2020.00028
DOI: 10.48550/arxiv.2004.11924
Project(s): Track and Know via OpenAIRE, Track and Know via OpenAIRE
Metrics:


See at: arXiv.org e-Print Archive Open Access | ISTI Repository Open Access | doi.org Restricted | doi.org Restricted | ieeexplore.ieee.org Restricted | CNR ExploRA


2020 Master thesis Unknown
Simulating individual mobility network for electric vehicles
Shajari S.
Electric mobility appears to be one of the future ways to make cities more sustainable and improve the quality of life in urban environments. However, when it comes to private vehicles, users need to evaluate how their mobility lifestyle is going to change when their fuel-based vehicle is replaced by and electric one (EV). The objective of this work is to propose a process that, through a mix of mobility data analytics, ad hoc trip planning and simulation, is able to analyze the current fuel-based mobility of a user and quantitatively describe the impact of switching to EVs on her mobility life style. Exploiting a network- based representation of human mobility (Individual Mobility Networks), four simulation scenarios are considered, distinguished by the battery recharge options that the user might have in real life: recharging only at public stations, charging also at home, or also at work, or both. For each scenario we calculate how much battery the user has to charge in each charging option and how much time he has to wait for charging, as well as how much her original mobility (performed with a combustion engine) is affected by the limits of EVs, evaluating the expected increment in travel times and distances. This work is part of the activities of the H2020 European Project Track Know (https://trackandknowproject.eu/).Project(s): Track and Know via OpenAIRE

See at: CNR ExploRA