2002
Conference article  Open Access

Clustering transactional data

Giannotti F, Gozzi C, Manco G

K-Means Algoritm  Transactional data clustering  Transactional clustering  K-Means algoritm  Clustering 

In this paper we present a partitioning method capable to manage transactions, namelyt uples of variable size of categorical data. We adapt the standard definition of mathematical distance used in the KMeans algorithm to represent dissimilarityam ong transactions, and redefine the notion of cluster centroid. The cluster centroid is used as the representative of the common properties of cluster elements. We show that using our concept of cluster centroid together with Jaccard distance we obtain results that are comparable in qualityw ith the most used transactional clustering approaches, but substantiallyi mprove their efficiency.

Publisher: Springer-Verlag


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:91432,
	title = {Clustering transactional data},
	author = {Giannotti F and Gozzi C and Manco G},
	publisher = {Springer-Verlag},
	doi = {10.1007/3-540-45681-3_15},
	year = {2002}
}