2021
Conference article  Open Access

A co-occurrence based approach for mining overlapped co-clusters in binary data

Santa Rosa Nassar Dos Santos Y., De Santiago R., Perego R., Schaly M. H., Alvares L. O., Renso C., Bogorny V.

Trajectories  Co-clustering data mining  Clustering 

Co-clustering is a specific type of clustering that addresses the problem of simultaneously clustering objects and attributes of a data matrix. Although general clustering techniques find non-overlapping co-clusters, finding possible overlaps between co-clusters can reveal embedded patterns in the data that the disjoint clusters cannot discover. The overlapping co-clustering approaches proposed in the literature focus on finding global overlapped co-clusters and they might overlook interesting local patterns that are not necessarily identified as global co-clusters. Discovering such local co-clusters increases the granularity of the analysis, and therefore more specific patterns can be captured. This is the objective of the present paper, which proposes the new Overlapped Co-Clustering (OCoClus) method for finding overlapped co-clusters on binary data, including both global and local patterns. This is a non-exhaustive method based on the co-occurrence of attributes and objects in the data. Another novelty of this method is that it is driven by an objective cost function that can automatically determine the number of co-clusters. We evaluate the proposed approach on publicly available datasets, both real and synthetic data, and compare the results with a number of baselines. Our approach shows better results than the baseline methods on synthetic data and demonstrates its efficacy in real data.

Source: BRACIS 2021 - 10th Brazilian Conference on Intelligent Systems, pp. 375–389, Online Conference, 29/11/2021 - 3/12/2021


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:465551,
	title = {A co-occurrence based approach for mining overlapped co-clusters in binary data},
	author = {Santa Rosa Nassar Dos Santos Y. and De Santiago R. and Perego R. and Schaly M. H. and Alvares L. O. and Renso C. and Bogorny V.},
	doi = {10.1007/978-3-030-91702-9_25},
	booktitle = {BRACIS 2021 - 10th Brazilian Conference on Intelligent Systems, pp. 375–389, Online Conference, 29/11/2021 - 3/12/2021},
	year = {2021}
}

MASTER
Multiple ASpects TrajEctoRy management and analysis


OpenAIRE