2024
Conference article  Open Access

Information dissimilarity measures in decentralized knowledge distillation: a comparative analysis

Molo M. B., Vadicamo L., Carlini E., Gennaro C., Connor R.

Distributed intelligence  Divergence function  Knowledge distillation  Information dissimilarity measure 

Knowledge distillation (KD) is a key technique for transferring knowledge from a large, complex “teacher” model to a smaller, more efficient “student” model. Although initially developed for model compression, it has found applications across various domains due to the benefits of its knowledge transfer mechanism. While Cross Entropy (CE) and Kullback-Leibler (KL) are commonly used in KD, this work investigates the applicability of loss functions based on underexplored information dissimilarity measures, such as Triangular Divergence (TD), Structural Entropic Distance (SED), and Jensen-Shannon Divergence (JS), for both independent and identically distributed (iid) and non-iid data distributions. The primary contributions of this study include an empirical evaluation of these dissimilarity measures within a decentralized learning context, i.e., where independent clients collaborate without a central server coordinating the learning process. Additionally, the paper assesses the performance of clients by comparing pairwise distillation averaging among clients to conventional peer-to-peer pairwise distillation. Results indicate that while dissimilarity measures perform comparably in iid settings, non-iid distributions favor SED and JS, which also demonstrated consistent performance across clients.

Source: LECTURE NOTES IN COMPUTER SCIENCE, vol. 15268, pp. 140-154. Providence, USA, 4-6/11/2024


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:iris.cnr.it:20.500.14243/509644,
	title = {Information dissimilarity measures in decentralized knowledge distillation: a comparative analysis},
	author = {Molo M.  B. and Vadicamo L. and Carlini E. and Gennaro C. and Connor R.},
	doi = {10.1007/978-3-031-75823-2_12},
	booktitle = {LECTURE NOTES IN COMPUTER SCIENCE, vol. 15268, pp. 140-154. Providence, USA, 4-6/11/2024},
	year = {2024}
}

National Centre for HPC, Big Data and Quantum Computing
National Centre for HPC, Big Data and Quantum Computing

SUN
Social and hUman ceNtered XR


OpenAIRE