2022
Contribution to conference  Open Access

A topological pipeline for machine learning

Conti F.

Topological data analysis  Machine learning  Persistent homology  Pipeline 

The development of a topological pipeline for machine learning involves two crucial steps that strongly influence the performance of the pipeline. The first step is the choice of the filtration that associates a persistence diagram with digital data. The second step is the choice of the representation method for the persistence diagrams, which often relies on several parameters. In this work we develop a pipeline that associates persistence diagrams to digital data, via the most appropriate filtration for the type of data considered. Using a grid search approach, this pipeline determines optimal representation methods and parameters. We assess the performance of our pipeline, and in parallel we compare the different representation methods, on popular benchmark datasets. This work is a first step towards both an easy, ready to use, pipeline for data classification using persistent homology and machine learning, and to understand the theoretical reasons why, given a dataset and a task to be performed, a pair (filtration, topological representation) is better than another.

Source: Bridging applied and quantitative topology, Online conference, 09-13/05/2022



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:467058,
	title = {A topological pipeline for machine learning},
	author = {Conti F.},
	booktitle = {Bridging applied and quantitative topology, Online conference, 09-13/05/2022},
	year = {2022}
}
CNR ExploRA

Bibliographic record

ISTI Repository

Deposited version Open Access

Also available from

sites.google.comOpen Access