University of Leicester
Browse

Robust principal graphs for data approximation

Download (2.9 MB)
journal contribution
posted on 2018-05-25, 10:46 authored by Alexander N. Gorban, Evgeny M. Mirkes, Andrei Zinovyev
Revealing hidden geometry and topology in noisy data sets is a challenging task. Elastic principal graph is a computationally efficient and flexible data approximator based on embedding a graph into the data space and minimizing the energy functional penalizing the deviation of graph nodes both from data points and from pluri-harmonic configuration (generalization of linearity). The structure of principal graph is learned from data by application of a topological grammar which in the simplest case leads to the construction of principal curves or trees. In order to more efficiently cope with noise and outliers, here we suggest using a trimmed data approximation term to increase the robustness of the method. The modification of the method that we suggest does not affect either computational efficiency or general convergence properties of the original elastic graph method. The trimmed elastic energy functional remains a Lyapunov function for the optimization algorithm. On several examples of complex data distributions we demonstrate how the robust principal graphs learn the global data structure and show the advantage of using the trimmed data approximation term for the construction of principal graphs and other popular data approximators.

History

Citation

Archives of Data Science, Series A, 2017, 2(1) 16 S. online

Author affiliation

/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Mathematics

Source

ECDA2015 (European Conference on Data Analysis), University of Essex, Colchester, UK

Version

  • VoR (Version of Record)

Published in

Archives of Data Science

Publisher

Institut für Informationswirtschaft und Marketing (IISM)

issn

2363-9881

Copyright date

2017

Available date

2018-05-25

Publisher version

https://publikationen.bibliothek.kit.edu/1000068111

Notes

The Parameters of the methods used are provided together with the code from the corresponding GitHub https://github.com/auranic/Elastic-principal-graphs/wiki/Robust-principalgraphs

Temporal coverage: start date

2015-09-02

Temporal coverage: end date

2015-09-04

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC