Graphs from features: tree-based graph layout for feature analysis

dc.contributor.authorMinghim, Rosane
dc.contributor.authorHuancapaza, Liz
dc.contributor.authorArtur, Erasmo
dc.contributor.authorTelles, Guilherme P.
dc.contributor.authorBelizario, Ivar V.
dc.contributor.funderConselho Nacional de Desenvolvimento Científico e Tecnológicoen
dc.contributor.funderCoordenação de Aperfeiçoamento de Pessoal de Nível Superioren
dc.contributor.funderConselho Nacional de Desenvolvimento Científico e Tecnológicoen
dc.date.accessioned2021-02-05T13:42:34Z
dc.date.available2021-02-05T13:42:34Z
dc.date.issued2020-11-18
dc.date.updated2021-02-05T13:31:34Z
dc.description.abstractFeature Analysis has become a very critical task in data analysis and visualization. Graph structures are very flexible in terms of representation and may encode important information on features but are challenging in regards to layout being adequate for analysis tasks. In this study, we propose and develop similarity-based graph layouts with the purpose of locating relevant patterns in sets of features, thus supporting feature analysis and selection. We apply a tree layout in the first step of the strategy, to accomplish node placement and overview based on feature similarity. By drawing the remainder of the graph edges on demand, further grouping and relationships among features are revealed. We evaluate those groups and relationships in terms of their effectiveness in exploring feature sets for data analysis. Correlation of features with a target categorical attribute and feature ranking are added to support the task. Multidimensional projections are employed to plot the dataset based on selected attributes to reveal the effectiveness of the feature set. Our results have shown that the tree-graph layout framework allows for a number of observations that are very important in user-centric feature selection, and not easy to observe by any other available tool. They provide a way of finding relevant and irrelevant features, spurious sets of noisy features, groups of similar features, and opposite features, all of which are essential tasks in different scenarios of data analysis. Case studies in application areas centered on documents, images and sound data demonstrate the ability of the framework to quickly reach a satisfactory compact representation from a larger feature set.en
dc.description.sponsorshipConselho Nacional de Desenvolvimento Científico e Tecnológico (grant number 307411/2016-8); Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) (Finance Code 001); Conselho Nacional de Desenvolvimento Científico e Tecnológico (grant number 310299/2018-7)en
dc.description.statusPeer revieweden
dc.description.versionPublished Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.articleid302en
dc.identifier.citationMinghim, R., Huancapaza, L., Artur, E., Telles, G. P. and Belizario, I. V. (2020) 'Graphs from Features: Tree-Based Graph Layout for Feature Analysis', Algorithms, 13 (11), 302 (23 pp). doi: 10.3390/a13110302en
dc.identifier.doi10.3390/a13110302en
dc.identifier.endpage23en
dc.identifier.issn1999-4893
dc.identifier.issued11en
dc.identifier.journaltitleJournal of Algorithmsen
dc.identifier.startpage1en
dc.identifier.urihttps://hdl.handle.net/10468/11043
dc.identifier.volume13en
dc.language.isoenen
dc.publisherMDPIen
dc.relation.urihttps://www.mdpi.com/1999-4893/13/11/302
dc.rights© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectFeature analysisen
dc.subjectFeature selectionen
dc.subjectMultidimensional visualizationen
dc.subjectSimilarity treesen
dc.subjectGraph layoutsen
dc.titleGraphs from features: tree-based graph layout for feature analysisen
dc.typeArticle (peer-reviewed)en
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
algorithms-13-00302.pdf
Size:
28.34 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: