Graphs from features: tree-based graph layout for feature analysis
dc.contributor.author | Minghim, Rosane | |
dc.contributor.author | Huancapaza, Liz | |
dc.contributor.author | Artur, Erasmo | |
dc.contributor.author | Telles, Guilherme P. | |
dc.contributor.author | Belizario, Ivar V. | |
dc.contributor.funder | Conselho Nacional de Desenvolvimento CientÃfico e Tecnológico | en |
dc.contributor.funder | Coordenação de Aperfeiçoamento de Pessoal de NÃvel Superior | en |
dc.contributor.funder | Conselho Nacional de Desenvolvimento CientÃfico e Tecnológico | en |
dc.date.accessioned | 2021-02-05T13:42:34Z | |
dc.date.available | 2021-02-05T13:42:34Z | |
dc.date.issued | 2020-11-18 | |
dc.date.updated | 2021-02-05T13:31:34Z | |
dc.description.abstract | Feature Analysis has become a very critical task in data analysis and visualization. Graph structures are very flexible in terms of representation and may encode important information on features but are challenging in regards to layout being adequate for analysis tasks. In this study, we propose and develop similarity-based graph layouts with the purpose of locating relevant patterns in sets of features, thus supporting feature analysis and selection. We apply a tree layout in the first step of the strategy, to accomplish node placement and overview based on feature similarity. By drawing the remainder of the graph edges on demand, further grouping and relationships among features are revealed. We evaluate those groups and relationships in terms of their effectiveness in exploring feature sets for data analysis. Correlation of features with a target categorical attribute and feature ranking are added to support the task. Multidimensional projections are employed to plot the dataset based on selected attributes to reveal the effectiveness of the feature set. Our results have shown that the tree-graph layout framework allows for a number of observations that are very important in user-centric feature selection, and not easy to observe by any other available tool. They provide a way of finding relevant and irrelevant features, spurious sets of noisy features, groups of similar features, and opposite features, all of which are essential tasks in different scenarios of data analysis. Case studies in application areas centered on documents, images and sound data demonstrate the ability of the framework to quickly reach a satisfactory compact representation from a larger feature set. | en |
dc.description.sponsorship | Conselho Nacional de Desenvolvimento CientÃfico e Tecnológico (grant number 307411/2016-8); Coordenação de Aperfeiçoamento de Pessoal de NÃvel Superior - Brasil (CAPES) (Finance Code 001); Conselho Nacional de Desenvolvimento CientÃfico e Tecnológico (grant number 310299/2018-7) | en |
dc.description.status | Peer reviewed | en |
dc.description.version | Published Version | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.articleid | 302 | en |
dc.identifier.citation | Minghim, R., Huancapaza, L., Artur, E., Telles, G. P. and Belizario, I. V. (2020) 'Graphs from Features: Tree-Based Graph Layout for Feature Analysis', Algorithms, 13 (11), 302 (23 pp). doi: 10.3390/a13110302 | en |
dc.identifier.doi | 10.3390/a13110302 | en |
dc.identifier.endpage | 23 | en |
dc.identifier.issn | 1999-4893 | |
dc.identifier.issued | 11 | en |
dc.identifier.journaltitle | Journal of Algorithms | en |
dc.identifier.startpage | 1 | en |
dc.identifier.uri | https://hdl.handle.net/10468/11043 | |
dc.identifier.volume | 13 | en |
dc.language.iso | en | en |
dc.publisher | MDPI | en |
dc.relation.uri | https://www.mdpi.com/1999-4893/13/11/302 | |
dc.rights | © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | Feature analysis | en |
dc.subject | Feature selection | en |
dc.subject | Multidimensional visualization | en |
dc.subject | Similarity trees | en |
dc.subject | Graph layouts | en |
dc.title | Graphs from features: tree-based graph layout for feature analysis | en |
dc.type | Article (peer-reviewed) | en |