Applied Soft Computing (ISI), Volume 27, No. 1, February 2015, Pages 474-486

Title: Discernible visualization of high dimensional data using label information

Authors: Asef Pourmasoumi Hassankiadeh, Amin Zamiri, Hadi Sadoghi Yazdi, Abbas Ghaemi Bafghi

Citation: BibTeX | EndNote

Abstract

Visualization methods can significantly improve the outcome of automated knowledge discovery systems by involving human judgment. Star Coordinate is a visualization technique that maps k-dimensional data onto a circle using a set of axes sharing the same origin at the center of the circle. It lets users adjust this mapping, by scaling and rotating the axes, until no mapped point clouds (clusters) overlap one another. In this state, similar groups of data are easily detectable. However, an effective adjustment can be difficult or even impossible for the user in high dimensions. This is especially the case when the input space dimension is about 50 or more. In this paper, we propose a novel method for automatic axes adjustment for high dimensional data in the Star Coordinate visualization method. This method uses label information to find the best two-dimensional view point, one that minimizes intra-cluster distances while keeping the inter-cluster distances as large as possible. We call this view point a discernible visualization, where clusters are easily detectable by the human eye. The label information can be provided by the user or can be the result of running a conventional clustering method over the input data. The proposed approach optimizes the Star Coordinate representation by formulating the problem as the maximization of a Fisher discriminant. Therefore, the problem has a unique global solution and polynomial time complexity. We also prove that manipulating the scaling factor alone is sufficient to create any given visualization mapping. Moreover, it is shown that k-dimensional data visualization can be modeled as an eigenvalue problem. Using this approach, an optimal axes adjustment in the Star Coordinate method for high dimensional data can be achieved without any user intervention. The experimental results demonstrate the effectiveness of the proposed approach in terms of accuracy and performance.
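
To make the idea in the abstract concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes the classic Star Coordinate layout (k axes at equal angles, each point mapped to the sum of its coordinates times the axis vectors) and swaps in a textbook multi-class Fisher discriminant, solved as a generalized eigenvalue problem, to pick the axis vectors from labels. All function names (`star_axes`, `scatter_matrices`, `fisher_axes`, `separation`), the regularization term `reg`, and the synthetic data are illustrative assumptions. With only two classes the second Fisher direction carries no label information.

```python
import numpy as np


def star_axes(k, scales=None):
    """Axis vectors (k x 2): k axes at equal angles, axis i scaled by scales[i]."""
    angles = 2.0 * np.pi * np.arange(k) / k
    scales = np.ones(k) if scales is None else np.asarray(scales, dtype=float)
    return scales[:, None] * np.column_stack([np.cos(angles), np.sin(angles)])


def scatter_matrices(X, labels):
    """Within-class (Sw) and between-class (Sb) scatter matrices of X."""
    d = X.shape[1]
    overall_mean = X.mean(axis=0)
    Sw = np.zeros((d, d))
    Sb = np.zeros((d, d))
    for c in np.unique(labels):
        Xc = X[labels == c]
        class_mean = Xc.mean(axis=0)
        centered = Xc - class_mean
        Sw += centered.T @ centered
        diff = (class_mean - overall_mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    return Sw, Sb


def fisher_axes(X, labels, reg=1e-6):
    """Axis vectors (k x 2) from the top two Fisher discriminant directions.

    Solves Sb w = lambda Sw w by whitening with the Cholesky factor of Sw.
    `reg` is an assumed small ridge term that keeps Sw positive definite.
    """
    Sw, Sb = scatter_matrices(X, labels)
    Sw = Sw + reg * np.eye(Sw.shape[0])
    L = np.linalg.cholesky(Sw)                        # Sw = L L^T
    M = np.linalg.solve(L, np.linalg.solve(L, Sb).T)  # L^-1 Sb L^-T
    M = 0.5 * (M + M.T)                               # remove round-off asymmetry
    _, U = np.linalg.eigh(M)                          # eigenvalues in ascending order
    top2 = U[:, ::-1][:, :2]                          # two largest eigenvalues
    return np.linalg.solve(L.T, top2)                 # map back: w = L^-T u


def separation(Y, labels):
    """Ratio of between-class to within-class scatter in the 2-D view."""
    Sw, Sb = scatter_matrices(Y, labels)
    return np.trace(Sb) / np.trace(Sw)


# Synthetic data (an assumption for illustration): three classes in 10-D whose
# mean offsets cancel out under the default equal-angle, unit-scale layout.
rng = np.random.default_rng(0)
k, n = 10, 50
ones = np.ones(k)
alternating = np.where(np.arange(k) % 2 == 0, 1.0, -1.0)
means = [np.zeros(k), 3.0 * ones, 3.0 * alternating]
X = np.vstack([m + rng.normal(size=(n, k)) for m in means])
labels = np.repeat(np.arange(3), n)

Y_default = X @ star_axes(k)           # unadjusted Star Coordinate view
Y_fisher = X @ fisher_axes(X, labels)  # label-driven axis adjustment

sep_default = separation(Y_default, labels)
sep_fisher = separation(Y_fisher, labels)
```

In this sketch `Y_default` and `Y_fisher` are both n x 2 arrays that could be passed to any scatter-plot routine, and comparing `sep_default` with `sep_fisher` gives a rough numerical sense of how much the label-driven axes separate the classes. Here the axis vectors are allowed to change freely in both direction and length; the abstract's result that adjusting the scaling factors alone suffices is not reproduced.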

Keywords

Visualization, Star Coordinate, High dimensionality reduction, Fisher's discriminant form

@article{paperid:1049752,
author = {Pourmasoumi Hassankiadeh, Asef and Zamiri, Amin and Sadoghi Yazdi, Hadi and Ghaemi Bafghi, Abbas},
title = {Discernible visualization of high dimensional data using label information},
journal = {Applied Soft Computing},
year = {2015},
volume = {27},
number = {1},
month = {February},
issn = {1568-4946},
pages = {474--486},
numpages = {13},
keywords = {Visualization; Star Coordinate; High dimensionality reduction; Fisher's discriminant form},
}


%0 Journal Article
%T Discernible visualization of high dimensional data using label information
%A Pourmasoumi Hassankiadeh, Asef
%A Zamiri, Amin
%A Sadoghi Yazdi, Hadi
%A Ghaemi Bafghi, Abbas
%J Applied Soft Computing
%@ 1568-4946
%D 2015
