Title : ( The k-nearest Neighbor Classification of Histogram- and Trapezoid-Valued Data )
Authors: Fathimah Al Mashumah , Mostafa Razmkhah , Sohrab Effati ,Access to full-text not allowed by authors
Abstract
A histogram-valued observation is a specific type of symbolic objects that represents its value by a list of bins (intervals) along with their corresponding relative frequencies or probabilities. In the literature, the raw data in bins of all histogram-valued data have been assumed to be uniformly distributed. A new representation of such observations is proposed in this paper by assuming that the raw data in each bin are linearly distributed, which are called trapezoid-valued data. Moreover, new definitions of union and intersection between trapezoid-valued observations are made. This study proposes the k-nearest neighbour technique for classifying histogram-valued data using various dissimilarity measures. Further, the limiting behaviour of the computational complexities based on the performed dissimilarity measures are compared. To study the effect of using a distance instead of a dissimilarity measure, the Wasserstein distance is also used and the accuracy of the classification is compared. Some simulations are done to study the performance of the proposed procedures. Also, the results are applied to three various real data sets. Eventually, some conclusions are stated.
Keywords
, Dissimilarity measure; Histogram-valued data (HVD); Supervised learning; Trapezoid-valued data (TVD), Wasserstein distance.@article{paperid:1092123,
author = {Al Mashumah, Fathimah and Razmkhah, Mostafa and Effati, Sohrab},
title = {The k-nearest Neighbor Classification of Histogram- and Trapezoid-Valued Data},
journal = {Statistics, Optimization and Information Computing},
year = {2022},
volume = {10},
number = {1},
month = {September},
issn = {2311-004X},
pages = {1187--1203},
numpages = {16},
keywords = {Dissimilarity measure; Histogram-valued data (HVD); Supervised learning; Trapezoid-valued data (TVD);
Wasserstein distance.},
}
%0 Journal Article
%T The k-nearest Neighbor Classification of Histogram- and Trapezoid-Valued Data
%A Al Mashumah, Fathimah
%A Razmkhah, Mostafa
%A Effati, Sohrab
%J Statistics, Optimization and Information Computing
%@ 2311-004X
%D 2022