%0 Journal Article %T Depth-Based Classification for Distributions with Nonconvex Support %A Daniel Hlubinka %A Ondrej Vencalek %J Journal of Probability and Statistics %D 2013 %I Hindawi Publishing Corporation %R 10.1155/2013/629184 %X Halfspace depth became a popular nonparametric tool for statistical analysis of multivariate data during the last two decades. One of applications of data depth considered recently in literature is the classification problem. The data depth approach is used instead of the linear discriminant analysis mostly to avoid the parametric assumptions and to get better classifier for data whose distribution is not elliptically symmetric, for example, skewed data. In our paper, we suggest to use weighted version of halfspace depth rather than the halfspace depth itself in order to obtain lower misclassification rate in the case of ¡°nonconvex¡± distributions. Simulations show that the results of depth-based classifiers are comparable with linear discriminant analysis for two normal populations, while for nonelliptic distributions the classifier based on weighted halfspace depth outperforms both linear discriminant analysis and classifier based on the usual (nonweighted) halfspace depth. 1. Introduction Data classification or discrimination is an old statistical problem which has been discussed, studied, and applied since more than century. Therefore we will recall the problem only very briefly. The goal of the classification is to allocate a new observation into one of two (or more) groups. The rule for assessing a new observation to one of the possible groups is created by analysis of available observations with known group assignment (so-called training set). The most popular classification method is based on normality assumption, and it is known as the linear discriminant analysis (LDA). LDA is easy to use and quite successful in many cases, in particular for elliptically symmetric distributions. Naturally there are also nonparametric methods of classification. One of the motivations for replacing the LDA method is to avoid the normality assumption and therefore, hopefully, get better results for nonelliptic distributions. Recently it was proposed to use data depth as the basis for new nonparametric classifiers, see, for example, [1] or [2]. Let us recall the essential facts about the data depth. A depth of a point with respect to a probability measure is a nonnegative number which measures the ¡°centrality¡± of with respect to . In other words, depth should reflect position of the point with respect to the probability distribution, or in the sample version, position of the point with respect to the observed data cloud. Recall also that for a multivariate data there is no natural linear ordering available. The depth, as a nonnegative (and bounded) number allows to %U http://www.hindawi.com/journals/jps/2013/629184/