Modeling and Scaling of Categorical Data

Läuter, Henning; Ramadan, Ayad

Estimation and testing of distributions in metric spaces are well known. R.A. Fisher, J. Neyman, W. Cochran and M. Bartlett achieved essential results on the statistical analysis of categorical data. In the last 40 years many other statisticians found important results in this field. Often data sets contain categorical data, e.g. levels of factors or names. There does not exist any ordering or any distance between these categories. At each level there are measured some metric or categorical values. We introduce a new method of scaling based on statistical decisions. For this we define empirical probabilities for the original observations and find a class of distributions in a metric space where these empirical probabilities can be found as approximations for equivalently defined probabilities. With this method we identify probabilities connected with the categorical data and probabilities in metric spaces. Here we get a mapping from the levels of factors or names into points of a metric space. This mapping yields the scale for theEstimation and testing of distributions in metric spaces are well known. R.A. Fisher, J. Neyman, W. Cochran and M. Bartlett achieved essential results on the statistical analysis of categorical data. In the last 40 years many other statisticians found important results in this field. Often data sets contain categorical data, e.g. levels of factors or names. There does not exist any ordering or any distance between these categories. At each level there are measured some metric or categorical values. We introduce a new method of scaling based on statistical decisions. For this we define empirical probabilities for the original observations and find a class of distributions in a metric space where these empirical probabilities can be found as approximations for equivalently defined probabilities. With this method we identify probabilities connected with the categorical data and probabilities in metric spaces. Here we get a mapping from the levels of factors or names into points of a metric space. This mapping yields the scale for the categorical data. From the statistical point of view we use multivariate statistical methods, we calculate maximum likelihood estimations and compare different approaches for scaling.… show more

Author details:	Henning Läuter, Ayad Ramadan
URN:	urn:nbn:de:kobv:517-opus-49572
Publication series (Volume number):	Mathematische Statistik und Wahrscheinlichkeitstheorie : Preprint (2010, 03)
Publication type:	Preprint
Language:	German
Publication year:	2010
Publishing institution:	Universität Potsdam
Release date:	2011/03/31
RVK - Regensburg classification:	SI 990
Organizational units:	Mathematisch-Naturwissenschaftliche Fakultät / Institut für Mathematik
DDC classification:	5 Naturwissenschaften und Mathematik / 51 Mathematik / 510 Mathematik
License (German):	Keine öffentliche Lizenz: Unter Urheberrechtsschutz

Modeling and Scaling of Categorical Data

Download full text files

Export metadata

Additional Services