On the construction of an aggregated measure of the development of interval data

Młodak, Andrzej

doi:10.1007/s00180-013-0469-7

On the construction of an aggregated measure of the development of interval data

Original Paper
Published: 11 December 2013

Volume 29, pages 895–929, (2014)
Cite this article

Computational Statistics Aims and scope Submit manuscript

Andrzej Młodak¹

272 Accesses
16 Citations
Explore all metrics

Abstract

We analyse some possibilities for constructing an aggregated measure of the development of socio-economical objects in terms of their composite phenomenon (i.e., phenomenon described by many statistical features) if the relevant data are expressed as intervals. Such a measure, based on the deviation of the data structure for a given object from the benchmark of development is a useful tool for ordering, comparing and clustering objects. We present the construction of a composite phenomenon when it is described by interval data and discuss various aspects of stimulation and normalization of the diagnostic features as well as a definition of a benchmark of development (based usually on optimum or expected levels of these features). Our investigation includes the following options for the realization of this purpose: transformation of the interval model into a single–valued version without any significant loss of its statistical properties, standardization of pure intervals as well as definition of the interval “ideal” object. For the determination of a distance between intervals, the Hausdorff formula is applied. The simulation study conducted and the empirical analysis showed that the first two variants are especially useful in practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data generation processes and statistical management of interval data

Article 30 June 2016

Generalization of Jaccard Index for Interval Data Analysis

Article 01 March 2023

The zonoid region parameter depth

Article Open access 13 December 2022

References

Allen J (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843
Article MATH Google Scholar
Anand S, Sen AK (1993) Human development index: methodology and measurements, occasional papers no. 12, human development report office, United Nations Development Program. New York, USA, http://hdr.undp.org/en/media/HDI_methodology.pdf
Ben-Israel A, Iyigun C (2008) Probabilistic d-clustering. J Classif 25:5–26
Article MathSciNet MATH Google Scholar
Bock H-H, Diday E (eds) (2000) Analysis of symbolic data. Exploratory methods for extracting statistical information from complex data. Springer, Heidelberg
Chavent M (2004) A Hausdorff distance between hyper-rectangles for clustering interval data. In: Banks D, House L, McMorris F, Arabie P, Gaul W (eds) Classification, clustering and data mining applications. Springer, Berlin pp 333–339
Chavent M, Lechevallier Y (2002) Classification, dynamical clustering of interval data: optimization of an adequacy criterion based on Hausdorff distance. In: Jajuga K, Sokołowski A, Bock H-H (eds) Clustering and data analysis. Recent advances and applications. Springer, Berlin, pp 53–60
Google Scholar
Chavent M, de Carvalho FAT, Lechevallier Y, Verde R (2006) New clustering methods for interval data. Comput Stat 21:211–229
Article MATH Google Scholar
Chavent M, Saracco J (2008) On central tendency and dispersion measures for intervals and hypercubes. Commun Stat Theory Methods 37:1471–1482
Article MathSciNet MATH Google Scholar
CSO (2007) Life conditions of the population in Poland in years 2004–2005, Central Statistical Office of Poland, Department of Social Statistics, Warszawa. Available also at http://www.stat.gov.pl/cps/rde/xbcr/gus/publ_warunki_zycia2004-2005.pdf
Dennis I, Guio A-C (2003) Poverty and social exclusion in the EU after Laeken, part 1–2. In: Population and social conditions. Series: Statistics in focus, European Communities, EUROSTAT, Luxembourg, Theme 3, No. 8–9
Gioia F, Lauro CN (2006) Principal component analysis on interval data. Comput Stat 21:343–363
Article MathSciNet MATH Google Scholar
Haldane JBS (1948) Note on the median of a multivariate distribution. Biometrika 35:414–415
Article MathSciNet MATH Google Scholar
Hellwig Z (1968) Procedure to evaluating high level manpower data and typology of countries by means of the taxonomic method. Stat Rev XV(4), 307-327 (in Polish)
Huang M-H (2011) A comparison of three major academic rankings for World Universities: from a research evaluation perspective. J Libr Inf Stud 9:1–25
Google Scholar
Malina A, Zeliaś A (1998) On building taxonometric measures on living conditions. Stat Transition 3(3):523–544
Google Scholar
Młodak A (2002) An approach to the problem of spatial differentiation of multi-feature objects using methods of game theory. Stat Transition 5(5):857–872
MathSciNet Google Scholar
Młodak A (2004) An application of Shapley value coefficients in a numerical taxonomy. Stat Rev LI(4):101–114
Google Scholar
Młodak A (2006) Multilateral normalizations of diagnostic features. Stat Transition 7(5):1125–1139
Google Scholar
Młodak A, Kubacki J (2010) A typology of Polish farms using some fuzzy classification method. Stat Transition New Ser 11:615–638
Google Scholar
Młodak A (2011) Classification of multivariate objects using interval quantile classes. J Classif 28:327–362
Article Google Scholar
Munkres J (1999) Topology, 2nd edition. Prentice Hall, Englewood Cliffs
Renz M (2006) Enhanced query processing on complex spatial and temporal data. Dissertation an der Fakultät für Mathematik, Informatik und Statistik der Ludwig-Maximilians Universität München, München, Germany, available at: http://edoc.ub.uni-muenchen.de/archive/00006231/01/renz_matthias.pdf
Rodgers JL, Nicewander WA (1988) Thirteen ways to look at the correlation coefficient. Am Stat 42:59–66
Article Google Scholar
Rousseeuw PJ, Leroy AM (1987) Robust regression and outlier detection. Wiley, New York
Vandev DL (2002) Computing of trimmed L1—median, Laboratory of Computer Stochastics, Institute of Mathematics, Bulgarian Academy of Sciences, (preprint), available at http://www.fmi.uni-sofia.bg/fmi/statist/Personal/Vandev/papers/aspap.pdf
Weber A (1909, reprint 1971) Theory of Location of Industries, Translated with an introduction and notes by Carl J. Friedrich, Ed. by Russel & Russel, New York
Zeliaś A (2002) Some notes on the selection of normalization of diagnostic variables. Stat Transition 5(5):787–802
Google Scholar

Download references

Acknowledgments

I would like to express my gratitude to the anonymous reviewer for interesting and very useful comments and suggestions.

Author information

Authors and Affiliations

Statistical Office in Poznań, Branch in Kalisz, pl. J. Kilińskiego 13, 62–800 , Kalisz, Poland
Andrzej Młodak

Authors

Andrzej Młodak
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrzej Młodak.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Młodak, A. On the construction of an aggregated measure of the development of interval data. Comput Stat 29, 895–929 (2014). https://doi.org/10.1007/s00180-013-0469-7

Download citation

Received: 13 April 2010
Accepted: 21 November 2013
Published: 11 December 2013
Issue Date: October 2014
DOI: https://doi.org/10.1007/s00180-013-0469-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the construction of an aggregated measure of the development of interval data

Abstract

Access this article

Similar content being viewed by others

Data generation processes and statistical management of interval data

Generalization of Jaccard Index for Interval Data Analysis

The zonoid region parameter depth

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

On the construction of an aggregated measure of the development of interval data

Abstract

Access this article

Similar content being viewed by others

Data generation processes and statistical management of interval data

Generalization of Jaccard Index for Interval Data Analysis

The zonoid region parameter depth

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation