Detection of outliers in processing of small size data

  • V. S. Popukaylo, Taras Shevchenko Transnistria State University, Tiraspol, Republic of Moldova
Keywords: small size data, outlier detection criteria, anomalous measurements, outlier analysis

Abstract

This article examines how the power of outlier detection criteria depends on sample size when the sample is small. Removing outliers is one of the stages of signal pre-processing. To study the problem, a statistical experiment was conducted: for each value of n, a random number generator produced data arrays of several thousand samples drawn from a normal distribution with a given mean and standard deviation. The capabilities of the Grubbs, Dixon, Tietjen-Moore, Irwin, Chauvenet, Lvovsky and Romanovsky criteria were investigated and illustrated for sample sizes from 5 to 20 measurements. Conclusions were drawn about the applicability of each criterion for outlier detection in the processing of small size data. The Lvovsky criterion was recognized as the optimal criterion. Dixon's criterion is recommended for n ≤ 10, and Irwin's criterion for n ≥ 10. The Tietjen-Moore criterion can be recommended for detecting outliers in small samples with n > 5, since it reliably recognizes errors at values of x̄ + 4σ and produces the fewest type I errors. Grubbs' criterion with an unknown standard deviation may be used for samples with n ≥ 15. The Chauvenet and Romanovsky criteria cannot be recommended for the detection of outliers in small size data.
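
The kind of experiment described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the author's procedure: the function names, the number of trials, the seed, and the choice to plant a single value at the mean plus 4σ are all hypothetical. Chauvenet's criterion is used purely because it needs nothing beyond the standard library. For each n, the sketch estimates how often the criterion flags a value in clean normal data (type I errors) and how often it catches the planted value.

```python
import random
from statistics import NormalDist, mean, stdev


def chauvenet_flags(sample):
    """Return indices of values rejected by Chauvenet's criterion."""
    n = len(sample)
    m, s = mean(sample), stdev(sample)
    std_normal = NormalDist()
    flagged = []
    for i, x in enumerate(sample):
        z = abs(x - m) / s
        # Two-sided probability of a deviation at least this large.
        tail = 2.0 * (1.0 - std_normal.cdf(z))
        # Reject when fewer than half an observation is expected that far out.
        if n * tail < 0.5:
            flagged.append(i)
    return flagged


def run_experiment(n, trials=2000, mu=0.0, sigma=1.0, shift=4.0, seed=0):
    """Estimate (false alarm rate, detection rate) for samples of size n.

    All parameters are illustrative assumptions, not values from the article.
    """
    rng = random.Random(seed)
    false_alarms = 0
    detections = 0
    for _ in range(trials):
        clean = [rng.gauss(mu, sigma) for _ in range(n)]
        # Type I error: something is flagged although the data are clean.
        if chauvenet_flags(clean):
            false_alarms += 1
        # Replace the last value with a planted outlier at mu + shift * sigma.
        contaminated = clean[:-1] + [mu + shift * sigma]
        if (n - 1) in chauvenet_flags(contaminated):
            detections += 1
    return false_alarms / trials, detections / trials


if __name__ == "__main__":
    for n in (5, 10, 15, 20):
        fa, det = run_experiment(n)
        print(f"n={n:2d}  false alarm rate={fa:.3f}  detection rate={det:.3f}")
```

Swapping `chauvenet_flags` for another criterion (Grubbs, Dixon, and so on) would require that criterion's tabulated critical values, which are not reproduced here.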

Published
2016-10-29
How to Cite
Popukaylo, V. S. (2016). Detection of outliers in processing of small size data. Technology and Design in Electronic Equipment, (4–5), 42-46. https://doi.org/10.15222/TKEA2016.4-5.42