Qualitative analysis of data sets in the study of social and economic processes

Olga Ivanivna Korytska


Introduction. Ssocio-economic phenomena research provides monitoring and gathering of information according to various indicators that characterize them. Analysis of socio-economic phenomena which is associated with handling large amounts of data information has been carried out. A set of evaluation data is characterized by heterogeneity and inaccuracy. Information is formed by many experts, so it can  contain questionable and unprocessed data. For efficient and reliable evaluation of socio-economic phenomena it is necessary to carry out a qualitative analysis of data sets. This will ensure the formation of a homogeneous set of data, extracting of atypical data with abnormally large positive or negative value, improving of the reliability of the results of the study.

Purpose. The aim of the article is to consider and develop a method of qualitative analysis of data sets in the process of evaluation of  the socio-economic phenomena.

Results. The overview of the most common methods for assessing general data set has been done. Attention is drawn to imperfect methods of qualitative analysis of data sets in the process of evaluation of the socio-economic phenomena. Qualitative analysis of data sets containing an analysis of lost and uncertain information has been carried out. Methods for lost and uncertain information treatment are recommended. If there is a loss of information then it should be used "reduce to zero", conditional coding, method of maximum likelihood of  principal components. If there is questionable information its identification and evaluation should be held. Identification is carried out according to the criteria F.Hrabbs, G. Moore, N. Smirnov, G. Tityen. On the basis of assessment of data the questionable information should be eliminated from further processing or shoul be modified. The variants of information modification using robust statistics methods are proposed. The latter includes methods of Ch. Vinzor, A. Poincare, Jh. Tukey, P. Huber. Qualitative analysis of the data sets in the process of evaluation of the socio-economic phenomena involves the detection and treatment of atypical information. We have represented a generalized scheme of qualitative analysis of data sets. The technique can be applied in the study of various economic activities. Further research will involve testing methods for evaluation of the efficiency of industrial production in Ukraine.


criteria for evaluation; robust statistical methods; set of data; «doubtful» observation; missed information; qualitative analysis

Full Text:



Aivazian, S. A., Enukov, I. S., Meshalkin, L. D. (1983). Applied statistics: Basics modeling and analysis of spatial data [Prykladnaia statystyka: Osnovy modelirovania i pervichnaia obrabotka danykh ]. Moscow: Finance and statistic.

Dubrov, A. M., Mhitarian, I. S., Troshun, L. I. (2003). Multivariate statistic methods [Mnogomernye statisticheskie metody]. Moscow: Finance and statistic.

Soshnikova, L. A., Tomashevuch, V. N., Uebe, H., Sheffer, M. (1999). Multivariate statistic analysis in economics [Mnogomernyi statisticheskii analiz v ekonomike]. Moscov: YUNITI-DANA.

Hasings, C., Mosteller, F., Tukey, J. W., Winsor, C.P. (1947). Low moments for small samples: A comparative study of order statistics.

Grubbs, F. E. (1950). Sample Criteria for Testing Outlying observations.

Puankare, A. (1910). Method and science [Nauka i metod]. Sankt-Peterburg.

Smirnov, N. V. (1941). Estimation of the maximum term in the series of observations [Otsenka maksimalnogo chlena v riadu nabludenii].

Tietje,n G., Moore, H. (1972). Some Grubb’s type statistics for the detection of several outliers. Technometrics.

Andrews, David F; Peter, J Bickel; Frank, R. Hampel; Peter J. Huber; W H Rogers & John W Tukey (1972). Robust estimates of location: survey and advances. Princeton University Press.

Huber, G. P. (1984). Robust statistics [Robastnost v statistike]. Moscov: Mir.

Bondina, N. M., Povoroznuk, A. I., Shein, O. M. (2013). Computerization of specialized media [Komputeruzatsia spetsializovanykh seredovyshch]. Kharkiv: NTU: HPY.

Comparative analysis of the criteria blunders [Sravnitelnyi analiz kriteriev grubych oshybok], from : http://www.pisali.ru/zalvladimir/98176/.

Gauss, C. F. (1809). Theoria motus corporum coelestium. Gamburg.

Box, G. E. P., Draper, N. R. (n. d.). A basis for the selection of a response surface design. J. Amer. Statist. Ass.

Kolmogorov, A. N. (1989). Probability theory and mathematical statistics [Teoria veroiatnostei i matematicheskaia statystyka ]. Moscow: Nauka.

Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM


  • There are currently no refbacks.

Copyright (c)


Articles are distributed under Creative Commons Attribution  International 4.0 (CC-BY-NC 4.0) 

Science Works Journal "Ekonomichnyy analiz"

ISSN 1993-0259 (Print)  ISSN 2219-4649 (Online) DOI: 10.35774/econa

© West Ukrainian National University

© «Ekonomichnyy analiz», 2007-2023