Abstract
Benford’s law (BL), also known as the first-digit or significant-digit law, is an intriguing pattern in data sets, considers the frequency of occurrence of the first digits, which are not uniformly distributed as might be expected, conversely follow a specified theoretical distribution. According to BL, the occurrence of first non-zero digit in a numerical data, which is generated or found in nature, depends on a logarithmic distribution. Least square estimation (LSE) method is mostly preferred for the estimation of the unknown parameters from different types of geodetic data. The residuals and the normalized residuals of the LSE method, which follow normal distribution and expected values of them are zero are used in outlier detection problem. In this study, BL is investigated for residuals and the normalized residuals estimated from LSE method. Three types of geodetic data are used: (1) simulated regression models, (2) global positioning system (GPS) data, (3) leveling network. The first group data sets are simulated based on linear regression and univariate models and each simulated group is generated for a number of 100, 1000, and 10,000 samples. To generate second group, an international global navigation satellite system (GNSS) service (IGS) station data (ISTA) is processed by kinematic PPP approach using GIPSY OASIS II v6.4 software. Here, the observation duration of GPS data is 4 days. For the last data, a leveling network with 55 points involving 110 observations of height differences is simulated. BL has been applied to the residuals (v) and normalized residuals (w) estimated from LSE method. Goodness-of-fit test has been implemented to determine whether a population has a specified BL distribution or not. This test is based on how good a fit we have between the frequency of occurrence of residuals and normalized residuals in an observed sample and the expected frequencies obtained from the hypothesized distribution. The results depending on the statistical test show that each data set (residuals and normalized residuals) used in this study follows BL.
Similar content being viewed by others
References
Ausloos M, Herteliu C, Ileanu B (2015) Breakdown of Benford’s law for birth data. Physica A 419:736–745. https://doi.org/10.1016/j.physa.2014.10.041
Becker PW (1982) Patterns in listings of failure-rate & MTTF values and listings of other data. IEEE Trans Reliab 31:132–134
Benford F (1938) The law of anomalous numbers. Proc Am Philos Soc 78(4):551–573
Berber M (2008) Error analysis of geodetic networks. Shaker Publishing, Maastricht
Berger A, Hill TP (2015) An introduction to Benford’s law. Princeton University Press, Princeton
Berger A, Hill TP, Rogers E (2009) Benford online bibliography. http://www.benfordonline.net. Last accessed 25 Mar 2019
Brown RJC (2005) Benford’s Law and the screening of analytical data: the case of pollutant concentrations in ambient air. Analyst 130:1280–1285. https://doi.org/10.1039/b504462f
Chandra Das R, Sekhar Mishra C, Rajib P (2017) Detection of anomalies in accounting data using Benford’s law: evidence from India. J Soc Sci Stud 4(1):123–139
Formann AK (2010) The Newcomb–Benford law in its relation to some common distributions. PLoS ONE 5(5):e10541. https://doi.org/10.1371/journal.pone.0010541
Hill TP (1995) A statistical derivation of the significant-digit law. Stat Sci 10(4):354–363
Hofmann-Wellenhof B, Lichtenegger H, Collins J (2001) Global positioning system: theory and practice, 5th edn. Springer, New York
Jamain A (2001) Benford’s law. Master’s thesis, Department of Mathematics, Imperial College of London and ENSIMAG, London, UK
Koch KR (1999) Parameter estimation and hypothesis testing in linear models, 2nd edn. Springer, Berlin
Kossovsky AE (2014) Benford’s law theory, the general law of relative quantities and forensic fraud detection applications. World Scientific Publishing Co.Pte.Ltd, Singapore
Mir TA (2012) The law of the leading digits and the world religions. Physica A 391:792–798. https://doi.org/10.1016/j.physa.2011.09.001
Mir TA (2014) The Benford law behavior of the religious activity data. Physica A 408:1–9. https://doi.org/10.1016/j.physa.2014.03.074
Nagasaka K (1984) On Benford’s law. Ann Inst Stat Math 36(Part A):337–352
Newcomb S (1881) Note on the frequency of use of the different digits in natural numbers. Am J Math 4(1):39–40
Sarkar BP (1973) An observation on the significant digits of binomial coefficients and factorials. Sankhyā Indian J Stat Ser B (1960–2002) 35(3):363–364
Teunissen PJG (2003) Adjustment theory an introduction. Delft University Press, Delft
Walpole RE, Myers RH, Myers SL (1998) Probability and statistics for engineers and scientists, 6th edn. Prentice Hall, Upper Saddle River
Whyman G, Ohtori N, Shulzinger E, Bormashenko E (2016) Revisiting the Benford law: when the Benford-like distribution of leading digits in sets of numerical data is expectable? Physica A 461:595–601
Wlodarski J (1971) Fibonacci and Lucas numbers tend to obey Benford’s law. Fibonacci Q 9:87–88
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tunalioglu, N., Erdogan, B. Usability of the Benford’s law for the results of least square estimation. Acta Geod Geophys 54, 315–331 (2019). https://doi.org/10.1007/s40328-019-00259-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40328-019-00259-3