Mumic, N., & Filzmoser, P. (2021). A multivariate test for detecting fraud based on Benford’s law, with application to music streaming data. Statistical Methods and Applications, 30(3), 819–840. https://doi.org/10.1007/s10260-021-00582-6
Statistics and Probability; Statistics, Probability and Uncertainty
en
Abstract:
Benford's law became a prevalent concept for fraud and anomaly detection. It examines the frequencies of the leading digits of numbers in a collection of data and states that the leading digit is most often 1, with diminishing frequencies up to 9. In this paper we propose a multivariate approach to test whether the observed frequencies follow the theoretical Benford distribution. Our approach is based on the
concept of compositional data, which examines the relative information between the frequencies of the leading digits. As a result, we introduce a multivariate test for Benford distribution. In simulation studies and examples we compare the multivariate test performance to the conventional chi-square and Kolmogorov-Smirnov test, where the multivariate test turns out to be more sensitive in many cases. A
diagnostics plot based on relative information allows to reveal and interpret the possible deviations from the Benford distribution.
en
Research Areas:
Computational Materials Science: 50% Modelling and Simulation: 50%