Deviation (statistics)

In mathematics and statistics, deviation is the difference between a random variate (the observed value of a random variable) and a reference value (often a measure of central tendency). Deviations with respect to the sample mean and the population mean (or "true value") are called errors and residuals, respectively. The sign of the deviation reports the direction of that difference: the deviation is positive when the observed value exceeds the reference value. The absolute value of the deviation indicates the size or magnitude of the difference. In a given sample, there are as many deviations as sample points. Summary statistics can be derived from a set of deviations, such as the standard deviation and the mean absolute deviation, measures of dispersion, and the mean signed deviation, a measure of bias.

Types

Signed deviations

A deviation that is a difference between an observed value and the true value of a quantity of interest (such as the population mean) is an error.

A deviation that is the difference between the observed value and an estimate of the true value (e.g. the sample mean) is a residual. These concepts are applicable for data at the interval and ratio levels of measurement.

Unsigned or absolute deviation

In statistics, the absolute deviation of an element of a data set is the absolute difference between that element and a given point. Typically the deviation is reckoned from the central value, being construed as some type of average, most often the median or sometimes the mean of the data set:

D_{i}=|x_{i}-m(X)|,

where

D_i is the absolute deviation,
x_i is the data element,
m(X) is the chosen measure of central tendency of the data set—sometimes the mean ( ${\overline {x}}$ ), but most often the median.

Summary statistics

Mean signed deviation

For an unbiased estimator, the average of the signed deviations across the entire set of all observations from the unobserved population parameter value averages zero over an arbitrarily large number of samples. However, by construction the average of signed deviations of values from the sample mean value is always zero, though the average signed deviation from another measure of central tendency, such as the sample median, need not be zero.

Dispersion

Statistics of the distribution of deviations are used as measures of statistical dispersion.

Standard deviation is the frequently used measure of dispersion: it uses squared deviations, and has desirable properties, but is not robust.^[1]
Average absolute deviation, is the sum of absolute values of the deviations divided by the number of observations.
Median absolute deviation is a robust statistic, which uses the median, not the mean, of absolute deviations.^[2]
Maximum absolute deviation is a highly non-robust measure, which uses the maximum absolute deviation.

Normalization

Deviations have units of the measurement scale (for instance, meters if measuring lengths). One can nondimensionalize in two ways.

One way is by dividing by a measure of scale (statistical dispersion), most often either the population standard deviation, in standardizing, or the sample standard deviation, in studentizing (e.g., Studentized residual).

In relative differences, one scale deviations by location (instead of dispersion); the formula for a deviation in percent is the observed value minus accepted value divided by the accepted value multiplied by 100%.

References

↑ "2. Mean and standard deviation | The BMJ". The BMJ | The BMJ: leading general medical journal. Research. Education. Comment. 2020-10-28. Retrieved 2022-11-02.
↑ Jones, Alan R. (2018-10-09). Probability, Statistics and Other Frightening Stuff. Routledge. p. 73. ISBN 978-1-351-66138-6.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[1] "2. Mean and standard deviation | The BMJ". The BMJ | The BMJ: leading general medical journal. Research. Education. Comment. 2020-10-28. Retrieved 2022-11-02.

[2] Jones, Alan R. (2018-10-09). Probability, Statistics and Other Frightening Stuff. Routledge. p. 73. ISBN 978-1-351-66138-6.

[1]

[2]