- Kurtosis Examples
- Kurtosis Formulas
- Kurtosis or Excess Kurtosis?
- Kurtosis Calculation Example
- Platykurtic, Mesokurtic and Leptokurtic
In statistics, kurtosis refers to the “peakedness”
of the distribution for a quantitative variable.
What's meant by “peakedness” is best understood from the example histograms shown below.
Kurtosis Examples
Test 4 is almost perfectly normally distributed. Its excess kurtosis is therefore close to 0.
The distribution for test 3 is somewhat “flatter” than the normal curve: the histogram bars are lower than the middle of the curve and higher towards its tails. Test 3 therefore has a negative excess kurtosis.
Test 5 is more “peaked” than the normal curve: its bars are higher than the peak of the curve and lower towards its tails. Therefore, test 5 has a positive excess kurtosis.
Test 2 roughly follows a uniform distribution. Because it's even flatter than test 3, it has a stronger negative excess kurtosis.
The strongest negative excess kurtosis is seen for test 1, which has a bimodal distribution.
Positive excess kurtosis is often seen for variables having strong (positive) skewness such as test 6.
So now that we've an idea what (excess) kurtosis means, let's see how it's computed.
Kurtosis Formulas
If your data contain an entire population rather than just a sample, the population kurtosis \(K_p\) is computed as
$$K_p = \frac{M_4}{M_2^2}$$
where \(M_2\) and \(M_4\) denote the second and fourth moments around the mean:
$$M_2 = \frac{\sum\limits_{i = 1}^N(X_i - \overline{X})^2}{N}$$
and
$$M_4 = \frac{\sum\limits_{i = 1}^N(X_i - \overline{X})^4}{N}$$
Note that \(M_2\) is simply the population-variance formula.
Kurtosis or Excess Kurtosis?
A normally distributed variable has a kurtosis of 3.0. Since this is undesirable, population excess kurtosis \(EK_p\) is defined as
$$EK_p = K_p - 3$$
so that excess kurtosis is 0.0 for a normally distributed variable.
Now, that's all fine. But what's not fine is that “kurtosis” refers to either kurtosis or excess kurtosis in standard textbooks and software packages without clarifying which of these two is reported.
Anyway.
Our formulas thus far only apply to data containing an entire population. If your data only contain a sample from some population -usually the case- then you'll want to compute sample excess kurtosis \(EK_s\) as
$$EK_s = ((N + 1) \cdot EK_p + 6) \cdot \frac{(N - 1)}{(N - 2) \cdot (N - 3)}$$
This formula results in “kurtosis” as reported by most software packages such as SPSS, Excel and Googlesheets. Finally, most text books suggest that the standard error for (excess) kurtosis \(SE_{eks}\) is computed as
$$SE_{eks} \approx \sqrt{\frac{24}{N}}$$
This approximation, however, is not accurate for small sample sizes. Therefore, software packages use a more complicated formula. I won't bother you with it.
Kurtosis Calculation Example
An example calculation for excess kurtosis is shown in this Googlesheet (read-only), partly shown below.
We computed excess kurtosis for scores 1 through 10. First off, we added their mean, M = 5.50. Next,
\(D\) denotes the difference scores (score - mean);
\(D^2\) are squared difference scores that are used for computing a variance;
\(D^4\) are difference scores raised to the fourth power.
As shown below, the remaining computations are fairly simple too.
Note that the excess kurtosis = 1.20. The SPSS output shown below confirms this result.
Platykurtic, Mesokurtic & Leptokurtic
Some terminology related to excess kurtosis is that
- a variable having excess kurtosis < 0 is called platykurtic;
- a variable having excess kurtosis = 0 is called mesokurtic;
- a variable having excess kurtosis > 0 is called leptokurtic.
Our kurtosis examples illustrate what platykurtic, mesokurtic and leptokurtic distributions tend to look like.
Finding Kurtosis in Excel & SPSS
First off, “kurtosis” always refers to sample excess kurtosis in Excel, Googlesheets and SPSS. It's found in Excel and Googlesheets by using something like =KURT(A2:A11)
SPSS has many options for computing excess kurtosis but I personally prefer using
As shown below, kurtosis can be selected from Options and dragged into the desired position.
Right, I think that's about it regarding (excess) kurtosis. If you've any questions or remarks, please throw me a comment below. Other than that:
thanks for reading!
SPSS TUTORIALS
THIS TUTORIAL HAS 27 COMMENTS:
By Ruben Geert van den Berg on March 16th, 2022
Interesting point but I think you meant "LOW kurtosis means fat tails of the distribution...", right?
Something that really struck me is that negative excess kurtosis looks way less "normal" than comparable positive kurtosis.
When creating the example histograms, I had a hard time to get excess kurtosis < -2.0 (histogram 1). This looks about a non-normal as it gets.
However, histogram 5 (excess kurtosis = +4.54) looks much more "normal" than histogram 1!
Nevertheless, most discussions mention "absolute kurtosis". I think that's a very ambiguous measure due to omitting its sign. Do you have any thoughts on these observations?
By Ruben Geert van den Berg on March 16th, 2022
Hi Peter!
With "Pearson kurtosis" you refer to what standard textbooks call (excess) kurtosis, right?
If you disagree that it indicates peakness/flatness of distributions, then how would you describe "kurtosis" in normal language?
And could you share any links/references on your opinion?
Thanks!
SPSS tutorials
By Jon Peck on March 16th, 2022
From NIST
National Institute of Standards and Technology, US Dept of Commerce
https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm#:~:text=Kurtosis%20is%20a%20measure%20of,tails%2C%20or%20lack%20of%20outliers.
Kurtosis is a measure of whether the data are heavy-tailed or light-tailed relative to a normal distribution. That is, data sets with high kurtosis tend to have heavy tails, or outliers. Data sets with low kurtosis tend to have light tails, or lack of outliers.
That makes sense to me, since the fourth moment (numerator) would be more affected than the second moment (denominator) by extreme values.
There is definitely confusion about whether to subtract 3 (normal distribution kurtosis) or not, so it is important to check which definition is being used. I'm inclined to the minus 3 version to make the comparison to a normal distribution more obvious. In SPSS, long before my time, I understand that some procedures reported the minus 3 version and some didn't subtract 3, but I think it now uses the minus 3 version everywhere.
By Steven on March 17th, 2022
May I ask what the sample kurtosis formula is?
By Steven on March 17th, 2022
Please ignore the previous question, I see you have included it. Thank you.