What Is Descriptive Statistics?

Summary: Descriptive statistics use numbers like mean, median, and mode and graphs like histograms, scatter plots and bar charts to describe a data sample without making inferences about the larger population. They also use measures of dispersion to describe the distribution of the data sample.

In this article, I’ll help you understand the difference between descriptive statistics and inferential statistics. Then we’ll walk through some examples of descriptive statistics and how you can calculate them yourself.

What Is Descriptive Statistics?

Descriptive statistics describes data through certain numbers — like mean, median and mode — and graphical representations — like histograms, pie charts and scatter plots — to make it easier to understand and interpret. Descriptive statistics don’t involve any inference beyond what is immediately available, meaning they represent the available data (sample) and don’t make generalizations about the entire population.

What Is Statistics?

Statistics is the science of collecting, analyzing and organizing data to infer proportions (sample) that are representative of the population. In other words, statistics is interpreting data to make predictions for the population.

There are two branches of statistics:

Descriptive Statistics: Descriptive statistics describes data through certain numbers and graphical representations.
Inferential Statistics: Inferential statistics involves taking a random sample of data from a population to describe and make inferences about the population.

Descriptive Statistics vs. Inferential Statistics

Descriptive statistics summarize data through certain numbers like mean, median and mode, and graphs like box plots, bar charts and histograms, so the data is easier to understand and interpret. Descriptive statistics don’t make generalizations or inferences beyond what is immediately available, only representing the data gathered in the sample.

Analyzing the differences between descriptive and inferential statistics. | Source: The Organic Chemistry Tutor

Commonly Used Measures

Measures of central tendency
Measures of dispersion (or variability)

What Are the Measures of Central Tendency?

A measure of central tendency is a summary of the data that typically describes the center of the data. This summary involves any or all of the following values:

Mean
Median
Mode

1. What Is the Mean?

Mean is the ratio of the sum of all observations in the data to the total number of observations. This is also known as the average. Thus, mean is a number around which the entire data set is spread.

More on StatisticsT-Test vs. Chi-Square Test: Differences and When to Use Each

2. What Is the Median?

Median is the point which divides the entire data into two equal halves. One half of the data is less than the median and the other half is greater than the median. Median is calculated by first arranging the data in either ascending or descending order.

If the number of observations is odd, median is given by the middle observation in the sorted form.
If the number of observations is even, median is given by the mean of the two middle observations in the sorted form.

An important point to note is that the order of the data (ascending or descending) does not affect the median.

3. What Is the Mode?

Mode is the number that has the maximum frequency in the entire data set. In other words, mode is the number that appears the most often. A data set can have one or more than one mode.

If there is only one number that appears the most number of times, the data has one mode, and is called uni-modal.
If there are two numbers that appear equally frequently, the data has two modes, and is called bi-modal.
If there are more than two numbers that appear equally frequently, the data has more than two modes. We call that multi-modal.

How to Find the Mean, Median and Mode

Consider the following data points:

17, 16, 21, 18, 15, 17, 21, 19, 11, 23

We calculate the mean as:

To calculate the median, let’s arrange the data in ascending order:

11, 15, 16, 17, 17, 18, 19, 21, 21, 23

Since the number of observations is even (10), median is given by the average of the two middle observations (fifth and sixth here).

descriptive statistics median calculation

Mode is given by the number that occurs the most number of times. Here, 17 and 21 both occur twice. Hence, this is a bi-modal data set and the modes are 17 and 21.

A few things to note:

Since median and mode don’t consider all the data points for calculations, median and mode are robust against outliers (i.e., these are not affected by outliers).
At the same time, mean considers all data points, so it is sensitive to outliers. A big outlier increases mean, leading it to overestimate the data; a small outlier decreases mean, leading it to underestimate the data.
If the distribution is symmetrical, mean = median = mode, or what we would call “normal distribution.”

More on DataHow to Find Outliers With IQR Using Python

What Are Measures of Dispersion?

Measures of dispersion describe the spread of the data. When paired with measures of central tendency, they can reveal how far data points deviate from a central value.

7 Measures of Dispersion

Absolute deviation from mean
Variance
Standard deviation
Range
Quartiles
Skewness
Kurtosis

1. Absolute Deviation From Mean

The absolute deviation from the mean, also called mean absolute deviation (MAD), describes the variation in the data set. In a sense, it tells you the average absolute distance of each data point in the set. We calculate it as:

descriptive statistics mean absolute deviation statistic

2. Variance

Variance measures how far data points spread out from the mean. A high variance indicates that data points are spread widely and a small variance indicates that the data points are closer to the data set’s mean. We calculate it as:

descriptive statistics variance calculation

3. Standard Deviation

The square root of variance is called the standard deviation. We calculate it as:

descriptive statistics standard deviation calculation

4. Range

Range is the difference between the maximum value and the minimum value in the data set. It is given as:

5. Quartiles

Quartiles are the points in the data set that divides the data set into four equal parts. Q1, Q2 and Q3 are the first, second and third quartile of the data set.

25 percent of the data points lie below Q1 and 75 percent lie above it.
50 percent of the data points lie below Q2 and 50 percent lie above it. Q2 is nothing but median.
75 percent of the data points lie below Q3 and 25 percent lie above it.

6. Skewness

The measure of asymmetry in a probability distribution is defined by skewness. Skewness can either be positive, negative or undefined. We’ll focus on positive and negative skew.

Positive Skew: This is the case when the tail on the right side of the curve is bigger than that on the left side. For these distributions, the mean is greater than the mode.
Negative Skew: This is the case when the tail on the left side of the curve is bigger than that on the right side. For these distributions, the mean is smaller than the mode.

The most commonly used method of calculating skewness is:

descriptive statistics skewness calculation

If the skewness is zero, the distribution is symmetrical. If it is negative, the distribution is negatively skewed and if it is positive, it is positively skewed.

descriptive statistics mean median mode bell curve illustration — Three examples of skewness. | Image: Wikipedia

7. Kurtosis

Kurtosis describes whether the data is light-tailed (lack of outliers) or heavy-tailed (outliers present) when compared to a normal distribution. There are three kinds of kurtosis:

descriptive statistics kurtosis illustration — Examples of the three forms of kurtosis. | Image: Wikipedia

Mesokurtic: This is the case when the kurtosis is zero, similar to normal distributions.
Leptokurtic: This is when the tail of the distribution is heavy (outlier present) and kurtosis is higher than that of the normal distribution.
Platykurtic: This is when the tail of the distribution is light (no outlier) and kurtosis is lesser than that of the normal distribution.

Frequently Asked Questions

What is the difference between descriptive and inferential statistics?

While both descriptive and inferential statistics collect samples of data from a larger population, descriptive statistics only presents findings on that specific sample of data without making inferences about the larger population. Meanwhile, inferential statistics uses its findings from a data sample to make generalizations about the larger population.

What are the key measures of central tendency?

The key measures of central tendency used in descriptive statistics are mean (the average of the data), median (the value at the middle of the data) and the mode (the value or values that appear the most frequently).

What are the measures of dispersion?

Common measures of dispersion used in descriptive statistics include absolute deviation from the mean, variance, standard deviation, range, quartiles, skewness and kurtosis.

What Is Descriptive Statistics?

What Is Statistics?

Descriptive Statistics vs. Inferential Statistics

Commonly Used Measures

What Are the Measures of Central Tendency?

1. What Is the Mean?

2. What Is the Median?

3. What Is the Mode?

How to Find the Mean, Median and Mode

What Are Measures of Dispersion?

7 Measures of Dispersion

1. Absolute Deviation From Mean

2. Variance

3. Standard Deviation

4. Range

5. Quartiles

6. Skewness

7. Kurtosis

Frequently Asked Questions

What is the difference between descriptive and inferential statistics?

What are the key measures of central tendency?

What are the measures of dispersion?

Recent Data Science Articles