Box Plot with Minitab
What is a Box Plot?
A box plot is a graphical method to summarize a data set by visualizing the minimum value, 25th percentile, median, 75th percentile, the maximum value, and potential outliers. A percentile is the value below which a certain percentage of data fall. For example, if 75% of the observations have values lower than 685 in a data set, then 685 is the 75th percentile of the data. At the 50th percentile, or median, 50% of the values are lower and 50% are higher than that value.
Box Plot Anatomy
The figure above describes how to read a box plot. Here are a few explanations that may help. The middle part of the plot, or the “interquartile range,” represents the middle quartiles (or the 75th minus the 25th percentile). The line near the middle of the box represents the median (or middle value of the data set). The whiskers on either side of the IQR represent the lowest and highest quartiles of the data. The ends of the whiskers represent the maximum and minimum of the data, and the individual dots beyond the whiskers represent outliers in the data set.
How to Use Minitab to Generate a Box Plot
Data File: “Box Plot” tab in “Sample Data.xlsx”
Steps to render a box plot in Minitab:
- Click Graph → Boxplot.
- A new window named “Boxplots” pops up.
- Click “OK” in the window “Boxplots.”
- Another new window named “Boxplot– One Y, Simple” pops up.
- Select “HtBk” as the “Graph Variables.”
- Click the “Data View” button and a new window named “Boxplot– Data View” pops up.
- Check the boxes “Mediansymbol” and “Mean ”
- Click “OK” in the window “Boxplot– Data View.”
- Click “OK” in the window “Boxplot– One Y, Simple.”
- The box plot appears automatically in the new window.
Model summary: The above figure demonstrates the result of the boxplot after navigating through the Minitab menus to yield this output. Notice the interquartile range between the 25th and 75th quartiles, the median line, the mean, and the whiskers.
The annotations of the graph “Box Plot Anatomy” is wrong.
It is Min(Maximum value, Q3+1.5*IQR) and Max(Minimum value, Q1 – 1.5*IQR).
I lcan’t see the graph region of the box plot .what’s the reason for that?
I suggest to clarify the whiskers ends.
In fact the extreme lines here should be longer if they were 1.5 IQR, some of the outliers shown here are clearly within 1.5 IQR.
The proposed formula (minimum(minimum value in the data,…)) would include any outlier
low end is min dataset value above or equal to Q1-1.5 IQR
high end is max dataset value below or equal to Q3+1.5 IQR