pandas.DataFrame.boxplot() Function | How to make box plots in pandas


pandas.DataFrame.boxplot(): This function makes a box plot from DataFrame columns.

Syntax: DataFrame.boxplot(column=None, by=None, ax=None, fontsize=None, rot=0, grid=True, figsize=None, layout=None, return_type=None, **kwds)

Make a box-and-whisker plot from DataFrame columns, optionally grouped by some other columns. A box plot is a method for graphically depicting groups of numerical data through their quartiles. The box extends from the Q1 to Q3 quartile values of the data, with a line at the median (Q2). The whiskers extend from the edges of the box to show the range of the data. By default, the whiskers reach at most 1.5 * IQR (IQR = Q3 – Q1) from the edges of the box, ending at the farthest data point within that distance. Outlier points are those past the end of the whiskers.
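To make the quartile and whisker arithmetic concrete, here is a minimal sketch that computes the same quantities by hand with NumPy. The sample values are hypothetical and chosen only for illustration, and the sketch assumes NumPy's default (linear) percentile interpolation.

```python
import numpy as np

# Hypothetical sample data, chosen only for illustration.
data = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 30.0])

# Quartiles: the box spans Q1..Q3, with a line at the median (Q2).
q1, median, q3 = np.percentile(data, [25, 50, 75])
iqr = q3 - q1

# Limits at 1.5 * IQR from the edges of the box.
lower_limit = q1 - 1.5 * iqr
upper_limit = q3 + 1.5 * iqr

# Whiskers end at the most extreme data points that are still inside the limits.
inside = data[(data >= lower_limit) & (data <= upper_limit)]
whisker_low, whisker_high = inside.min(), inside.max()

# Everything past the whiskers is drawn as an outlier point.
outliers = data[(data < lower_limit) | (data > upper_limit)]

print(q1, median, q3, iqr)        # 3.0 5.0 7.0 4.0
print(whisker_low, whisker_high)  # 1.0 8.0
print(outliers)                   # [30.]
```

Here the value 30 lies beyond Q3 + 1.5 * IQR = 13, so it would be plotted as an outlier rather than stretching the upper whisker.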

For further details see Wikipedia’s entry for boxplot.

Parameters:
column : str or list of str, optional
Column name or list of names, or vector. Can be any valid input to pandas.DataFrame.groupby().

by : str or array-like, optional
Column in the DataFrame to group by, passed to pandas.DataFrame.groupby(). One box plot is drawn per value of the columns in by.

ax : object of class matplotlib.axes.Axes, optional
The matplotlib axes to be used by boxplot.

fontsize : float or str
Tick label font size in points or as a string (e.g., large).

rot : int or float, default 0
The rotation angle of labels (in degrees) with respect to the screen coordinate system.

grid : boolean, default True
Setting this to True will show the grid.

figsize : A tuple (width, height) in inches
The size of the figure to create in matplotlib.

layout : tuple (rows, columns), optional
For example, (3, 5) will display the subplots using 3 rows and 5 columns, starting from the top-left.

return_type : {‘axes’, ‘dict’, ‘both’} or None, default ‘axes’
The kind of object to return. The default is axes.

‘axes’ returns the matplotlib axes the boxplot is drawn on.

‘dict’ returns a dictionary whose values are the matplotlib Lines of the boxplot.

‘both’ returns a namedtuple with the axes and dict.

When grouping with by, a Series mapping columns to return_type is returned.

If return_type is None when grouping with by, a NumPy array of axes with the same shape as layout is returned.

**kwds
All other plotting keyword arguments to be passed to matplotlib.pyplot.boxplot().
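As an illustration of how several of these parameters combine, the following sketch groups two columns by a third and arranges the subplots with layout. The Group column and its values are hypothetical, and the Agg backend is selected only so the sketch runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: non-interactive backend, so no window is needed

import numpy as np
import pandas as pd

np.random.seed(1234)
df = pd.DataFrame(np.random.randn(16, 2), columns=["Col1", "Col2"])
df["Group"] = ["A", "B"] * 8  # hypothetical grouping column

# One subplot per column, one box per value of "Group".
# layout=(1, 2) requests 1 row and 2 columns of subplots.
axes = df.boxplot(
    column=["Col1", "Col2"],
    by="Group",
    layout=(1, 2),
    rot=45,        # rotate the tick labels by 45 degrees
    fontsize=10,   # tick label size in points
    grid=False,
    figsize=(8, 4),
)

# With by set and return_type left as None, an array of axes comes back.
flat_axes = np.ravel(axes)
print(len(flat_axes))
```

Flattening with np.ravel is a defensive choice here, since the exact shape of the returned array can depend on the layout.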

Returns:
result :
The return type depends on the return_type parameter:

‘axes’ : object of class matplotlib.axes.Axes
‘dict’ : dict of matplotlib.lines.Line2D objects
‘both’ : a namedtuple with structure (ax, lines)
For data grouped with by:

Series
array (for return_type = None)
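The three return_type values can be compared side by side with a short sketch. It passes explicit axes through the ax parameter so each call draws on its own subplot; the variable names are illustrative, and the Agg backend is an assumption made so the code runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: non-interactive backend, so no window is needed

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

np.random.seed(1234)
df = pd.DataFrame(np.random.randn(16, 2), columns=["Col1", "Col2"])

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(12, 4))

# 'axes': the matplotlib Axes the boxes were drawn on.
result_axes = df.boxplot(ax=ax1, return_type="axes")

# 'dict': the Line2D artists, keyed by part of the plot (boxes, medians, ...).
result_dict = df.boxplot(ax=ax2, return_type="dict")

# 'both': a namedtuple with fields ax and lines.
result_both = df.boxplot(ax=ax3, return_type="both")

print(type(result_axes).__name__)
print(sorted(result_dict.keys()))
print(result_both._fields)
```

The dict form is handy when individual artists need restyling after plotting, for example changing the color of each line in result_dict["medians"].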

# Example program for the pandas.DataFrame.boxplot() function

import numpy as np
import pandas as pd

np.random.seed(1234)
df = pd.DataFrame(np.random.randn(16, 4), columns=['Col1', 'Col2', 'Col3', 'Col4'])
boxplot = df.boxplot(column=['Col1', 'Col2', 'Col3'])
print(boxplot)

Output:
[Figure: box plot of Col1, Col2 and Col3]
