Python data analysis | data visualization based on Pandas

Author: Han Xinzi@ShowMeAI
Tutorial address: http://www.showmeai.tech/tutorials/33
Article address: http://www.showmeai.tech/article-detail/150
Notice: All Rights Reserved. Please contact the platform and the author for reprint and indicate the source

In the previous tutorial, we saw the flexible operation of pandas for data analysis, but at the same time, as a powerful universal tool library, it can also easily support data visualization, and most basic image rendering can be realized in one line of code, which greatly speeds up our analysis efficiency, In this paper, we introduce the methods of pandas visualization and drawing various graphics.

1, Basic drawing function plot

The visualization function on Series and DataFrame is just a simple wrapper around the matplotlib library plot() method.

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.randn(10,4),index=pd.date_range('1/1/2022',
   periods=10), columns=list('ABCD'))
df.plot()

The operation results are as follows:

If the index consists of dates, as shown in the above figure, the date will be filled automatically.

In addition to the default line diagram, you can also draw a variety of styles, and you can use dataframe plot. [figure type parameter] method to select different figures.

2, Bar chart

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(10,4),columns=['e','f','g','h'])
df.plot.bar()

The operation results are as follows:

To generate stacked histogram, you can set stacked=True

import pandas as pd
df = pd.DataFrame(np.random.rand(10,4),columns= ['e','f','g','h'])
df.plot.bar(stacked=True)

The operation results are as follows:

To get a horizontal bar chart, you can use the barh method:

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(10,4),columns= ['e','f','g','h'])
df.plot.barh(stacked=True)

The operation results are as follows:

3, Histogram

You can use plot Hist () method draws histogram. We can specify the quantity.

import pandas as pd
import numpy as np
df = pd.DataFrame({'a':np.random.randn(1000)+1,'b':np.random.randn(1000),'c':np.random.randn(1000) - 1, 'd':np.random.randn(1000) -2}, columns=['a', 'b', 'c', 'd'])
df.plot.hist(bins=20)

The operation results are as follows:

You can draw a different histogram for each column using the following code:

import pandas as pd
import numpy as np
df = pd.DataFrame({'a':np.random.randn(1000)+1,'b':np.random.randn(1000),'c':np.random.randn(1000) - 1, 'd':np.random.randn(1000) -2}, columns=['a', 'b', 'c', 'd'])
df.diff().hist(bins=20)

The operation results are as follows:

4, Box diagram

You can call series box. Plot () and dataframe box. Plot() or dataframe Boxplot() to plot boxplot to visualize the distribution of values in each column.

For example, this is a box plot representing five trials of 10 observations of a random variable on [0,1).

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(10, 5), columns=['A', 'B', 'C', 'D', 'E'])
df.plot.box()

The operation results are as follows:

5, Area map

You can use series plot. Area() or dataframe plot. The area () method creates an area map.

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(10, 4), columns=['A', 'B', 'C', 'D'])
df.plot.area()

The operation results are as follows:

6, Scatter diagram

To create a scatter chart, you can use dataframe plot. Scatter() method.

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(50, 4), columns=['a', 'b', 'c', 'd'])
df.plot.scatter(x='a', y='b')

The operation results are as follows:

7, Pie chart

To create a pie chart, you can use dataframe plot. Pie() method.

import pandas as pd
import numpy as np
df = pd.DataFrame(3 * np.random.rand(4), index=['a', 'b', 'c', 'd'], columns=['x'])
df.plot.pie(subplots=True)

The operation results are as follows:

Data and code download

The code of this tutorial series can be downloaded from the github corresponding to ShowMeAI, which can be run in the local python environment. Babies who can surf the Internet scientifically can also directly learn through one click operation and interactive operation with the help of Google Lab!

The quick look-up tables involved in this series of tutorials can be downloaded and obtained at the following address:

Extended references

ShowMeAI related articles recommended

ShowMeAI series tutorial recommendations

Keywords: Python Data Analysis Visualization pandas

Added by 01hanstu on Fri, 25 Feb 2022 11:55:00 +0200