Displaying distribution of categorical variables in Pandas

A great feature supported by Pandas is the data.hist(). The hist function allows me to visualize the distribution of numerical values. However, that being said: it only allows me to view the distribution for numerical variables.

If I wanted to view the distribution of categorical variables, I'd need to run select_dtypes in a for loop as follows:

import matplotlib.pyplot as plt

for col in tips.select_dtypes(include=["category"]):
    tips[col].value_counts().plot(kind='bar')
    plt.show()

Is there an easier (more Pythonic) way to view the distribution of categorical variables.

asked Mar 30, 2016 at 16:34

user1008537

Take a look at stanford.edu/~mwaskom/software/seaborn/tutorial/…

Stefan
– Stefan

2016-03-30 17:07:52 +00:00
Commented Mar 30, 2016 at 17:07
Thanks Stefan, I am familiar with Seaborn - but I'd love to stick with the pandas ecosystem.

user1008537
– user1008537

2016-03-30 17:08:44 +00:00
Commented Mar 30, 2016 at 17:08
@Phil it would probably help if you explained why "absolutely not". Assuming that you meant that the duplicate doesn't apply. The accepted answer seems to do the same thing you're doing in the loop.

Andras Deak -- Слава Україні
– Andras Deak -- Слава Україні

2016-04-06 19:32:39 +00:00
Commented Apr 6, 2016 at 19:32
I have a solution at stackoverflow.com/a/54266570/5084355

Roman Orac
– Roman Orac

2019-01-19 11:26:44 +00:00
Commented Jan 19, 2019 at 11:26

Add a comment |

Collectives™ on Stack Overflow

Displaying distribution of categorical variables in Pandas [duplicate]

0

Linked

Hot Network Questions