Python pandas count occurrences in each column

Question

I am new to pandas. Can someone help me calculate the frequency of each value in every column?

Dataframe:

id|flag1|flag2|flag3|  
---------------------
1 |  1  |   2 |   1 |  
2 |  3  |   1 |   1 |  
3 |  3  |   4 |   4 |  
4 |  4  |   1 |   4 |  
5 |  2  |   3 |   2 |

I want something like:

id|flag1|flag2|flag3|  
---------------------
1 |  1  |   2 |   2 |  
2 |  1  |   1 |   1 |  
3 |  2  |   1 |   0 |  
4 |  1  |   1 |   2 |

Explanation: in the first row, the value 1 occurs once in flag1, twice in flag2 and twice in flag3. The first column holds the value being counted, not a row id.

Why should id 5 be ignored? The last line could be 5|0|0|0.
– RomanPerekhrest, Commented Nov 20, 2017 at 11:01
The id column is not used, which is why it is ignored. The values in the flag columns do not belong to a specific id; they are just numbers, and I have to categorise on the basis of those numbers.
– Rajat Srivastava, Commented Nov 20, 2017 at 11:13

jezrael · Accepted Answer · 2017-11-20 12:02:24Z


First select only the flag columns, either with filter or by dropping the id column, then apply value_counts to each column. Finally, replace the NaNs with 0 and cast to int:

df = df.filter(like='flag').apply(lambda x: x.value_counts()).fillna(0).astype(int)
print(df)
   flag1  flag2  flag3
1      1      2      2
2      1      1      1
3      2      1      0
4      1      1      2
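For anyone who wants to run this end to end, here is a minimal self-contained sketch. It rebuilds the sample frame from the question by hand; the name `counts` and the extra `sort_index()` call are my additions, used so that the original `df` is not overwritten and the row order does not depend on the pandas version.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "flag1": [1, 3, 3, 4, 2],
    "flag2": [2, 1, 4, 1, 3],
    "flag3": [1, 1, 4, 4, 2],
})

counts = (
    df.filter(like="flag")                    # keep only the flag columns
      .apply(lambda col: col.value_counts())  # per-column counts, aligned on the value
      .fillna(0)                              # a value missing from a column gives NaN
      .astype(int)                            # NaN forced floats, so cast back to int
      .sort_index()                           # assumption: rows wanted in value order
)
print(counts)
```

The `fillna(0)` step is what turns the missing count for value 3 in flag3 into a 0 rather than a NaN.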

Or:

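# Passing the axis positionally (df.drop('id', 1)) no longer works in pandas 2.0+,
# so name the column explicitly.
df = df.drop(columns='id').apply(lambda x: x.value_counts()).fillna(0).astype(int)
print(df)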
   flag1  flag2  flag3
1      1      2      2
2      1      1      1
3      2      1      0
4      1      1      2

Thanks to Bharath for the suggestion that the lambda is not needed; pass the method itself, without calling it:

df = df.filter(like='flag').apply(pd.Series.value_counts).fillna(0).astype(int)
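As a quick check that the lambda and the bare method behave the same, the sketch below applies both to the question's sample data and compares the results. The variable names (`flags`, `with_lambda`, `without_lambda`, `same`) are only illustrative, and the frame is rebuilt by hand here so the snippet runs on its own.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "flag1": [1, 3, 3, 4, 2],
    "flag2": [2, 1, 4, 1, 3],
    "flag3": [1, 1, 4, 4, 2],
})

flags = df.drop(columns="id")

# apply() calls the function once per column, so the unbound method
# pd.Series.value_counts receives each column as its `self` argument.
with_lambda = flags.apply(lambda col: col.value_counts())
without_lambda = flags.apply(pd.Series.value_counts)

# DataFrame.equals treats NaNs in the same position as equal.
same = with_lambda.equals(without_lambda)
print(same)
```

Note that the method is passed uncalled: writing `pd.Series.value_counts()` would try to call it with no Series and raise a TypeError.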

edited Nov 20, 2017 at 12:02

answered Nov 20, 2017 at 11:06


1 Comment

Bharath M Shetty Over a year ago

Sir, you don't need a lambda here; you can use pd.Series.value_counts, as used here: stackoverflow.com/questions/46863602/…
