how do I find count of unique combination in 2 columns of dataframe in pandas

Question

df = pd.DataFrame({'col1': [1,2,4,3], 'col2': [2,1,3,4]})
   col1 col2
0   1     2
1   2     1
2   4     3
3   3     4

Desired outcome

  col1 col2 count
0   1     2     2
1   4     3     2

I tried

(df.groupby(['team1','team2']).size()
   .sort_values(ascending=False)
   .reset_index(name='Count')
)

but this is not giving me unique combination

what you tried should be posted in the question. Comments are meant for clarifications and may get deleted. Please edit what you've tried into the question. — Ch3steR
– Ch3steR, Commented Apr 19, 2022 at 13:57

Scott Boston · Accepted Answer · 2022-04-19 13:59:34Z

2

You do something like this also,

df.apply(set, axis=1).value_counts()

Output:

{1, 2}    2
{3, 4}    2
dtype: int64

answered Apr 19, 2022 at 13:59

Scott Boston

154k15 gold badges160 silver badges207 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

mozway · Accepted Answer · 2022-04-19 14:04:10Z

2

IIUC, you can first compute a frozenset from your two columns, then use named aggregation:

# compute unordered grouper
group = df[['col1', 'col2']].agg(frozenset, axis=1)

# craft a dictionary of expected output
# first rows for the existing columns + new column for count
d = {c: (c, 'first') for c in df}
d.update({'count': ('col1', 'count')})
# {'col1': ('col1', 'first'),
#  'col2': ('col2', 'first'),
#  'count': ('col1', 'count')}

# perform the aggregation
df.groupby(group, as_index=False).agg(**d)

output:

   col1  col2  count
0     1     2      2
1     4     3      2

edited Apr 19, 2022 at 14:04

answered Apr 19, 2022 at 13:59

mozway

267k13 gold badges56 silver badges106 bronze badges

Comments

BENY · Accepted Answer · 2022-04-19 14:01:01Z

1

Let us check

df[:] = np.sort(df.to_numpy(),axis=1)
df.value_counts()
Out[132]: 
col1  col2
1     2       2
3     4       2
dtype: int64

answered Apr 19, 2022 at 14:01

BENY

324k22 gold badges176 silver badges250 bronze badges

Collectives™ on Stack Overflow

how do I find count of unique combination in 2 columns of dataframe in pandas

3 Answers 3

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related