How to get the indexes of all minimum values in pandas dataframe?

Question

I have a dataframe:

df = pd.DataFrame({'A': [0, 0, 1], 'B': [1, 0, 0]}, index=['x', 'y', 'z'])

   A  B
x  0  1
y  0  0
z  1  0

For each row, I want the names of all the columns with the lowest value (edit: per row), something like:

x  A
y  A
y  B
z  B

# or 

x  [A]
y  [A, B]
z  [B]

I know idxmin() gives the first instance of the lowest value:

df.idxmin(axis=1)

x    A
y    A
z    B

But what is an efficient way to get all of them?

This question gives all of the rows with the minimum value in a specific column, but that's not quite what I want.

Edit: Here's a better toy df to play with for getting the column names with the minimum value in each row:

df2 = pd.DataFrame({'A': [1, 0, 6], 'B': [3, 0, 2]}, index=['x', 'y', 'z'])

   A  B
x  1  3
y  0  0
z  6  2

Upvote because you found df.idxmin

– ifly6, Commented Feb 8, 2022 at 21:46
Should the minimum be per group or overall?

– mozway, Commented Feb 8, 2022 at 21:50
@mozway the minimum per row

– hmg, Commented Feb 8, 2022 at 22:14

mozway · Accepted Answer · 2022-02-08 22:20:06Z

2

You can use groupby+transform('min'):

s = df.stack()
s[s.eq(s.groupby(level=0).transform('min'))]

Output:

x  A    0
y  A    0
   B    0
z  B    0
dtype: int64

Alternative format:

s = df.stack()
(s[s.eq(s.groupby(level=0).transform('min'))]
  .reset_index()
  .groupby('level_0')['level_1'].apply(list)
 )

Output:

level_0
x       [A]
y    [A, B]
z       [B]
Name: level_1, dtype: object
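To check that this approach holds up when row and column minima differ, here is a minimal sketch that runs the same stack + groupby(level=0) idea on the question's df2. The helper name min_columns_per_row is made up for this example and is not part of pandas; it also reads the index levels directly instead of relying on the level_0/level_1 names that reset_index generates.

```python
import pandas as pd

# Second toy frame from the question: row and column minima differ here.
df2 = pd.DataFrame({'A': [1, 0, 6], 'B': [3, 0, 2]}, index=['x', 'y', 'z'])


def min_columns_per_row(frame):
    """Hypothetical helper: list every column holding the row minimum."""
    s = frame.stack()                                # (row, column) -> value
    row_min = s.groupby(level=0).transform('min')    # row minimum, aligned to s
    keep = s[s.eq(row_min)]                          # entries equal to their row minimum
    rows = keep.index.get_level_values(0)
    cols = keep.index.get_level_values(1)
    # Collect the surviving column labels into one list per row label.
    return pd.Series(cols, index=rows).groupby(level=0).apply(list)


result = min_columns_per_row(df2)
print(result.to_dict())
# {'x': ['A'], 'y': ['A', 'B'], 'z': ['B']}
```

Reading the levels with get_level_values keeps the sketch working when the index or columns carry names.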

edited Feb 8, 2022 at 22:20

answered Feb 8, 2022 at 21:47

mozway


3 Comments

hmg Over a year ago

Thanks for this. What does level=-1 do in groupby? (It also seems like levels 1 and 0 both give the same result, for this toy example at least.)

mozway Over a year ago

This means the last (rightmost) level of the MultiIndex ;)

mozway Over a year ago

@hmg I made a mistake, it should be level 0 to get the min per row. Level -1 would get the min per column (coincidentally identical here).
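To illustrate the level=0 versus level=-1 distinction from these comments, here is a small sketch on the question's df2, where the two groupings no longer agree. The variable names are only for this example.

```python
import pandas as pd

df2 = pd.DataFrame({'A': [1, 0, 6], 'B': [3, 0, 2]}, index=['x', 'y', 'z'])
s = df2.stack()  # MultiIndex: level 0 = row label, level -1 (last) = column label

# level=0 groups by row label, so each value is compared with its row minimum.
per_row = s.groupby(level=0).transform('min')
row_hits = s[s.eq(per_row)].index.tolist()

# level=-1 groups by column label, so each value is compared with its column minimum.
per_col = s.groupby(level=-1).transform('min')
col_hits = s[s.eq(per_col)].index.tolist()

print(row_hits)  # [('x', 'A'), ('y', 'A'), ('y', 'B'), ('z', 'B')]
print(col_hits)  # [('y', 'A'), ('y', 'B')]
```

On the original df every column minimum is 0 and every row minimum is 0 as well, which is why both levels appeared to give the same answer there.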

wwnde · Accepted Answer · 2022-02-08 21:49:41Z

2

Build a boolean mask that marks every minimum value, then collect the column names where the mask is True into a list:

s = df == df.min()
df['column_min'] = s.agg(lambda s: s.index[s].values, axis=1)

   A  B column_min
x  0  1        [A]
y  0  0     [A, B]
z  1  0        [B]
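Note that df.min() with no arguments returns the minimum of each column, which happens to match the per-row minima on this toy frame. A possible per-row variant of the same boolean-mask idea, sketched on the question's df2, compares each row with df2.min(axis=1) instead. The names row_mask, col_mask and column_min are chosen for this example, and the list comprehension is just one way to read the labels out of the mask.

```python
import pandas as pd

df2 = pd.DataFrame({'A': [1, 0, 6], 'B': [3, 0, 2]}, index=['x', 'y', 'z'])

# Row-wise mask: compare every value with the minimum of its own row.
row_mask = df2.eq(df2.min(axis=1), axis=0)
column_min = pd.Series(
    [list(row_mask.columns[flags]) for flags in row_mask.to_numpy()],
    index=row_mask.index,
)
print(column_min.to_dict())
# {'x': ['A'], 'y': ['A', 'B'], 'z': ['B']}

# Column-wise mask, for contrast: df2.min() holds one minimum per column.
col_mask = df2 == df2.min()
col_wise = [list(col_mask.columns[flags]) for flags in col_mask.to_numpy()]
print(col_wise)
# [[], ['A', 'B'], []]
```

The column-wise mask leaves rows x and z empty on df2, because neither row contains a column minimum.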

answered Feb 8, 2022 at 21:49

wwnde


1 Comment

mozway Over a year ago

OP clarified the requirements, you need to use a groupby to get the min per row ;)

user7864386 · Accepted Answer · 2022-02-08 22:08:22Z

1

This is a one-liner, similar to @mozway's second solution but uses a boolean mask similar to @wwnde's:

min_cols = df.eq(df.min(axis=1), axis=0).stack().groupby(level=0).apply(lambda x: x.index.get_level_values(1)[x].tolist())

Output:

x       [A]
y    [A, B]
z       [B]
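The same row-wise mask can also produce the long format from the question (one row/column pair per line). This is a rough sketch on df2; the output column names 'row' and 'column' are assumptions made for this example.

```python
import pandas as pd

df2 = pd.DataFrame({'A': [1, 0, 6], 'B': [3, 0, 2]}, index=['x', 'y', 'z'])

# True wherever a value equals the minimum of its row.
mask = df2.eq(df2.min(axis=1), axis=0)

# Stack the mask into a boolean Series keyed by (row, column), keep the True entries.
stacked = mask.stack()
pairs = stacked[stacked].index.tolist()
print(pairs)
# [('x', 'A'), ('y', 'A'), ('y', 'B'), ('z', 'B')]

# Optional: a two-column frame with one line per (row, column) pair.
long_format = pd.DataFrame(pairs, columns=['row', 'column'])
```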

answered Feb 8, 2022 at 22:08

user7864386
