Suppose I have the following dataframe:

          a         b         c         d 
0  0.049531  0.408824  0.975756  0.658347
1  0.981644  0.520834  0.258911  0.639664
2  0.641042  0.534873  0.806442  0.066625
3  0.764057  0.063252  0.256748  0.045850

and I want only the subset of columns whose value in row 0 is greater than 0.5. I can do this:

df2 = df.T
myResult = df2[df2.iloc[:, 0] > 0.5].T

But this feels like a horrible hack. Is there a nicer way to do boolean indexing along columns? Somewhere I can specify an axis argument?
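For reference, a frame like the one above can be reproduced with random values (the seed and the use of `np.random` are assumptions; the original values are not recoverable), which makes the transpose round-trip easy to try out:

```python
import numpy as np
import pandas as pd

# Reproducible 4x4 sample frame; the seed is an assumption.
np.random.seed(0)
df = pd.DataFrame(np.random.rand(4, 4), columns=list("abcd"))

# The transpose round-trip from the question: flip the frame so columns
# become rows, filter rows on the (transposed) row-0 values, flip back.
df2 = df.T
result = df2[df2.iloc[:, 0] > 0.5].T
print(result)  # only the columns whose row-0 value exceeds 0.5
```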

  • I believe you've got the most elegant way out there. Commented Aug 12, 2014 at 20:11

3 Answers

How about this?

df.loc[:, df.iloc[0, :] > 0.5]
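A minimal sketch of how this selection behaves, on a randomly generated frame rather than the exact values above (the generator and seed are assumptions):

```python
import numpy as np
import pandas as pd

# Sample frame; the seed is an assumption for reproducibility.
rng = np.random.default_rng(42)
df = pd.DataFrame(rng.random((4, 4)), columns=list("abcd"))

# Column-wise boolean indexing with .loc: keep every row (":"), but only
# those columns whose value in row 0 exceeds 0.5.
result = df.loc[:, df.iloc[0, :] > 0.5]

# Same columns as the transpose hack, with no transposing at all.
assert list(result.columns) == [c for c in df.columns if df.at[0, c] > 0.5]
```

The key point is that `.loc` accepts a boolean Series in the column slot, so the mask `df.iloc[0, :] > 0.5` (indexed by column label) selects columns directly.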


Yes, that is precisely what I was looking for.

Another method that avoids transposing: build a boolean mask of whether the first row has values larger than 0.5, drop the all-NaN columns with a threshold, and finally use the surviving column labels to filter the original df. This is pretty obfuscated, though ;)

In [76]:

df[list(df[df.head(1) > 0.5].dropna(thresh=1, axis=1))]
Out[76]:
              c         d
index                    
0      0.975756  0.658347
1      0.258911  0.639664
2      0.806442  0.066625
3      0.256748  0.045850
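A step-by-step sketch of the same mask-and-drop chain on a reproducible frame (the seed is an assumption, not part of the answer):

```python
import numpy as np
import pandas as pd

# Sample frame; the seed is an assumption.
np.random.seed(1)
df = pd.DataFrame(np.random.rand(4, 4), columns=list("abcd"))

# Mask the frame against its first row: cells in row 0 of qualifying
# columns keep their value, every other cell becomes NaN.
masked = df[df.head(1) > 0.5]

# Drop columns with fewer than one non-NaN cell (thresh=1), i.e. the
# all-NaN columns, then use the surviving labels to filter the original.
kept = list(masked.dropna(thresh=1, axis=1))
result = df[kept]
```

Breaking it into named steps makes clear why it works: the 1-row boolean frame from `df.head(1) > 0.5` is aligned against the full frame, so rows 1 onward are treated as False and NaN-ed out everywhere.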



Another way of looking at your answer:

In [14]: df.T[df.T[0] > 0.5].T
Out[14]: 
          c        d 
0  0.975756  0.658347
1  0.258911  0.639664
2  0.806442  0.066625
3  0.256748  0.045850


Triple transpose might not be as elegant as your answer.
