Pandas Get Non-Null Values From Row Into One Cell [duplicate]

Question

Given the following data frame:

a = pd.DataFrame({'A': [1,2], 'B': [4,0], 'C': [1,2]})
a
    A   B   C
0   1   4   1
1   2   0   2

I would like to create a new column D containing the non-null values (per row) separated by columns. Like this:

    A   B   C    D
0   1   4   1    1,4,1
1   2   0   2    1,0,2

In reality, I will have many columns. Thanks in advance!

seems like you need df.apply(lambda x :','.join(x.astype(str)),axis=1) — BENY
– BENY, Commented Aug 22, 2017 at 20:18

Brad Solomon · Accepted Answer · 2017-11-19 02:35:28Z

1

An alternative:

a['D'] = a.apply(lambda row: ','.join(row.dropna()
          .astype(int).astype(str)), axis=1)

print(a)
   A  B  C      D
0  1  4  1  1,4,1
1  2  0  2  2,0,2

edited Nov 19, 2017 at 2:35

answered Aug 22, 2017 at 20:19

Brad Solomon

41.2k39 gold badges167 silver badges260 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

cs95 Over a year ago

apply with a lambda and a loop?

andrew_reece · Accepted Answer · 2017-08-22 20:22:08Z

1

# example data with NaN values
a = pd.DataFrame({'A': [np.nan,2], 'B': [4,np.nan], 'C': [1,2]})
a
     A    B  C
0  NaN  4.0  1
1  2.0  NaN  2

# make new column with non-null values
a['D'] = a.apply(lambda x: [val for val in x if not np.isnan(val)], axis=1)
a
     A    B  C           D
0  NaN  4.0  1  [4.0, 1.0]
1  2.0  NaN  2  [2.0, 2.0]

answered Aug 22, 2017 at 20:22

andrew_reece

21.4k3 gold badges40 silver badges64 bronze badges

5 Comments

Brad Solomon Over a year ago

Having trouble with this in pandas 0.20.3 and I'm not sure why, frankly.

andrew_reece Over a year ago

I just tested again, also on 0.20.3, no issues. What's the trouble you're having?

Brad Solomon Over a year ago

Very strange. Mind trying with a = pd.DataFrame({'A': [1,2], 'B': [4,0], 'C': [1,2]}, dtype=float)?

andrew_reece Over a year ago

I get an error then: ValueError: Wrong number of items passed 3, placement implies 1. I knocked it around a bit just now but couldn't figure out why it barfs when all values aren't NaN. I'll have a look later. Good observation, any ideas?

Brad Solomon Over a year ago

No, like I said I'm a bit stumped because your solution is pretty straightforward. And this solution works for both cases.

nanojohn · Accepted Answer · 2017-08-22 20:24:47Z

1

You can do something along the lines of the following:

combVals = []
a = a.T
for col in a.columns:
    combVals.append(str(a[col].dropna().astype(int).tolist())[1:-1])
a = a.T
a['D'] = combVals
print(a)
   A  B  C        D
0  1  4  1  1, 4, 1
1  2  0  2  2, 0, 2

You can remove the spaces in column D by doing: a['D'] = a['D'].str.replace(' ','')

answered Aug 22, 2017 at 20:24

nanojohn

5921 gold badge3 silver badges13 bronze badges

Collectives™ on Stack Overflow

Pandas Get Non-Null Values From Row Into One Cell [duplicate]

3 Answers 3

1 Comment

5 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

5 Comments

Comments

Linked

Related