Pandas Dataframe, how to group columns together in Python

Question

I have a pandas DataFrame and I want to group some of the columns to build higher-level columns.

Example: I have

Index       A       B       C       D
    1    0.25     0.3    0.25    0.66
    2    0.25     0.3    0.25    0.66
    3    0.25     0.3    0.25    0.66

and I want

    Index              AB        ||           CD
    Subindex       A   |      B  ||      C    |      D 
    1            0.25  |    0.3  ||   0.25    |    0.66
    2            0.25  |    0.3  ||   0.25    |    0.66
    3            0.25  |    0.3  ||   0.25    |    0.66

Thank you for your help...

"i have ","i want "...and you've tried?

– Pedro Lobito, Commented Dec 10, 2018 at 21:52
Check multiple index

– BENY, Commented Dec 10, 2018 at 21:53

ALollz · Accepted Answer · 2018-12-10 22:08:44Z

4

Create a dictionary to define your mapping and use pd.MultiIndex.from_tuples. If needed you can also specify names=['level_0', 'level_1'] to add names.

import pandas as pd

d = {'A': 'AB', 'B': 'AB', 'C': 'CD', 'D': 'CD'}
df.columns = pd.MultiIndex.from_tuples([*zip(map(d.get, df), df)])
# Equivalently
# df.columns = pd.MultiIndex.from_tuples([(d[col], col) for col in df.columns])

Output:

         AB         CD      
          A    B     C     D
Index                       
1      0.25  0.3  0.25  0.66
2      0.25  0.3  0.25  0.66
3      0.25  0.3  0.25  0.66
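
For a self-contained run, the sketch below rebuilds the sample frame from the question and applies the same mapping. The construction of `df` is an assumption (the question only shows its printed form), and the level names `group` and `column` are illustrative.

```python
import pandas as pd

# Assumed reconstruction of the question's sample data.
df = pd.DataFrame(
    {'A': [0.25] * 3, 'B': [0.3] * 3, 'C': [0.25] * 3, 'D': [0.66] * 3},
    index=pd.Index([1, 2, 3], name='Index'),
)

# Map each existing column to its parent label.
d = {'A': 'AB', 'B': 'AB', 'C': 'CD', 'D': 'CD'}

# Replace the flat columns with (parent, child) tuples; names are optional.
df.columns = pd.MultiIndex.from_tuples(
    [(d[col], col) for col in df.columns],
    names=['group', 'column'],  # illustrative level names
)

# Selecting a top-level label returns the sub-frame with its inner columns.
ab = df['AB']
print(df)
```

Only the column labels are reassigned here; the underlying values stay the same.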

edited Dec 10, 2018 at 22:08

answered Dec 10, 2018 at 21:55

ALollz


4 Comments

Tbertin Over a year ago

Hi, thank you for your answer, but I've already used a double for loop to fill a new DataFrame as you described, and it doesn't seem to be the fastest option. Is there no way to make the changes in place?

piRSquared Over a year ago

@Tbertin your question/comment doesn't make a lot of sense. This answer does alter the dataframe in place and should be pretty fast as it is only altering the columns object.

piRSquared Over a year ago

@ALollz pd.MultiIndex.from_tuples([*zip(map(d.get, df), df)]) as a fun alternative. Yours is more readable of course (-:

ALollz Over a year ago

Thanks :D. Definitely need to just commit that syntax to memory very soon.

piRSquared · Accepted Answer · 2020-06-04 06:28:43Z

2

`groupby` / `concat` hack

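m = {'A': 'AB', 'B': 'AB', 'C': 'CD', 'D': 'CD'}
# Group the columns by parent label, then concat the groups under those labels.
# groupby(..., axis=1) is deprecated since pandas 2.1.
pd.concat(dict((*df.groupby(m, axis=1),)), axis=1)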

         AB         CD      
          A    B     C     D
Index                       
1      0.25  0.3  0.25  0.66
2      0.25  0.3  0.25  0.66
3      0.25  0.3  0.25  0.66

Note that with this method it is possible to select an arbitrary subset of the columns in the original DataFrame, whereas the other answer appears to require a valid dictionary mapping for every column in the parent DataFrame.
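
A minimal sketch of the subset idea that avoids `groupby` along the column axis (deprecated in recent pandas versions): pass `pd.concat` a dict of sub-frames so the keys become the outer level. The `groups` dict and the extra column `F` are hypothetical and not part of the original answer.

```python
import pandas as pd

# Assumed sample data; 'F' is a hypothetical extra column left out of the grouping.
df = pd.DataFrame(
    {'A': [0.25] * 3, 'B': [0.3] * 3, 'C': [0.25] * 3, 'D': [0.66] * 3,
     'F': [1.0] * 3},
    index=pd.Index([1, 2, 3], name='Index'),
)

# Hypothetical layout: parent label -> list of child columns to keep.
groups = {'AB': ['A', 'B'], 'CD': ['C', 'D']}

# With a dict argument, concat uses the keys as the new outer column level.
grouped = pd.concat({parent: df[cols] for parent, cols in groups.items()}, axis=1)
print(grouped)
```

Columns that do not appear in `groups` are simply absent from the result, so no mapping is needed for them.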

edited Jun 4, 2020 at 6:28

Rory L


answered Dec 10, 2018 at 21:59

piRSquared


3 Comments

Tbertin Over a year ago

Sorry to ask again, but how can I now group AB and CD under a higher level? In the end I would like to be able to write DataFrame['ABCD']['AB']['A'], for example. The same logic doesn't seem to work...
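
One possible reading of this request, sketched with the `from_tuples` approach from the other answer rather than the `groupby` / `concat` one: give every column a three-part tuple. The `parents` dict is a hypothetical helper, and the sample frame is an assumed reconstruction of the question's data.

```python
import pandas as pd

# Assumed sample data matching the question.
df = pd.DataFrame(
    {'A': [0.25] * 3, 'B': [0.3] * 3, 'C': [0.25] * 3, 'D': [0.66] * 3},
    index=pd.Index([1, 2, 3], name='Index'),
)

# Hypothetical mapping: column -> (top label, middle label).
parents = {
    'A': ('ABCD', 'AB'), 'B': ('ABCD', 'AB'),
    'C': ('ABCD', 'CD'), 'D': ('ABCD', 'CD'),
}

# Each column label becomes a (top, middle, column) tuple.
df.columns = pd.MultiIndex.from_tuples(
    [(*parents[col], col) for col in df.columns]
)

# Chained selection walks down one level at a time.
a = df['ABCD']['AB']['A']
```

The same column can also be reached in one step with `df[('ABCD', 'AB', 'A')]`.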

Chacho Fuva Over a year ago

What if I have a column "F" that I don't want to group?

piRSquared Over a year ago

pd.concat(dict((*df.drop(cols2skip, axis=1).groupby(m, 1),)), axis=1) where cols2skip is a list of columns to not include. If there is only one column pd.concat(dict((*df.drop('F', axis=1).groupby(m, 1),)), axis=1)
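
If the ungrouped column should stay in the result instead of being dropped, one alternative (an assumption, not what the comment above does) is to let unmapped columns fall back to their own name as the parent label. The sample frame with column `F` is hypothetical.

```python
import pandas as pd

# Assumed sample data with a hypothetical ungrouped column 'F'.
df = pd.DataFrame(
    {'A': [0.25] * 3, 'B': [0.3] * 3, 'C': [0.25] * 3, 'D': [0.66] * 3,
     'F': [1.0] * 3},
    index=pd.Index([1, 2, 3], name='Index'),
)

m = {'A': 'AB', 'B': 'AB', 'C': 'CD', 'D': 'CD'}

# Work on a copy so the original flat columns remain available.
out = df.copy()

# Columns missing from the mapping use their own name as the parent label.
out.columns = pd.MultiIndex.from_tuples(
    [(m.get(col, col), col) for col in out.columns]
)
print(out)
```

Here `F` ends up under the tuple `('F', 'F')`; another label such as an empty string could be used as the fallback instead.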
