Group seperated counting values in a pandas dataframe

Question

I have following df

     A   B
0    1   10
1    2   20
2    NaN 5
3    3   1
4    NaN 2
5    NaN 3
6    1   10
7    2   50
8    Nan 80
9    3   5

Consisting of repeating sequences from 1-3 seperated by a variable number of NaN's.I want to groupby each this sequences from 1-3 and get the minimum value of column B within these sequences.

Desired Output something like:

     B_min
0    1
6    5

Many thanks beforehand

draj

post you written code

Zaraki Kenpachi
– Zaraki Kenpachi

2020-03-11 13:28:59 +00:00
Commented Mar 11, 2020 at 13:28 — Zaraki Kenpachi
– Zaraki Kenpachi, Commented Mar 11, 2020 at 13:28

jezrael · Accepted Answer · 2020-03-11 13:50:39Z

1

Idea is first remove rows by missing values by DataFrame.dropna, then use GroupBy.cummin by helper Series created by compare A for equal by Series.eq and Series.cumsum, last data cleaning to one column DataFrame:

df = (df.dropna(subset=['A'])
       .groupby(df['A'].eq(1).cumsum())['B']
       .min()
       .reset_index(drop=True)
       .to_frame(name='B_min'))
print (df)
   B_min
0      1
1      5

answered Mar 11, 2020 at 13:50

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

draj Over a year ago

Thanks! I also was playing around with dropna first but I lack knowledge of such helper funcs like .eq(). Thank you very much!

davidbilla · Accepted Answer · 2020-03-11 13:48:40Z

1

All you need to df.groupby() and apply min(). Is this what you are expecting?

df.groupby('A')['B'].min()

Output:

If you don't want the NaNs in your group you can drop them using df.dropna()

df.dropna().groupby('A')['B'].min()

edited Mar 11, 2020 at 13:48

answered Mar 11, 2020 at 13:42

davidbilla

2,2321 gold badge22 silver badges28 bronze badges

1 Comment

draj Over a year ago

Unfortunately not, it’s not the desired output. Thanks anyways!

Collectives™ on Stack Overflow

Group seperated counting values in a pandas dataframe

2 Answers 2

1 Comment

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

1 Comment

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related