Aggregation with sum based on condition

Question

I have a DataFrame like this:

df = pd.DataFrame(data= {'month' : [2,7,4,8], 'sales' : [10,40,70,50]})

I would like to get the sum of sales aggregated by month. However, I want to combine the months into two groups: the first for months 1-6 (resulting in sales of 80) and the second for months 7-12 (resulting in sales of 90).

What's the best way to do this?

I won't post this as an answer since it uses extra technology, but with duckdb (pip install duckdb) you could answer your query using SQL directly on the DataFrame, like so: duckdb.query("SELECT CASE WHEN month <= 6 THEN 1 ELSE 2 END as halfyear, sum(sales) FROM df GROUP BY halfyear").to_df()
– orlp, Commented Nov 30, 2021 at 15:06

Fredaroo · Accepted Answer · 2021-11-30 15:09:57Z

1

One way to do this is to create a column that acts as a grouping key. This can be done like so:

import numpy as np
import pandas as pd

df = pd.DataFrame(data={'month': [2, 7, 4, 8], 'sales': [10, 40, 70, 50]})
# Grouping key: 0 for months 1-6, 1 for months 7-12
df["foo"] = np.where(df['month'] < 7, 0, 1)
# Sum every remaining column within each group
bar = df.groupby(['foo']).sum()

Here, a foo column is created that assigns each row to a group depending on the condition you defined, i.e., df['month'] < 7: months 1-6 get 0 and months 7-12 get 1. You can then group by this column with a regular groupby() and take the sum.

Note you can also use df.groupby(['foo'])['sales'].agg('sum') if you only want to keep the sales column.
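As a minimal sketch of the approach above, assuming pandas and NumPy are installed, the grouping key can also carry readable labels instead of 0 and 1. The column name half and the labels "1-6" and "7-12" are illustrative choices, not part of the original answer.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"month": [2, 7, 4, 8], "sales": [10, 40, 70, 50]})

# Label each row by half-year: months 1-6 versus months 7-12.
# "half" is a hypothetical column name chosen for this sketch.
df["half"] = np.where(df["month"] < 7, "1-6", "7-12")

# Select only the sales column so that month is not summed as well.
totals = df.groupby("half")["sales"].sum()
print(totals)
```

Selecting ["sales"] before calling sum() avoids also summing the month column, which has no useful meaning here.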


not_speshal · Accepted Answer · 2021-11-30 15:22:43Z

1

You can use pd.cut to assign labels to the months and use these in a groupby:

>>> df.groupby(pd.cut(df["month"], bins=[0, 6, 12], labels=["1-6", "7-12"]))["sales"].sum()

month
1-6     80
7-12    90
Name: sales, dtype: int64
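The one-liner above can be unpacked into separate steps. This is a sketch under the assumption of a reasonably recent pandas version. The variable names half and totals are illustrative, and passing observed=False explicitly is my addition to avoid depending on the default, which has been changing across pandas versions.

```python
import pandas as pd

df = pd.DataFrame({"month": [2, 7, 4, 8], "sales": [10, 40, 70, 50]})

# pd.cut bins are right-inclusive by default, so bins=[0, 6, 12]
# produces the intervals (0, 6] and (6, 12], i.e. months 1-6 and 7-12.
half = pd.cut(df["month"], bins=[0, 6, 12], labels=["1-6", "7-12"])

# observed=False keeps every label in the result, even one with no rows.
totals = df.groupby(half, observed=False)["sales"].sum()
print(totals)
```

Because the labels form an ordered categorical, the result keeps the "1-6" group before the "7-12" group regardless of the row order in df.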

