Python Pandas Group by date using datetime data

Question

I have a datetime column, Date_Time, that I want to group by date without creating a new column. Is this possible? I tried the following, but it does not work:

df = pd.groupby(df,by=[df['Date_Time'].date()])

jezrael · Accepted Answer · 2016-09-08 21:40:19Z

107

You can group by the dates of the Date_Time column using dt.date:

df = df.groupby([df['Date_Time'].dt.date]).mean()

Sample:

df = pd.DataFrame({'Date_Time': pd.date_range('10/1/2001 10:00:00', periods=3, freq='10H'),
                   'B':[4,5,6]})

print (df)
   B           Date_Time
0  4 2001-10-01 10:00:00
1  5 2001-10-01 20:00:00
2  6 2001-10-02 06:00:00

print (df['Date_Time'].dt.date)
0    2001-10-01
1    2001-10-01
2    2001-10-02
Name: Date_Time, dtype: object

df = df.groupby([df['Date_Time'].dt.date])['B'].mean()
print(df)
Date_Time
2001-10-01    4.5
2001-10-02    6.0
Name: B, dtype: float64

Another solution with resample:

df = df.set_index('Date_Time').resample('D')['B'].mean()

print(df)
Date_Time
2001-10-01    4.5
2001-10-02    6.0
Freq: D, Name: B, dtype: float64
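To check that grouping this way really leaves the frame without a new column, here is a minimal self-contained sketch. It rebuilds the sample data with explicit timestamps (so it does not depend on a frequency alias) and keeps the result in separate variables instead of overwriting df. The dt.normalize() variant is not part of the answer above; it is shown only as an assumption-level alternative for when you want the group keys to stay datetime64 rather than become Python date objects.

```python
import pandas as pd

# Same sample data as above, written with explicit timestamps
df = pd.DataFrame({
    'Date_Time': pd.to_datetime(['2001-10-01 10:00:00',
                                 '2001-10-01 20:00:00',
                                 '2001-10-02 06:00:00']),
    'B': [4, 5, 6],
})

# Group by the calendar date; the keys are datetime.date objects
# and df itself gains no extra column
by_date = df.groupby(df['Date_Time'].dt.date)['B'].mean()

# Variant (not from the answer): truncate each timestamp to midnight,
# so the group keys remain datetime64 values
by_day = df.groupby(df['Date_Time'].dt.normalize())['B'].mean()

print(by_date)
print(by_day)
print(df.columns.tolist())
```

Both results hold one mean per calendar day, and df still has only the Date_Time and B columns.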

edited Sep 8, 2016 at 21:40

answered Sep 8, 2016 at 21:01

jezrael


wjandrea · Accepted Answer · 2022-10-23 02:00:56Z

92

`resample`

df.resample('D', on='Date_Time').mean()

              B
Date_Time      
2001-10-01  4.5
2001-10-02  6.0

`Grouper`

As suggested by @JosephCottam

df.set_index('Date_Time').groupby(pd.Grouper(freq='D')).mean()

              B
Date_Time      
2001-10-01  4.5
2001-10-02  6.0

Deprecated uses of `TimeGrouper`

In older pandas versions you could set the index to 'Date_Time' and use pd.TimeGrouper, which has since been deprecated in favor of pd.Grouper:

df.set_index('Date_Time').groupby(pd.TimeGrouper('D')).mean().dropna()

              B
Date_Time      
2001-10-01  4.5
2001-10-02  6.0

edited Oct 23, 2022 at 2:00

wjandrea


answered Sep 8, 2016 at 21:19

piRSquared


4 Comments

GoBlue_MathMan Over a year ago

This is great! How do I prevent it from adding dates that have no data? For example, if I had data for days 9/1, 9/2, and 9/4, it still has 9/3 in there with NaN values.

piRSquared Over a year ago

@GoBlue_MathMan Use .dropna()

k.ko3n Over a year ago

Here, when grouping by 'hour', it adds hours that did not exist in the source file with zero values.

wjandrea Over a year ago

You can avoid .set_index('Date_Time') by doing pd.Grouper(key='Date_Time', freq='D'). Could be useful if the index is significant.
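The points raised in these comments can be illustrated together. The sketch below uses made-up data with no rows on 2001-09-03 (the values are hypothetical, chosen only to mirror the 9/1, 9/2, 9/4 example) and groups with pd.Grouper(key=...), so the existing index is left alone. It shows the empty day appearing as NaN for mean and as 0 for sum, and two ways to deal with that: dropna() after a mean, and min_count=1 on sum so empty bins become NaN instead of 0.

```python
import pandas as pd

# Hypothetical data: nothing recorded on 2001-09-03
df = pd.DataFrame({
    'Date_Time': pd.to_datetime(['2001-09-01 08:00',
                                 '2001-09-02 09:00',
                                 '2001-09-04 10:00']),
    'B': [1, 2, 4],
})

# Group on the column directly; no set_index needed
grouped = df.groupby(pd.Grouper(key='Date_Time', freq='D'))['B']

daily_mean = grouped.mean()            # empty day shows up as NaN
daily_sum = grouped.sum()              # empty day shows up as 0
daily_sum_nan = grouped.sum(min_count=1)  # empty day becomes NaN instead

# Drop the days that had no data
no_gaps = daily_mean.dropna()

print(daily_mean)
print(daily_sum)
print(no_gaps)
```

If you never want the empty bins in the first place, grouping by df['Date_Time'].dt.date only produces keys for dates that actually occur in the data.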

wjandrea · Accepted Answer · 2022-10-23 02:17:06Z

7

df.groupby(pd.Grouper(key='Date_Time', axis=0, freq='M')).sum()

The freq argument sets the grouping period:

M for month
Y for year
D for day
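As far as I know, recent pandas releases emit deprecation warnings for the 'M' and 'Y' offset aliases and suggest 'ME' and 'YE' instead, so which spelling works with pd.Grouper depends on your installed version. A version-tolerant sketch, using a different technique from the answer above, is to group by a period derived from the column with dt.to_period, where 'M' and 'Y' remain the period aliases. The data below is made up for illustration.

```python
import pandas as pd

# Hypothetical data spanning two months
df = pd.DataFrame({
    'Date_Time': pd.to_datetime(['2001-10-01 10:00',
                                 '2001-10-15 12:00',
                                 '2001-11-02 06:00']),
    'B': [4, 5, 6],
})

# Group by calendar month; keys are Period objects such as 2001-10
monthly = df.groupby(df['Date_Time'].dt.to_period('M'))['B'].sum()

# Group by calendar year
yearly = df.groupby(df['Date_Time'].dt.to_period('Y'))['B'].sum()

print(monthly)
print(yearly)
```

Unlike pd.Grouper with a freq, this only yields groups for periods that occur in the data, so months without rows are not filled in.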

edited Oct 23, 2022 at 2:17

wjandrea


answered Apr 18, 2022 at 11:48

Avijit Das


