Pandas DataFrame grouping by Timestamp

Question

I have a use case where:

Data is of the form: Col1, Col2, Col3 and Timestamp.

Now, I just want to get the counts of the rows vs Timestamp Bins.

i.e. for every half hour bucket (even the ones which have no correponding rows), I need the counts of how many rows are there.

Timestamps are spread over a one year period, so I can't divide it into 24 buckets.

I have to bin them at 30 minutes interval.

cs95 · Accepted Answer · 2019-07-02 17:34:16Z

19

`groupby` via `pd.Grouper`

# optionally, if needed
# df['Timestamp'] = pd.to_datetime(df['Timestamp'], errors='coerce')  
df.groupby(pd.Grouper(key='Timestamp', freq='30min')).count()

`resample`

df.set_index('Timestamp').resample('30min').count()

edited Jul 2, 2019 at 17:34

answered Mar 26, 2018 at 4:34

cs95

406k106 gold badges744 silver badges797 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

david nadal Over a year ago

@COLDSPEED thanks a lot! it works! what does errors=coerce do? And one more question: resample does it sample all the rows?

cs95 Over a year ago

@davidnadal it will convert invalid datetime strings to NaT (instead of throwing parser errors). Resample will sample all rows.

Collectives™ on Stack Overflow

Pandas DataFrame grouping by Timestamp

1 Answer 1

`groupby` via `pd.Grouper`

`resample`

2 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

groupby via pd.Grouper

resample

2 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related

`groupby` via `pd.Grouper`

`resample`