For loop after pandas groupby

Question

My input dataframe looks like:

name   uniqueID
kate   0001
sam    0001
lucy   0002
wes    0001
kip    0002

I have the following:

import pandas as pd

addData = pd.read_csv('/input.csv')
grouped = addData.groupby(['uniqueID'])
filename = addData['uniqueID'][0]
output_csv = '/test/output_{}.csv'.format(filename)

for name, group in grouped:
    group.to_csv(output_csv)

My output is partly correct: I get one file containing all the records for a single 'uniqueID', e.g. output0001.csv:

name   uniqueID
kate   0001
sam    0001
wes    0001

The problem is that I am only getting one file. My loop does not produce both output0001.csv and output0002.csv.

Do: group.to_csv('/test/output_{}.csv'.format(group['uniqueID'].iloc[0]))
– YOLO, Commented Dec 19, 2018 at 20:27
addData['uniqueID'][0] will only ever be a single value, 0001, so output_csv will only ever contain that value.
– G. Anderson, Commented Dec 19, 2018 at 20:28
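
The point made in the comments can be checked without writing any files. The sketch below is an illustration only: it rebuilds the question's sample data in memory (the comma-separated layout and `dtype=str` are assumptions, chosen so the leading zeros survive) and records the path each group would be written to. `paths_written` is a hypothetical helper list, not part of the original code.

```python
import io

import pandas as pd

# Sample rows from the question; dtype=str is an assumption that keeps "0001" intact.
csv_text = "name,uniqueID\nkate,0001\nsam,0001\nlucy,0002\nwes,0001\nkip,0002\n"
addData = pd.read_csv(io.StringIO(csv_text), dtype=str)

# As in the question: the path is built once, before the loop, from row 0 only.
filename = addData['uniqueID'][0]
output_csv = '/test/output_{}.csv'.format(filename)

# Record the target path per group instead of calling to_csv.
paths_written = []
for name, group in addData.groupby('uniqueID'):
    paths_written.append((name, output_csv))

print(paths_written)
```

Both groups map to the same path, so each `to_csv` call overwrites the previous file and only one file remains on disk.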

Muffindorf · Accepted Answer · 2018-12-19 20:35:02Z

2

This worked:

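# Group on the column name itself so that `name` is the plain key value
# (a one-element list makes pandas 2.x yield a 1-tuple instead).
grouped = addData.groupby('uniqueID')
output_csv = 'output_{}.csv'

for name, group in grouped:
    # Fill in the template inside the loop, once per group
    group.to_csv(output_csv.format(name))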
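
For anyone who wants to try the loop above end to end, here is a minimal self-contained sketch. It makes several assumptions that are not in the original post: the data is built in memory rather than read from `/input.csv`, `dtype=str` is used so the IDs keep their leading zeros, files go to a temporary directory (`out_dir` is a hypothetical name), and `index=False` drops the row index from the output.

```python
import io
import os
import tempfile

import pandas as pd

# Sample rows from the question; dtype=str is an assumption that keeps "0001" intact.
csv_text = "name,uniqueID\nkate,0001\nsam,0001\nlucy,0002\nwes,0001\nkip,0002\n"
addData = pd.read_csv(io.StringIO(csv_text), dtype=str)

# Hypothetical output location so the sketch does not touch real paths.
out_dir = tempfile.mkdtemp()

# Grouping by the column name (a string) makes `name` the group key itself.
for name, group in addData.groupby('uniqueID'):
    # The file name is built per group, so each group gets its own file.
    path = os.path.join(out_dir, 'output_{}.csv'.format(name))
    group.to_csv(path, index=False)

print(sorted(os.listdir(out_dir)))
```

If the IDs are read as integers instead (the `read_csv` default for a column like this), the group keys become 1 and 2, and the file names would follow those values rather than the zero-padded form.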
