Remove leading comma in header when using pandas to_csv

Question

By default to_csv writes a CSV like

,a,b,c
0,0.0,0.0,0.0
1,0.0,0.0,0.0
2,0.0,0.0,0.0

But I want it to write like this:

a,b,c
0,0.0,0.0,0.0
1,0.0,0.0,0.0
2,0.0,0.0,0.0

How do I achieve this? I can't set index=False because I want to preserve the index. I just want to remove the leading comma.

df = pd.DataFrame(np.zeros((3,3)), columns = ['a','b','c'])
df.to_csv("test.csv") # this results in the first example above.

You do not want to remove that leading comma or all other columns are shifted to left since csv is a comma-separated values text file. — Parfait
– Parfait, Commented Jan 24, 2020 at 13:57
@Parfait The second dataframe in my example above works perfectly well. Give it a try. pd.read_csv("test.csv") (where test.csv is the second example). — JacksonCounty
– JacksonCounty, Commented Jan 24, 2020 at 14:04
Carefully look at that desired result which is exactly what I mention. Column A is no longer aligned to original values but shifted to left and then you have the last column without a header! — Parfait
– Parfait, Commented Jan 24, 2020 at 14:28
It's implied that the index is unnamed, and pd.read_csv interprets that implication correctly. I know this is certainly not best practice, and I don't recommend anyone do it this way, but I needed to do it this way for some legacy reasons. @Parfait — JacksonCounty
– JacksonCounty, Commented Jan 24, 2020 at 15:26
Understood but do note, other than pandas, reading this csv (per accepted answer below) in other applications/languages will result in shifted columns. I had a feeling this was an XY Problem. Your real question should have been handling the legacy reasons! I have yet to met a use case to break best practices. Good luck and happy coding! — Parfait
– Parfait, Commented Jan 24, 2020 at 15:28

jezrael · Accepted Answer · 2020-01-24 14:12:11Z

5

It is possible by write only columns without index first and then data without header in append mode:

df = pd.DataFrame(np.zeros((3,3)), columns = ['a','b','c'], index=list('XYZ'))

pd.DataFrame(columns=df.columns).to_csv("test.csv", index=False)
#alternative for empty df
#df.iloc[:0].to_csv("test.csv", index=False)
df.to_csv("test.csv", header=None, mode='a')

df = pd.read_csv("test.csv")
print (df)
     a    b    c
X  0.0  0.0  0.0
Y  0.0  0.0  0.0
Z  0.0  0.0  0.0

edited Jan 24, 2020 at 14:12

answered Jan 24, 2020 at 14:04

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Parfait · Accepted Answer · 2020-01-24 14:01:52Z

3

Alternatively, try reseting the index so it becomes a column in data frame, named index. This works with multiple indexes as well.

df = df.reset_index()
df.to_csv('output.csv', index = False)

answered Jan 24, 2020 at 14:01

Parfait

108k19 gold badges103 silver badges138 bronze badges

Comments

smarie · Accepted Answer · 2020-01-24 14:05:59Z

3

Simply set a name for your index: df.index.name = 'blah'. This name will appear as the first name in the headers.

import numpy as np
import pandas as pd

df = pd.DataFrame(np.zeros((3,3)), columns = ['a','b','c'])
df.index.name = 'my_index'
print(df.to_csv())

yields

my_index,a,b,c
0,0.0,0.0,0.0
1,0.0,0.0,0.0
2,0.0,0.0,0.0

However if (as per your comment) you wish to have 3 coma-separated names in the headers while there are 4 coma-separated values in the rows of the csv, you'll have to handcraft it. It will NOT be compliant with any csv standard format though.

edited Jan 24, 2020 at 14:05

answered Jan 24, 2020 at 13:53

smarie

5,32434 silver badges50 bronze badges

1 Comment

JacksonCounty Over a year ago

Well I want the header to be "a,b,c", and not "my_index,a,b,c" ? @Parfait

Collectives™ on Stack Overflow

Remove leading comma in header when using pandas to_csv

3 Answers 3

Comments

Comments

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related