pandas add columns when read from a csv file

Question

I want to read from a CSV file using pandas read_csv. The CSV file doesn't have column names. When I use pandas to read the CSV file, the first row is set as columns by default. But when I use df.columns = ['ID', 'CODE'], the first row is gone. I want to add, not replace.

df = pd.read_csv(CSV)
df

    a   55000G707270
0   b   5l0000D35270
1   c   5l0000D63630
2   d   5l0000G45630
3   e   5l000G191200
4   f   55000G703240


df.columns=['ID','CODE']
df

    ID          CODE
0   b   5l0000D35270
1   c   5l0000D63630
2   d   5l0000G45630
3   e   5l000G191200
4   f   55000G703240

Possible duplicate of How to add header row to a pandas DataFrame — Leb
– Leb, Commented Dec 24, 2016 at 13:04

jezrael · Accepted Answer · 2016-12-24 10:03:59Z

13

I think you need parameter names in read_csv:

df = pd.read_csv(CSV, names=['ID','CODE'])

names : array-like, default None

List of column names to use. If file contains no header row, then you should explicitly pass header=None. Duplicates in this list are not allowed unless mangle_dupe_cols=True, which is the default.

answered Dec 24, 2016 at 10:03

jezrael

868k103 gold badges1.4k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Ando · Accepted Answer · 2020-06-04 18:01:39Z

2

The reason there are extra index columns add is because to_csv() writes an index per default, so you can either disable index when saving your CSV:

df.to_csv('file.csv', index=False)

or you can specify an index column when reading:

df = pd.read_csv('file.csv', index_col=0)

answered Jun 4, 2020 at 18:01

Ando

211 bronze badge

Comments

ZdaR · Accepted Answer · 2016-12-24 10:04:17Z

1

You may pass the column names at the time of reading the csv file itself as :

df = pd.read_csv(csv_path, names = ["ID", "CODE"])

answered Dec 24, 2016 at 10:04

ZdaR

23.1k7 gold badges71 silver badges90 bronze badges

Comments

Carles Mitjans · Accepted Answer · 2016-12-24 10:04:18Z

1

Use names argument in function call to add the columns yourself:

df = pd.read_csv(CSV, names=['ID','CODE'])

answered Dec 24, 2016 at 10:04

Carles Mitjans

4,8663 gold badges22 silver badges39 bronze badges

Comments

MaxU - stand with Ukraine · Accepted Answer · 2016-12-24 13:00:20Z

1

you need both: header=None and names=['ID','CODE'], because there are no column names/labels/headers in your CSV file:

df = pd.read_csv(CSV, header=None, names=['ID','CODE'])

answered Dec 24, 2016 at 13:00

MaxU - stand with Ukraine

212k37 gold badges402 silver badges436 bronze badges

Collectives™ on Stack Overflow

pandas add columns when read from a csv file

5 Answers 5

Comments

Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related