Removing index column in pandas when reading a csv

Question

I have the following code which imports a CSV file. There are 3 columns and I want to set the first two of them to variables. When I set the second column to the variable "efficiency" the index column is also tacked on. How can I get rid of the index column?

df = pd.DataFrame.from_csv('Efficiency_Data.csv', header=0, parse_dates=False)
energy = df.index
efficiency = df.Efficiency
print efficiency

I tried using

del df['index']

after I set

energy = df.index

which I found in another post but that results in "KeyError: 'index' "

Community · Accepted Answer · 2021-09-18 15:06:34Z

400

When writing to and reading from a CSV file include the argument index=False and index_col=False, respectively. Follows an example:

To write:

 df.to_csv(filename, index=False)

and to read from the csv

df.read_csv(filename, index_col=False)

This should prevent the issue so you don't need to fix it later.

edited Sep 18, 2021 at 15:06

CommunityBot

11 silver badge

answered Apr 12, 2016 at 11:31

Steve

4,6784 gold badges21 silver badges27 bronze badges

Sign up to request clarification or add additional context in comments.

8 Comments

Ravindra S Over a year ago

Thanks a lot.This is exactly what is the question is looking for.

J.D Over a year ago

"header = False" works for removing headers in the same way

Vedda Over a year ago

should be index_col=False.

cacti5 Over a year ago

Using df.to_sql("table",cursor,if_exists="append",index=False) also fixes the sqlite error sqlite3.OperationalError: table message has no column named index

matt wilkie Over a year ago

@vedda it seems to be index=False for to_excel() and index_col=False with read_csv() in pandas 0.23.4. :-/

|

vvvvv · Accepted Answer · 2024-12-15 13:22:26Z

145

df.reset_index(drop=True, inplace=True)

edited Dec 15, 2024 at 13:22

vvvvv

32.9k19 gold badges70 silver badges103 bronze badges

answered Mar 6, 2018 at 10:57

Subhojit Mukherjee

1,5451 gold badge10 silver badges2 bronze badges

2 Comments

tommy.carstensen Over a year ago

This is actually my favorite solution, but not a very elaborate answer. The manual reads this about the argument drop: "Do not try to insert index into dataframe columns. This resets the index to the default integer index." pandas.pydata.org/pandas-docs/stable/generated/…

questionto42 Over a year ago

@tommy.carstensen Then how would you avoid getting the integers on the index as a replacement of the previous index? I think it is a misunderstanding of the text of your link. The question here is to drop the index. And this is reached here. You get the default integers, since there is no dateframe without an index, but you have dropped the previous index. That is why this answer should be the accepted answer, also because it uses the memory efficient inplace=True.

Jean-François Corbett · Accepted Answer · 2019-05-29 07:58:11Z

90

DataFrames and Series always have an index. Although it displays alongside the column(s), it is not a column, which is why del df['index'] did not work.

If you want to replace the index with simple sequential numbers, use df.reset_index().

To get a sense for why the index is there and how it is used, see e.g. 10 minutes to Pandas.

edited May 29, 2019 at 7:58

Jean-François Corbett

38.7k30 gold badges145 silver badges192 bronze badges

answered Nov 20, 2013 at 21:53

Dan Allan

35.5k6 gold badges72 silver badges64 bronze badges

6 Comments

Bogdan Janiszewski Over a year ago

Thanks! I decided to just import it a different way not using pandas. I have to perform some arithmetic on each of the columns and python wasn't liking have the index column attached. Pandas is certainly the easiest way to import data but not always the best I found out.

Jamie Bull Over a year ago

Did you try using Pandas to do the arithmetic?

Quant Over a year ago

can one remove the index name?

Dan Allan Over a year ago

Yes, index.name = None.

deadcode Over a year ago

Yes, clearly the next answer should be the accepted one.

|

Natheer Alabsi · Accepted Answer · 2017-11-16 01:32:03Z

21

You can set one of the columns as an index in case it is an "id" for example. In this case the index column will be replaced by one of the columns you have chosen.

df.set_index('id', inplace=True)

edited Nov 16, 2017 at 1:32

answered Dec 12, 2016 at 4:18

Natheer Alabsi

2,8804 gold badges23 silver badges29 bronze badges

1 Comment

Azurespot Over a year ago

Hmm, this didn't work for me. I got "None" as a console printout.

Bhanu Pratap Singh · Accepted Answer · 2016-01-21 12:02:35Z

8

If your problem is same as mine where you just want to reset the column headers from 0 to column size. Do

df = pd.DataFrame(df.values);

EDIT:

Not a good idea if you have heterogenous data types. Better just use

df.columns = range(len(df.columns))

edited Jan 21, 2016 at 12:02

answered Jan 21, 2016 at 11:21

Bhanu Pratap Singh

1,0871 gold badge13 silver badges15 bronze badges

1 Comment

Nick_Jo Over a year ago

This is what I wanted, thanks. My situation was where I'd import a text file with sep=' ' that led to extra na columns. On further research I killed the problem earlier now and for others, sep=' ' does not equate to delim_whitespace=True....

007mrviper · Accepted Answer · 2023-06-18 16:30:05Z

5

I tried index_col=False, and index_col=None, from the answers posted for this question but none worked.
But index_col=0 worked.

So do like this when reading a file if you want to drop the unwanted index column.
df = pd.read_csv('filename.csv', index_col=0)

answered Jun 18, 2023 at 16:30

007mrviper

5296 silver badges22 bronze badges

Comments

yemu · Accepted Answer · 2013-11-20 21:47:08Z

3

you can specify which column is an index in your csv file by using index_col parameter of from_csv function if this doesn't solve you problem please provide example of your data

answered Nov 20, 2013 at 21:47

yemu

28.8k10 gold badges34 silver badges30 bronze badges

Comments

Lord Varis · Accepted Answer · 2018-09-14 14:02:55Z

3

One thing that i do is df=df.reset_index() then df=df.drop(['index'],axis=1)

answered Sep 14, 2018 at 14:02

Lord Varis

571 bronze badge

2 Comments

Vasin Yuriy Over a year ago

Error: "labels ['index'] not contained in axis"

questionto42 Over a year ago

@VasinYuriy this is meant like df.reset_index().drop(columns=['yourfirstindex', 'yoursecondindex']), it works with 'index' only in the standard case that the index does not have a name and then becomes a column called 'index' with df.reset_index().drop(columns=['index']). The added parameter axis=1 is the default. This method is not recommended, @SubhojitMukherjee's reset_index(inplace=True) works "inplace" and thus saves memory.

Ali Taheri · Accepted Answer · 2021-08-07 03:41:46Z

3

To remove or not to create the default index column, you can set the index_col to False and keep the header as Zero. Here is an example of how you can do it.

recording = pd.read_excel("file.xls",
                     sheet_name= "sheet1",
                     header= 0,
                     index_col= False)

The header = 0 will make your attributes to headers and you can use it later for calling the column.

edited Aug 7, 2021 at 3:41

answered Aug 7, 2021 at 3:26

Ali Taheri

1463 bronze badges

Comments

Francis Ezeani · Accepted Answer · 2022-08-23 21:49:45Z

-1

It works for me this way:

Df = data.set_index("name of the column header to start as index column" )

answered Aug 23, 2022 at 21:49

Francis Ezeani

11 bronze badge

Collectives™ on Stack Overflow

Removing index column in pandas when reading a csv

10 Answers 10

8 Comments

2 Comments

6 Comments

1 Comment

1 Comment

Comments

Comments

2 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

10 Answers 10

8 Comments

2 Comments

6 Comments

1 Comment

1 Comment

Comments

Comments

2 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related