Remove top row from a dataframe

Question

I have a dataframe that looks like this:

         level_0              level_1 Repo Averages for 27 Jul 2018
0  Business Date           Instrument                           Ccy
1     27/07/2018  GC_AUSTRIA_SUB_10YR                           EUR
2     27/07/2018    R_RAGB_1.15_10/18                           EUR
3     27/07/2018    R_RAGB_4.35_03/19                           EUR
4     27/07/2018    R_RAGB_1.95_06/19                           EUR

I am trying to get rid of the top row and only keep

   Business Date           Instrument         Ccy
0     27/07/2018  GC_AUSTRIA_SUB_10YR         EUR
1     27/07/2018    R_RAGB_1.15_10/18         EUR
2     27/07/2018    R_RAGB_4.35_03/19         EUR
3     27/07/2018    R_RAGB_1.95_06/19         EUR

I tried df.columns.droplevel(0) but not successful any help is more than welcome

Where are you getting the data from? It looks like an issue in reading the data. — asongtoruin
– asongtoruin, Commented Jul 31, 2018 at 10:39
You are likely to get answers quicker if you have runnable code in your question. — Dov Grobgeld
– Dov Grobgeld, Commented Jul 31, 2018 at 10:40
It is an automated file that has a weird structure. the top row it is like a title. So I have to read in everything and then delete undesirable rows — SBad
– SBad, Commented Jul 31, 2018 at 10:42

Gonçalo Peres · Accepted Answer · 2020-07-22 09:35:51Z

8

You can take advantage of the parameter header (Read here more about the header parameter in pandas).

Let's say that you have the following dataset

df = pd.read_csv("Prices.csv")
print(df)

That outputs

              0       1     2         3         4
0      DATA      SESSAO  HORA  PRECO_PT  PRECO_ES
1      1/1/2020  0       1     41,88     41,88   
2      1/1/2020  0       2     38,60     38,60   
3      1/1/2020  0       3     36,55     36,55

By simply passing the header = 0 like this

df = pd.read_csv("Prices.csv", header=0)
print(df)

You will get what you want

           DATA  SESSAO  HORA PRECO_PT PRECO_ES
0      1/1/2009  0       1     55,01    55,01  
1      1/1/2009  0       2     56,13    56,13  
2      1/1/2009  0       3     50,59    50,59  
3      1/1/2009  0       4     45,83    45,83  
4      1/1/2009  0       5     42,07    41,90

answered Jul 22, 2020 at 9:35

Gonçalo Peres

13.8k5 gold badges73 silver badges95 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Stephen Whitmore Over a year ago

This gives a working solution with a clear explanation AND links to relevant documentation. Thanks!

Joe · Accepted Answer · 2020-08-22 14:48:34Z

7

You can try so:

df.columns = df.iloc[0]
df = df.reindex(df.index.drop(0)).reset_index(drop=True)
df.columns.name = None

Output:

  Business Date           Instrument  Ccy
0    27/07/2018  GC_AUSTRIA_SUB_10YR  EUR
1    27/07/2018    R_RAGB_1.15_10/18  EUR
2    27/07/2018    R_RAGB_4.35_03/19  EUR
3    27/07/2018    R_RAGB_1.95_06/19  EUR

edited Aug 22, 2020 at 14:48

answered Jul 31, 2018 at 10:50

Joe

12.4k7 gold badges44 silver badges58 bronze badges

Comments

Zachary Wyman · Accepted Answer · 2020-08-07 21:29:15Z

3

You can try using slicing.

df = df[1:]

This will remove the first row of your dataframe.

answered Aug 7, 2020 at 21:29

Zachary Wyman

3312 silver badges17 bronze badges

2 Comments

Joe Over a year ago

even if the answer is accepted, have you tested it on the given example?

Arun Over a year ago

agree with @Joe , this example is not working.

vlizana · Accepted Answer · 2020-06-27 02:20:16Z

1

df.drop(row_start, row_end)

This will help

edited Jun 27, 2020 at 2:20

vlizana

3,2821 gold badge20 silver badges28 bronze badges

answered Jun 26, 2020 at 14:29

Emeka Boris Ama

4674 silver badges5 bronze badges

1 Comment

vlizana Over a year ago

don't use code snippets if the code is not executable, use code formatting instead.

Egret · Accepted Answer · 2022-09-17 09:49:20Z

0

I tested the comment by jeremycg. It works very well and is succinct. Just want more people to see, here it is again -

my_df = pd.read_csv(r"C:\path\to\my\file.csv", skiprows = 1)

answered Sep 17, 2022 at 9:49

Egret

4171 gold badge4 silver badges16 bronze badges

Collectives™ on Stack Overflow

Remove top row from a dataframe

5 Answers 5

1 Comment

Comments

2 Comments

1 Comment

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

1 Comment

Comments

2 Comments

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Related