Erase blank rows while reading csv file

Question

I have tried to delete the blank rows from my csv file, but it is not working: it only writes out the first line.

Please take a look and tell me how I can get all the rows with text and skip the rows that are blank.

Here is the code. It just reads out the first line of the csv file:

Thank you in advance!

where is the code?

– asongtoruin, Commented Jul 27, 2017 at 9:27
pd.read_csv(...).dropna()

– cs95, Commented Jul 27, 2017 at 9:27
Hmmm, blank rows are omitted by default.

– jezrael, Commented Jul 27, 2017 at 9:28

Ignacio Vergara Kausel · Accepted Answer · 2017-07-27 09:37:50Z

6

First, read your csv file with pandas:

df = pd.read_csv('input.csv')

Then remove the blank rows:

df = df.dropna()

For more details on dropna, check the documentation.
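One thing worth knowing: dropna() with no arguments drops every row that has at least one missing value, not only the rows that are completely empty. If only fully empty rows should go, how="all" is probably what you want. A minimal sketch, assuming made-up sample data where one row is just delimiters and another is missing a single value:

```python
import io

import pandas as pd

# Hypothetical sample: the second data row is only delimiters,
# the third data row is missing one value.
raw = "a;b;c\n1;2;3\n;;\n4;;6\n"

df = pd.read_csv(io.StringIO(raw), sep=";")

# Default: a row is dropped if ANY of its values is NaN.
any_missing_dropped = df.dropna()

# how="all": a row is dropped only if ALL of its values are NaN.
all_missing_dropped = df.dropna(how="all")

print(len(df), len(any_missing_dropped), len(all_missing_dropped))
```

With the default, the partially filled row is lost as well, so it is worth checking which behaviour you actually need.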

edited Jul 27, 2017 at 9:37

Ignacio Vergara Kausel


answered Jul 27, 2017 at 9:34

Mohamed Thasin ah


2 Comments

Samadi Salahedine Over a year ago

Is there any way to optimize the row-deleting part? I need this for very large csv files (around 70 GB).

Mohamed Thasin ah Over a year ago

@SamadiSalahedine: dropna is an efficient way of dropping NaN rows. If the file is too large for pandas to handle easily, I suggest using dask. For more details, see dask.pydata.org/en/latest/…
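If an extra dependency is not an option, one alternative for very large files is to stream the file row by row with the standard csv module, so that only one row is in memory at a time. This is only a sketch, not a benchmarked solution: the function name copy_non_blank_rows and the ';' delimiter are assumptions for illustration.

```python
import csv


def copy_non_blank_rows(src_path, dst_path, delimiter=";"):
    """Stream src_path to dst_path, skipping rows whose fields are all empty.

    Hypothetical helper; returns the number of rows written.
    """
    kept = 0
    with open(src_path, newline="") as src, open(dst_path, "w", newline="") as dst:
        reader = csv.reader(src, delimiter=delimiter)
        writer = csv.writer(dst, delimiter=delimiter)
        for row in reader:
            # csv.reader yields [] for an empty line; the any() check also
            # skips whitespace-only lines and rows such as ";;;".
            if any(field.strip() for field in row):
                writer.writerow(row)
                kept += 1
    return kept
```

Note that csv.writer ends lines with "\r\n" by default; pass lineterminator="\n" to csv.writer if the output should keep Unix line endings.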

jezrael · Accepted Answer · 2017-07-27 09:32:01Z

2

The problem is that

for line in df:
    print(line)

iterates over the column names, not the rows, which would explain why only the header appears.
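To make the difference concrete, here is a small sketch with made-up data (the column names and values are assumptions). Iterating over the dataframe itself yields column labels, while itertuples() is one of the ways to walk over the rows:

```python
import io

import pandas as pd

# Hypothetical two-column sample.
raw = "B;D\n1;x\n2;y\n"
df = pd.read_csv(io.StringIO(raw), sep=";")

# Iterating over a DataFrame yields its column labels.
columns_seen = [name for name in df]

# itertuples(index=False) yields one tuple per row instead.
rows_seen = [tuple(row) for row in df.itertuples(index=False)]

print(columns_seen)
print(rows_seen)
```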

answered Jul 27, 2017 at 9:32

jezrael


Bharath M Shetty · Accepted Answer · 2017-07-27 09:58:13Z

Suppose I have a csv file like the one below, with blank rows:

B;D;K;N;M;R 

0;2017-04-27 01:35:30;C;3.5;A;01:15:00;23.0 
1;2017-04-27 01:37:30;B;3.5;B;01:13:00;24.0 


2;2017-04-27 01:39:00;K;3.5;C;00:02:00;99.0




4;2017-04-27 01:39:00;K;3.5;C;00:02:00;99.0

Then df = pd.read_csv('input.csv', delimiter=';') gives a dataframe that ignores the blank lines:

                     B  D    K  N         M    R 
0  2017-04-27 01:35:30  C  3.5  A  01:15:00  23.0
1  2017-04-27 01:37:30  B  3.5  B  01:13:00  24.0
2  2017-04-27 01:39:00  K  3.5  C  00:02:00  99.0
4  2017-04-27 01:39:00  K  3.5  C  00:02:00  99.0

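Your code works when you use open(), which lets you loop over the file line by line. pandas read_csv instead converts the csv file into a dataframe, so you may be confusing the two.

new_contents = []
# Open the file directly; iterating over a file object yields its lines
with open('input.csv') as f:
    for line in f:
        # Keep only lines that contain something other than whitespace
        if line.strip():
            new_contents.append(line)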

Mukherjee · Accepted Answer · 2021-07-04 20:30:35Z

0

In the latest pandas (v1.3.0 at the time of writing), read_csv has an argument that tells it to skip blank rows. It is enabled by default, but if you want to set it explicitly anyway (e.g. for self-documenting code), just pass it as True. This is from the docs: https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html

skip_blank_lines: bool, default True
If True, skip over blank lines rather than interpreting as NaN values.

So, in your code it is:

df = pd.read_csv(path, sep = ';', skip_blank_lines=True)
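To see what the flag changes, here is a small sketch with made-up in-memory data (the sample content is an assumption). With skip_blank_lines=False the blank line comes back as a row of NaN values, which dropna(how="all") can then remove:

```python
import io

import pandas as pd

# Hypothetical sample with one blank line between the data rows.
raw = "a;b\n1;2\n\n3;4"

# Default behaviour (skip_blank_lines=True): the blank line is ignored.
skipped = pd.read_csv(io.StringIO(raw), sep=";")

# skip_blank_lines=False: the blank line becomes a row of NaN values.
kept = pd.read_csv(io.StringIO(raw), sep=";", skip_blank_lines=False)

# Dropping the all-NaN rows afterwards gives the same rows as the default.
cleaned = kept.dropna(how="all")

print(len(skipped), len(kept), len(cleaned))
```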

answered Jul 4, 2021 at 20:30

Mukherjee

