
I have 4 machines: M1, M2, M3, and M4. The scheduler, client, and one worker run on M1. I've put a CSV file on M1. The rest of the machines are workers.

When I run the program with read_csv in Dask, it gives me a "file not found" error.

1 Answer

When one of your workers tries to load the CSV, it will not be able to find it, because it is not present on that local disc. This should not be a surprise. You can get around this in a number of ways:

  • copy the file to every worker; this is obviously wasteful in terms of disc space, but the easiest to achieve
  • place the file on a networked filesystem (NFS mount, gluster, HDFS, etc.)
  • place the file on an external storage system such as amazon S3 and refer to that location
  • load the data in your local process and distribute it with scatter; in this case the data is presumably small enough to fit in memory, and Dask is probably not doing much for you anyway.
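The last option can be sketched as follows. This is a minimal, self-contained example: an in-process cluster stands in for the real M1–M4 setup, and an inline CSV string stands in for your file; the column names and scheduler address are assumptions, not from the question.

```python
import io

import pandas as pd
from dask.distributed import Client

# In real use this would be Client("tcp://<scheduler-host>:8786");
# an in-process cluster keeps the sketch self-contained.
client = Client(processes=False)

# Read the CSV in the client process, where the file actually exists.
# io.StringIO stands in for pd.read_csv("yourfile.csv").
pdf = pd.read_csv(io.StringIO("x,y\n1,2\n3,4\n5,6\n"))

# scatter ships the DataFrame from the client into worker memory and
# returns a future pointing at the remote copy.
future = client.scatter(pdf)

# Computations submitted against the future run where the data lives,
# not on the client.
total = client.submit(lambda df: int(df["x"].sum()), future).result()
print(total)  # 9

client.close()
```

For anything larger than fits comfortably in client memory, one of the shared-storage options above is the better fit.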

5 Comments

So will a similar problem arise with to_csv? I mean, will the workers write their portions of the computed file on their own machines?
Is it possible in Dask for worker nodes to fetch some of (or the whole) file from the client themselves, when needed?
Yes, you can upload files from the client
Will Dask do it automatically, or do I have to do some configuration?
How would Dask know to do that? Also, note that this is not the intended use of upload_file; you would be better off sorting out your own copies, if copying is the method you want to use.
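For reference, the upload mechanism discussed in these comments is `Client.upload_file`, which is intended for shipping code (.py, .zip, .egg files) to every worker, not data files for read_csv. A minimal sketch, again using an in-process cluster and a hypothetical helpers.py module:

```python
from pathlib import Path

from dask.distributed import Client

# Hypothetical module to ship; in real use this would be code that
# your tasks import, not a data file.
Path("helpers.py").write_text("ANSWER = 42\n")

client = Client(processes=False)

# upload_file copies the file to every worker and makes it importable
# there; it does not distribute data for read_csv.
client.upload_file("helpers.py")

result = client.submit(lambda: __import__("helpers").ANSWER).result()
print(result)

client.close()
```

For data, copying the file yourself (or using shared/remote storage) remains the recommended route.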
