I am using python 2.7 with dask and trying to query a db table from a remote machine to a dask dataframe
I have a multiple column index in the table, and I try to read it using the following script
ddf = dd.read_sql_table("table name", "mysql://user:pass@ip:port/Dbname",spesific column name).head()
And getting the following error
start = asanyarray(start) * 1.0 TypeError: ufunc 'multiply' did not contain a loop with signature matching types dtype('S32')dtype('S32') dtype('S32')
I got the sqlalchemy uri as explained here
i'm not sure what's the problem, when I try to query by another column as the index, and only use the ddf head(), i don't get an error, and when I try to compute the whole ddf i get the same error, i assume it's an issue regarding the column not being of unique values, I don't have a single column index, but multiple column, what is the solution to read the entire table here?
Thanks.
full traceback
> Traceback (most recent call last): File "path", line 28, in <module>
> ddf = dd.read_sql_table("tablename", "mysql://user:pass@ip:port/dbname","indexcolumn") File "file", line
> 123, in read_sql_table
> divisions = np.linspace(mini, maxi, npartitions + 1).tolist() File
> "/home/user/.local/lib/python2.7/site-packages/numpy/core/function_base.py",
> line 108, in linspace
> start = asanyarray(start) * 1.0 TypeError: ufunc 'multiply' did not contain a loop with signature matching types dtype('S32')
> dtype('S32') dtype('S32')
startwhen the error happens.