convert numpy array from object dtype to float

Question

How do I convert the foll. numpy from object dtype to float:

array(['4,364,541', '2,330,200', '2,107,648', '1,525,711', '1,485,231',
       '1,257,500', '1,098,200', '1,065,106', '962,100', '920,200',
       '124,204', '122,320', '119,742', '116,627', '115,900', '108,400',
       '108,400', '108,000', '103,795', '102,900', '101,845', '100,900',
       '100,626'], dtype=object)

I tried arr.astype(float) but that does not work because of , in each string.

hpaulj · Accepted Answer · 2018-07-29 00:02:20Z

2

Yet another way

np.frompyfunc(lambda x: x.replace(',',''),1,1)(arr).astype(float)

frompyfunc returns an object dtype array, which is fine in this case. Often I've found that it is 2x faster than than a list comprehension, but here it times about the same as @coldspeed's:

np.array([v.replace(',', '') for v in arr], dtype=np.float32)

That may be because we are starting with an object dtype array. Direct iteration on an object dtype is a bit slower than iteration on a list, but faster than iteration on a regular numpy array. Like a list, the elements of the array are pointers to strings, and don't require the 'unboxing' that a string dtype array would.

(and 2 to 3 x faster than the np.char version).

answered Jul 29, 2018 at 0:02

hpaulj

233k14 gold badges260 silver badges392 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

cs95 · Accepted Answer · 2018-07-28 23:32:17Z

2

Simple way to do it is remove every comma:

np.array([v.replace(',', '') for v in arr], dtype=np.float32)

If you have pandas, to_numeric is a good option. It gracefully handles any invalid values that may creep in post replacement.

pd.to_numeric([v.replace(',', '') for v in arr], errors='coerce',  downcast='float')

Both methods return a float array as output.

answered Jul 28, 2018 at 23:32

cs95

406k106 gold badges744 silver badges797 bronze badges

Comments

dawg · Accepted Answer · 2018-07-28 23:43:05Z

Given:

>>> ar
array(['4,364,541', '2,330,200', '2,107,648', '1,525,711', '1,485,231',
       '1,257,500', '1,098,200', '1,065,106', '962,100', '920,200',
       '124,204', '122,320', '119,742', '116,627', '115,900', '108,400',
       '108,400', '108,000', '103,795', '102,900', '101,845', '100,900',
       '100,626'], dtype=object)

You can use filter to remove all non-digit elements and create floats:

>>> np.array(list(map(float, (''.join(filter(lambda c: c.isdigit(), s)) for s in ar))))
array([4364541., 2330200., 2107648., 1525711., 1485231., 1257500.,
       1098200., 1065106.,  962100.,  920200.,  124204.,  122320.,
        119742.,  116627.,  115900.,  108400.,  108400.,  108000.,
        103795.,  102900.,  101845.,  100900.,  100626.])

rafaelc · Accepted Answer · 2018-07-28 23:52:52Z

1

Can also use numpy.core.defchararray.replace()

>>> numpy.core.defchararray.replace(arr, ',','').astype(np.float)

array([4364541., 2330200., 2107648., 1525711., 1485231., 1257500.,
       1098200., 1065106.,  962100.,  920200.,  124204.,  122320.,
        119742.,  116627.,  115900.,  108400.,  108400.,  108000.,
        103795.,  102900.,  101845.,  100900.,  100626.])

Or np.char.replace as noted in comments by Cold. Naturally, this package provides is built for arrays of type numpy.string_ or numpy.unicode_

If object type,

replace(a.astype(np.unicode_), ',','').astype(np.float)

edited Jul 28, 2018 at 23:52

answered Jul 28, 2018 at 23:44

rafaelc

59.4k15 gold badges64 silver badges87 bronze badges

2 Comments

cs95 Over a year ago

A shorter alias: np.char.replace will also do the same thing.

hpaulj Over a year ago

That won't work if arr is object dtype. First have to convert it to a string dtype. The char functions essentially iterate on the elements of a string dtype and apply the corresponding string method. My guess is the speed will be similar to iterating on a object dtype array.

Collectives™ on Stack Overflow

convert numpy array from object dtype to float

4 Answers 4

Comments

Comments

Comments

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related