Issue with true division with Numpy arrays

Question

Suppose you have this array:

In [29]: a = array([[10, 20, 30, 40, 50], [14, 28, 42, 56, 70], [18, 36, 54, 72, 90]])

In [30]: a
Out[30]: 
array([[10, 20, 30, 40, 50],
       [14, 28, 42, 56, 70],
       [18, 36, 54, 72, 90]])

Now divide the first row by the third one (using from __future__ import division):

In [32]: a[0]/a[2]
Out[32]: array([ 0.55555556,  0.55555556,  0.55555556,  0.55555556,  0.55555556])

Now do the same with each row in a loop:

In [33]: for i in range(3):
            print a[i]/a[2]   
[ 0.55555556  0.55555556  0.55555556  0.55555556  0.55555556]
[ 0.77777778  0.77777778  0.77777778  0.77777778  0.77777778]
[ 1.  1.  1.  1.  1.]

Everything looks right. But now, assign the result of a[i]/a[2] back to a[i]:

In [35]: for i in range(3):
            a[i]/=a[2]
   ....:     

In [36]: a
Out[36]: 
array([[0, 0, 0, 0, 0],
       [0, 0, 0, 0, 0],
       [1, 1, 1, 1, 1]])

Alright, no problem. Turns out this is by design. Instead, we should do:

In [38]: for i in range(3):
            a[i] = a[i]/a[2]
   ....:     

In [39]: a
Out[39]: 
array([[0, 0, 0, 0, 0],
       [0, 0, 0, 0, 0],
       [1, 1, 1, 1, 1]])

But that doesn't work. Why and how can I fix it?

Thanks in advance.

nneonneo · Accepted Answer · 2012-10-20 05:21:56Z

6

You can cast the whole array to a float array first:

a = a.astype('float')
a /= a[2]
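A minimal sketch of this approach applied to the question's data, assuming Python 3 and NumPy imported as np (the question itself uses Python 2 and a bare array). The divisor copy is an extra precaution added here, not part of the answer.

```python
import numpy as np

# Same data as in the question; NumPy infers an integer dtype here.
a = np.array([[10, 20, 30, 40, 50],
              [14, 28, 42, 56, 70],
              [18, 36, 54, 72, 90]])

# astype returns a new float array; the name `a` is rebound to it.
a = a.astype(float)

# Copy the divisor row so the in-place division never reads a row
# that it is also overwriting (a precaution, assumed for this sketch).
divisor = a[2].copy()
a /= divisor  # broadcasting divides every row by the third row

print(a)
```

Because the array now holds floats, the quotients keep their fractional parts.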

answered Oct 20, 2012 at 5:21


7 Comments

r_31415 Over a year ago

Thanks. Yes, I thought about that but I was wondering about a solution involving the division since it should work as a[i] = a[i]/a[2]

mgilson Over a year ago

In case it's not obvious, this creates a new array. There is no way (that I know of) to do it that modifies the old array in place and changes the type.

nneonneo Over a year ago

It's not possible to do in general since the old and new types may have different sizes, or the old array might be a view into some other array that shouldn't be modified.

mgilson Over a year ago

@nneonneo -- true -- sort of. From a python API perspective, that doesn't matter. ndarray is a wrapper around a c-array (data). In principle, you could just move the pointer that your ndarray has to a new block of data and from a python perspective, you did the operation "in place". e.g. a = array(...); b = a; a.magic_type_convert(float); b.dtype is a.dtype #true. But, I don't know if that operation exists. And provided that views are holding the same reference to the data (and I think they are), that would work too.

r_31415 Over a year ago

@mgilson Actually, I hadn't considered that. However, numpy arrays are still mutable c = array([1]); id(c) returns 32610992. Then c[0] = 2 changes the array to array([2]) and id(c) still returns 32610992, so I thought it could be doing the same by row.


mgilson · Accepted Answer · 2012-10-20 05:43:49Z

4

"Why doesn't this work" -- It doesn't work because a numpy array gets a fixed datatype when it's created. Any value of a different type that you put into that array is cast to the array's type. In other words, when you try to put a float into your integer array, numpy casts the float to an int. The reason is that numpy arrays are designed to hold a single homogeneous type so that they can have optimal performance. Put another way, they're implemented as arrays in C. And in C, you can't have an array where one element is a float and the next is an int. (You can have structs which behave like that, but they're not arrays.)
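To make the casting concrete, here is a small sketch, assuming Python 3 and NumPy imported as np. It shows that the division itself yields floats and that the fractional parts are only lost when the result is stored back into the integer array.

```python
import numpy as np

a = np.array([[10, 20, 30, 40, 50],
              [18, 36, 54, 72, 90]])

quotient = a[0] / a[1]   # true division builds a new floating-point array
print(quotient.dtype)    # a float dtype
print(a.dtype)           # still an integer dtype

a[0] = quotient          # stored values are cast to the array's integer dtype
print(a[0])              # the fractional parts are gone
```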

Another solution (in addition to the one proposed by @nneonneo) is to specify the array as a float array from the beginning:

a = array([[10, 20, 30, 40, 50], [14, 28, 42, 56, 70], [18, 36, 54, 72, 90]], dtype=float)
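As a rough check, the loop from the question can be rerun on an array created with dtype=float. This sketch assumes Python 3 and NumPy imported as np.

```python
import numpy as np

a = np.array([[10, 20, 30, 40, 50],
              [14, 28, 42, 56, 70],
              [18, 36, 54, 72, 90]], dtype=float)

# Row 2 is the divisor; it is overwritten last, so the earlier
# iterations still see its original values.
for i in range(3):
    a[i] = a[i] / a[2]

print(a)
```

With a float array, the row assignment no longer discards anything, so the stored values match what the question's print loop displayed.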

edited Oct 20, 2012 at 5:43

answered Oct 20, 2012 at 5:33


2 Comments

r_31415 Over a year ago

Right. Yes, you're absolutely right, as a.dtype returns dtype('int64'). So once an array is created, it keeps its data type unless it is explicitly changed. Is that it?

mgilson Over a year ago

@RobertSmith -- It keeps its data type no matter what. You can't explicitly change the data type. Doing a.astype(float) actually creates a new ndarray which is of type float.
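The point about astype can be checked directly. A short sketch, assuming Python 3 and NumPy imported as np; the names a and b are only illustrative.

```python
import numpy as np

a = np.array([1, 2, 3])
b = a.astype(float)            # converting to a different dtype allocates a new array

print(a.dtype, b.dtype)        # the original keeps its integer dtype
print(b is a)                  # False: b is a different object
print(np.shares_memory(a, b))  # False: b has its own data buffer
```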

Bi Rico · Accepted Answer · 2012-10-20 05:39:00Z

3

The division isn't the issue; the assignment is, i.e. a[i] = ... (which is also what happens behind the scenes when you do a /= ...). Try this:

>>> a = np.zeros(3, dtype='uint8')
>>> a[:] = [2, -3, 5.9]
>>> print a
[  2 253   5]

When you do intarray[i] = floatarray[i], numpy has to truncate the floating point values to get them to fit into intarray.
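The same truncation can be seen with a signed integer array, which avoids the wrap-around of the negative value in the uint8 example. This is a sketch assuming Python 3 and NumPy imported as np; the explicit np.rint step is an addition for comparison, not something the answer proposes.

```python
import numpy as np

floats = np.array([2.9, -3.7, 5.9])
ints = np.zeros(3, dtype=np.int64)

ints[:] = floats   # the cast on assignment drops the fractional part
print(ints)        # values 2, -3, 5: truncated toward zero, not rounded

# If rounding is what you want, do it explicitly before converting.
rounded = np.rint(floats).astype(np.int64)
print(rounded)     # values 3, -4, 6
```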

answered Oct 20, 2012 at 5:39

