
When calling np.delete(), I don't want to define a new variable for the reduced-size array. I want to perform the delete on the original NumPy array itself. Any thoughts?

>>> arr = np.array([[1,2], [5,6], [9,10]])
>>> arr
array([[ 1,  2],
       [ 5,  6],
       [ 9, 10]])
>>> np.delete(arr, 1, 0)
array([[ 1,  2],
       [ 9, 10]])
>>> arr
array([[ 1,  2],
       [ 5,  6],
       [ 9, 10]])
but I want:
>>> arr
array([[ 1,  2],
       [ 9, 10]])
  • What's wrong with arr = np.delete(arr, 1, 0)? Commented Nov 4, 2016 at 15:47
  • What's wrong with just doing arr = np.delete(arr, 1, 0)? Or you could just call arr without the sections you don't want using brackets? Commented Nov 4, 2016 at 15:47
  • Possible duplicate of deleting rows in numpy array Commented Nov 4, 2016 at 15:50

5 Answers


NumPy arrays are fixed-size, so there can't be an in-place version of np.delete. Any such function would have to change the array's size.

The closest you can get is reassigning the arr variable:

arr = numpy.delete(arr, 1, 0)
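A quick way to see that this only rebinds the name rather than mutating the array (a minimal sketch; `alias` is just an illustrative second reference, not part of the original answer):

```python
import numpy as np

arr = np.array([[1, 2], [5, 6], [9, 10]])
alias = arr                   # a second reference to the same array object

arr = np.delete(arr, 1, 0)    # rebinds the name `arr`; the object is untouched

print(arr.shape)              # the new, smaller array: (2, 2)
print(alias.shape)            # the original array still exists: (3, 2)
```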

1 Comment

Of course it could be done in place. E.g. with an array([1,2,3,4]), deleting the 2nd element would involve moving the 3 to the 2's spot and the 4 to the 3's spot, and finally slicing the array to make its new length 3. This is how std::vector::erase works in C++, for example. It wouldn't be more efficient than np.delete, but it would use less memory, and that can be important.

The np.delete call doesn't modify the original array; it makes a copy and returns that copy with the deletion applied.

>>> arr1 = np.array([[1,2], [5,6], [9,10]])
>>> arr2 = np.delete(arr1, 1, 0)
>>> arr1
array([[ 1,  2],
       [ 5,  6],
       [ 9, 10]])
>>> arr2
array([[ 1,  2],
       [ 9, 10]])



If it's a matter of performance, you might want to try (but test it, since I'm not sure) creating a view* instead of using np.delete. You can do it by indexing, which should be an in-place operation:

import numpy as np

arr = np.array([[1,  2], [5,  6], [9, 10]])
arr = arr[(0, 2), :]
print(arr)

resulting in:

[[ 1  2]
 [ 9 10]]

This, however, will not free the memory occupied by the excluded row. It might increase performance, but memory-wise you might have the same problem or a worse one. Also notice that, as far as I know, there is no way of indexing by exclusion (for instance, arr[~1] would be very useful), which necessarily makes you spend resources building an index array.
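A common workaround for the missing "index by exclusion" is a boolean mask (a standard NumPy idiom, added here for illustration; it is not from the original answer):

```python
import numpy as np

arr = np.array([[1, 2], [5, 6], [9, 10]])

# Build a mask that is True everywhere except the row to drop --
# the closest thing to the hypothetical arr[~1]
mask = np.ones(arr.shape[0], dtype=bool)
mask[1] = False

print(arr[mask])   # rows 0 and 2 only
```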

For most cases I think the suggestion other users have given, namely:

arr = numpy.delete(arr, 1, 0)

is the best option. In some cases it might be worth exploring the other alternative.

EDIT: *This is actually incorrect (thanks @user2357112). Fancy indexing does not create a view but instead returns a copy as can be seen in the documentation (which I should have checked before jumping to conclusions, sorry about that):

Advanced indexing always returns a copy of the data (contrast with basic slicing that returns a view).

So I'm unsure whether the fancy indexing suggestion is worth anything as an actual suggestion, unless it has a performance gain over the np.delete method (which I'll try to verify when the opportunity arises; see EDIT2).
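The view-versus-copy distinction quoted above can be checked directly with np.shares_memory (a small sketch, not part of the original answer):

```python
import numpy as np

arr = np.array([[1, 2], [5, 6], [9, 10]])

basic = arr[0:2, :]       # basic slicing: returns a view
fancy = arr[(0, 2), :]    # advanced (fancy) indexing: returns a copy

print(np.shares_memory(arr, basic))   # True  -- same underlying buffer
print(np.shares_memory(arr, fancy))   # False -- an independent copy
```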

EDIT2: I performed a very simple test to see if there is any performance gain from using fancy indexing as opposed to the delete function. I used timeit (actually the first time I've used it, but it seems the number of executions per snippet is 1 000 000, hence the high numbers for time):

import numpy as np
import timeit

def test1():
    arr = np.array([[1, 2], [5, 6], [9, 10]])
    return arr[(0, 2), :]

def test2():
    arr = np.array([[1, 2], [5, 6], [9, 10]])
    return np.delete(arr, 1, 0)

print("Equality test: ", np.array_equal(test1(), test2()))

print(timeit.timeit("test1()", setup="from __main__ import test1"))
print(timeit.timeit("test2()", setup="from __main__ import test2"))

The results are these:

Equality test:  True
5.43569152576767
9.476918448174644

This represents a very considerable speed gain. Nevertheless, notice that building the sequence for the fancy indexing also takes time. Whether it is worthwhile or not will depend on the problem being solved.

3 Comments

This doesn't actually create a view. Indexing operations classified as advanced indexing, such as what you get with that (0, 2), don't produce views, since they don't produce the consistent strides necessary to create a view.
@user2357112 True. I should have checked the documentation first. My mistake, I'll edit the post. Do you have any idea whether this choice might be faster performance-wise? My suggestion would be pretty useless if it's not.
I think it might avoid some of the overhead numpy.delete has.

You could implement your own version of delete which copies data elements after the elements to be deleted forward, and then returns a view excluding the (now obsolete) last element:

import numpy as np


# in-place delete
def np_delete(arr, obj, axis=None):
    # this is only a simplified example: a single integer index on a 1-D array
    assert isinstance(obj, int)
    assert axis is None

    for i in range(obj + 1, arr.size):
        arr[i - 1] = arr[i]
    return arr[:-1]


Test = 10 * np.arange(10)
print(Test)

deleteIndex = 5
print(np.delete(Test, deleteIndex))
print(np_delete(Test, deleteIndex))
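The element-by-element loop above can also be written block-wise with a single slice assignment (a sketch under the same restrictions: 1-D array, integer index; NumPy buffers overlapping assignments, so the shift is safe):

```python
import numpy as np

def np_delete_blockwise(arr, obj):
    # Shift everything after index `obj` one slot to the left in one assignment,
    # then return a view that excludes the now-stale last element
    arr[obj:-1] = arr[obj + 1:]
    return arr[:-1]

data = 10 * np.arange(10)
print(np_delete_blockwise(data, 5))   # [ 0 10 20 30 40 60 70 80 90]
```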

4 Comments

I use this in an algorithm where a row and column gets deleted in each step. This solution is already faster than using numpy.delete there. Using the @jit decorator from the Numba module on this function makes it even faster still.
@JensRenders It sounds like you are deleting multiple items - I expect you take all deletions into account at once and copy each element only once? Sounds like the implementation I was too lazy to try and post here :) Good to know it's faster in some cases. I guess you also copy data block-wise instead of element by element, as I imply above?
One more thought: is your implementation faster also in the worst case, deleting elements from the beginning?
Yes, shifting blocks makes more sense in the code. After using the @jit decorator, it doesn't make a difference anymore, though. My current code runs faster than numpy.delete in any case, because it doesn't need to allocate new memory. If I time my code plus the allocation of some useless memory, it is about the same speed as numpy.delete.

Nothing is wrong with your code; you just have to overwrite the variable:

    arr = np.array([[1,2], [5,6], [9,10]])
    arr = np.delete(arr, 1, 0)

1 Comment

This is not changing the actual object. You are just pointing the arr name at a new object. See the comments on the other answers.
