Numpy array counter with reset

Question

I have a numpy array with only -1, 1 and 0, like this:

np.array([1,1,-1,-1,0,-1,1])

I would like a new array that counts the -1 encountered. The counter must reset when a 0 appears and remain the same when it's a 1:

Desired output:

np.array([0,0,1,2,0,1,1])

The solution must be very little time consuming when used with larger array (up to 100 000)

Edit: Thanks for your contribution, I've a working solution for now.

I'm still looking for a non-iterative way to solve it (no for loop). Maybe with a pandas Series and the cumsum() method ?

please add how large is target array to your question

Amin Taghikhani
– Amin Taghikhani

2021-12-09 07:11:25 +00:00
Commented Dec 9, 2021 at 7:11 — Amin Taghikhani
– Amin Taghikhani, Commented Dec 9, 2021 at 7:11

tdy · Accepted Answer · 2021-12-09 07:41:22Z

Maybe with a pandas Series and the cumsum() method?

Yes, use Series.cumsum and Series.groupby:

s = pd.Series([1, 1, -1, -1, 0, -1, 1])

s.eq(-1).groupby(s.eq(0).cumsum()).cumsum().to_numpy()
# array([0, 0, 1, 2, 0, 1, 1])

Step-by-step

Create pseudo-groups that reset when equal to 0:

groups = s.eq(0).cumsum()
# array([0, 0, 0, 0, 1, 1, 1])

Then groupby these pseudo-groups and cumsum when equal to -1:

s.eq(-1).groupby(groups).cumsum().to_numpy()
# array([0, 0, 1, 2, 0, 1, 1])

Timings

not time consuming when used with larger array (up to 100,000)

groupby + cumsum is ~8x faster than looping, given np.random.choice([-1, 0, 1], size=100_000):

%timeit series_cumsum(a)
# 3.29 ms ± 721 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

%timeit miki_loop(a)
# 26.5 ms ± 925 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

%timeit skyrider_loop(a)
# 26.8 ms ± 1.36 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

Fatemeh Sangin · Accepted Answer · 2021-12-09 10:11:42Z

1

Let's first save your numpy array in a variable:

a = np.array([1,1,-1,-1,0,-1,1])

I define a variabel, count to hold the value you care about, and set it to be zero. Then I define a list to hold the new elements. Let's call it l. Then I iterate on elemnts of a and in each ieration I name the element i. Inside each iteration, I implement the logic:

if i is -1, then increase counter
else, if i is 0, reset the counter
and do nothing otherwise And finally, I append the counter to l. Lastly, convert l to be a numpy array, out.

l = []
count = 0
for i in a:
    if i == -1:
        count+=1
    elif i==0: 
        count = 0
    l.append(count)
out = np.array(l)
out

edited Dec 9, 2021 at 10:11

answered Dec 9, 2021 at 7:46

Fatemeh Sangin

5511 gold badge5 silver badges21 bronze badges

2 Comments

ppwater Over a year ago

While this code may answer the question, including an explanation of how or why this solves the problem would really help to improve the quality of your post. Remember that you are answering the question for readers in the future, not just the person asking now. Please edit your answer to add explanations and give an indication of what limitations and assumptions apply.

Fatemeh Sangin Over a year ago

dear @ppwater, Is it better now?

hilberts_drinking_problem · Accepted Answer · 2021-12-09 11:50:29Z

I seem to get a 10x speedup over Pandas solution with numba for this benchmark:

from numba import jit

inp1 = np.array([1,1,-1,-1,0,-1,1], dtype=int)
inp2 = np.random.randint(-1, 10, size=10**6)

@jit
def with_numba(arr):
  val = 0
  put = np.zeros_like(arr)
  for i in range(arr.size):
    if arr[i] == -1:
      val += 1
    elif arr[i] == 0:
      val = 0
    put[i] = val

  return put

def with_pandas(inp):
  s = pd.Series(inp)
  return s.eq(-1).groupby(s.eq(0).cumsum()).cumsum().to_numpy()
  
assert (with_numba(inp1) == with_pandas(inp1)).all()
assert (with_numba(inp2) == with_pandas(inp2)).all()

%timeit with_numba(inp2)
# 100 loops, best of 5: 4.57 ms per loop
%timeit with_pandas(inp2)
# 10 loops, best of 5: 46.3 ms per loop

Skyrider Feyrs · Accepted Answer · 2021-12-09 06:37:06Z

0

Use a for loop. Set a variable which starts at 1 and reset it each time you encounter a different number. For example:

counter = 1;
outputArray = [];
for number in npArray:
    if number == -1:
        outputArray.append(counter)
        counter += 1
    elif number == 1:
        outputArray.append(0)
    else:
        outputArray.append(0)
        counter = 1
print(outputArray)

edited Dec 9, 2021 at 6:37

answered Dec 9, 2021 at 6:26

Skyrider Feyrs

921 silver badge16 bronze badges

6 Comments

Lénis Parge Over a year ago

Your solution won't work. When 1 is encounterd the counter must be constant but your solution will append a new 0 in the outputArray.

Skyrider Feyrs Over a year ago

If that's a problem, please edit the question to include that.

Miki Over a year ago

This code won't work if npArray is like [-1,-1,1,-1,-1] the output will be [1, 2, 0, 1, 2] but it must be [1,2,0,3,4] if I get the question right

Skyrider Feyrs Over a year ago

Now I get it. I'll edit :)

Lénis Parge Over a year ago

Yes that's right @Miki and thanks Skyrider

|

Miki · Accepted Answer · 2021-12-09 07:03:33Z

0

Here is a fix for @skyrider's code

npArray = [1,1,-1,-1,0,-1,1]
counter = 0
outputArray = []
for number in npArray:
    if number == -1:
        counter += 1
        outputArray.append(counter)
    elif number == 0:
        outputArray.append(0)
        counter = 0
    else:
        outputArray.append(counter)
print(outputArray)

edited Dec 9, 2021 at 7:03

answered Dec 9, 2021 at 6:37

Miki

2122 silver badges12 bronze badges

5 Comments

Lénis Parge Over a year ago

The problem is when 1 is encountered in the middle: the counter must be constant but your solution will append a new 0 in the outputArray instead

Miki Over a year ago

what do you mean by constant your mean like when 1 is encountered in the middle it should not include it or...

Lénis Parge Over a year ago

What i mean is when 1 is encountered the chain must append the current state of the counter. np.array([1,1,-1,-1,0,-1,1]) becomes: np.array([0,0,1,2,0,1,1])

Lénis Parge Over a year ago

It's more like a cumsum with reset when 0 appears

Miki Over a year ago

Ok so I think it's fixed now check it

Collectives™ on Stack Overflow

Numpy array counter with reset

5 Answers 5

Step-by-step

Timings

Comments

2 Comments

Comments

6 Comments

5 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Step-by-step

Timings

Comments

2 Comments

Comments

6 Comments

5 Comments

Your Answer

Sign up or log in

Post as a guest

Related