Python given an array A of N integers, returns the smallest positive integer (greater than 0) that does not occur in A in O(n) time complexity

Question

For example:

input: A = [ 6 4 3 -5 0 2 -7 1 ]

output: 5

Since 5 is the smallest positive integer that does not occur in the array.

I have written two solutions to that problem. The first one is good but I don't want to use any external libraries + its O(n)*log(n) complexity. The second solution "In which I need your help to optimize it" gives an error when the input is chaotic sequences length=10005 (with minus).

Solution 1:

from itertools import count, filterfalse 


def minpositive(a):
    return(next(filterfalse(set(a).__contains__, count(1))))

Solution 2:

def minpositive(a):
    count = 0
    b = list(set([i for i in a if i>0]))
    if min(b, default = 0)  > 1 or  min(b, default = 0)  ==  0 :
        min_val = 1
    else:
        min_val = min([b[i-1]+1 for i, x in enumerate(b) if x - b[i - 1] >1], default=b[-1]+1)
        
    return min_val

Note: This was a demo test in codility, solution 1 got 100% and solution 2 got 77 %.
Error in "solution2" was due to:
Performance tests -> medium chaotic sequences length=10005 (with minus) got 3 expected 10000
Performance tests -> large chaotic + many -1, 1, 2, 3 (with minus) got 5 expected 10000

I think you're assuming list(set(a)) is sorted but it isn't. It's not clear what you're asking -- are you asking for working code? — Paul Hankin
– Paul Hankin, Commented Mar 11, 2018 at 19:27
Both are working but I am looking for a way to optimize that code to make work with O(n) time complexity "as stated in my question". — user8358337
– user8358337, Commented Mar 11, 2018 at 19:36
ThanksPaul for the hint "I think you're assuming list(set(a)) ". It will not impact my second code. I will use sorted in the future. — user8358337
– user8358337, Commented Mar 11, 2018 at 19:53
Before adding yet another answer to this question, please remember that if you have to sort, it's already not O(N). — General Grievance
– General Grievance, Commented Dec 1, 2022 at 21:41

Peter de Rivaz · Accepted Answer · 2018-03-11 22:23:33Z

83

Testing for the presence of a number in a set is fast in Python so you could try something like this:

def minpositive(a):
    A = set(a)
    ans = 1
    while ans in A:
       ans += 1
    return ans

answered Mar 11, 2018 at 22:23

Peter de Rivaz

33.6k4 gold badges51 silver badges80 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

avizzzy Over a year ago

Also this takes care of any possible duplicates that could be present in the given list.

Extreme_Tough Over a year ago

Though a set is generally faster than a list, I disagree converting list to set in this case. Conversion from list still needs iterating over all the N elements. If you don't convert, it takes iterating only an average of N/2 elements to find the number

Dimdimy4 · Accepted Answer · 2021-02-10 22:46:31Z

7

Fast for large arrays.

def minpositive(arr):
    if 1 not in arr: # protection from error if ( max(arr) < 0 )
        return 1
    else:
        maxArr = max(arr) # find max element in 'arr'
        c1 = set(range(2, maxArr+2)) # create array from 2 to max
        c2 = c1 - set(arr) # find all positive elements outside the array
        return min(c2)

edited Feb 10, 2021 at 22:46

answered Feb 10, 2021 at 22:37

Dimdimy4

811 silver badge5 bronze badges

Comments

mks · Accepted Answer · 2022-07-12 22:58:11Z

4

I have an easy solution. No need to sort.

def solution(A):
    s = set(A)
    m = max(A) + 2
    for N in range(1, m):
        if N not in s:
            return N
    return 1

Note: It is 100% total score (Correctness & Performance)

answered Jul 12, 2022 at 22:58

mks

413 bronze badges

Comments

mseromenho · Accepted Answer · 2021-12-16 08:44:03Z

3

def minpositive(A):
    """Given an list A of N integers, 
    returns the smallest positive integer (greater than 0) 
    that does not occur in A in O(n) time complexity

        Args:
            A: list of integers
        Returns:
            integer: smallest positive integer

        e.g:
            A = [1,2,3]
            smallest_positive_int = 4
    """
    len_nrs_list = len(A)
    N = set(range(1, len_nrs_list+2))
    
    return min(N-set(A)) #gets the min value using the N integers

edited Dec 16, 2021 at 8:44

answered May 14, 2021 at 22:46

mseromenho

1731 silver badge8 bronze badges

1 Comment

joanis Over a year ago

You can get rid of the try/except if you start with N = set(range(1, len_nrc_list+2)).

sandrows · Accepted Answer · 2022-03-26 15:56:44Z

1

This solution passes the performance test with a score of 100%

def solution(A):
    n = sorted(i for i in set(A) if i > 0)  # Remove duplicates and negative numbers
    if not n:
        return 1
    ln = len(n)

    for i in range(1, ln + 1):
        if i != n[i - 1]:
            return i

    return ln + 1

answered Mar 26, 2022 at 15:56

sandrows

3032 silver badges12 bronze badges

Comments

Lomtrur · Accepted Answer · 2020-01-17 11:51:07Z

0

def solution(A):
    B = set(sorted(A))
    m = 1
    for x in B:
        if x == m:
            m+=1
    return m

edited Jan 17, 2020 at 11:51

Lomtrur

1,8172 gold badges23 silver badges43 bronze badges

answered Jan 17, 2020 at 10:05

Najeeb Jebreel

251 bronze badge

4 Comments

Kevin Over a year ago

Please provide an explanation to explain how your code fixed the problem.

avizzzy Over a year ago

OP states the time complexity here should be linear time, answer provided here is quadratic time.

Davi Lima Over a year ago

@avizzzy, isn't it O(n * log n)? And it could be O(n) if the sorted call is removed, after all sets are unordered.

joanis Over a year ago

Python sets don't preserve the order in which items were inserted, so this solution should not work. In practice, it seems Python sorts sets of ints up for fairly large max values, but that's probably a consequence of how sets are implemented, it is now a feature you can count on.

nirojshrestha019 · Accepted Answer · 2021-08-27 02:09:55Z

0

Continuing on from Niroj Shrestha and najeeb-jebreel, added an initial portion to avoid iteration in case of a complete set. Especially important if the array is very large.

def smallest_positive_int(A):
  sorted_A = sorted(A)
  last_in_sorted_A = sorted_A[-1]
  #check if straight continuous list
  if len(sorted_A) == last_in_sorted_A:
    return last_in_sorted_A + 1
  else:
    #incomplete list, iterate to find the smallest missing number
    sol=1
    for x in sorted_A:
        if x == sol:
          sol += 1
        else:
          break
    return sol

A = [1,2,7,4,5,6]
print(smallest_positive_int(A))

edited Aug 27, 2021 at 2:09

nirojshrestha019

3,0691 gold badge14 silver badges16 bronze badges

answered May 17, 2021 at 22:45

ips1 38

3193 silver badges5 bronze badges

1 Comment

joanis Over a year ago

This solution does not work if there are duplicates in A.

joanis · Accepted Answer · 2021-12-14 16:30:49Z

0

This question doesn't really need another answer, but there is a solution that has not been proposed yet, that I believe to be faster than what's been presented so far.

As others have pointed out, we know the answer lies in the range [1, len(A)+1], inclusively. We can turn that into a set and take the minimum element in the set difference with A. That's a good O(N) solution since set operations are O(1).

However, we don't need to use a Python set to store [1, len(A)+1], because we're starting with a dense set. We can use an array instead, which will replace set hashing by list indexing and give us another O(N) solution with a lower constant.

def minpositive(a):
    # the "set" of possible answer - values_found[i-1] will tell us whether i is in a
    values_found = [False] * (len(a)+1)
    # note any values in a in the range [1, len(a)+1] as found
    for i in a:
        if i > 0 and i <= len(a)+1:
            values_found[i-1] = True
    # extract the smallest value not found
    for i, found in enumerate(values_found):
        if not found:
            return i+1

We know the final for loop always finds a value that was not marked, because it has one more element than a, so at least one of its cells was not set to True.

answered Dec 14, 2021 at 16:30

joanis

13k23 gold badges38 silver badges50 bronze badges

3 Comments

mseromenho Over a year ago

Not sure if it was intended with this solution, but if you have an integer list like e.g: a = [2,3,2], it returns 2 values -> 1, 4

joanis Over a year ago

@mseromenho I'm not sure what you mean. I just tested print(minpositive([2,3,2])) and it gave me 1.

mseromenho Over a year ago

my bad, I was testing the code wrongly, I was printing directly from the for loop! Good solution!

Tanishq Sharma · Accepted Answer · 2022-03-10 20:39:57Z

0

def check_min(a):
    x= max(a)
    if x-1 in a:
        return x+1
    elif x <= 0:
        return 1
    else:
        return x-1

Correct me if i'm wrong but this works for me.

answered Mar 10, 2022 at 20:39

Tanishq Sharma

1

1 Comment

joanis Over a year ago

I really don't understand how this answers the question.

Stephen · Accepted Answer · 2022-03-11 05:20:42Z

0

def solution(A):
clone = 1
A.sort()
for itr in range(max(A) + 2):
    if itr not in A and itr >= 1:
        clone = itr
        break
return clone

print(solution([2,1,4,7]))

#returns 3

edited Mar 11, 2022 at 5:20

answered Mar 11, 2022 at 4:44

Stephen

174 bronze badges

2 Comments

joanis Over a year ago

This is a quadratic time solution: itr not in A takes linear time, and you might run it a linear number of times. It may look simple, but it really does not answer the question, which is to solve this problem is O(n) time complexity.

General Grievance Over a year ago

If you have to sort, it's already > O(n).

Rubén Vázquez · Accepted Answer · 2022-11-10 19:21:29Z

0

def solution(A):
    n = 1
    for i in A:
        if n in A:
            n = n+1 
        else:
            return n
    return n

answered Nov 10, 2022 at 19:21

Rubén Vázquez

435 bronze badges

1 Comment

General Grievance Over a year ago

While for i in A should work as expected, it semes redundant to define both i and n. More importantly, this is O(n^2) performance. if n in A is O(n), and you are doing this up to n times.

abhinav singh · Accepted Answer · 2022-12-01 09:08:42Z

0

def not_in_A(a):
    a=sorted(a)
    if max(a)<1:
        return(1)
    for i in range(0,len(a)-1):
        if a[i+1]-a[i]>1:
            out=a[i]+1
            if out==0 or out<1:
                continue
            return(out)
            
    return(max(a)+1)

answered Dec 1, 2022 at 9:08

abhinav singh

11 bronze badge

1 Comment

General Grievance Over a year ago

sorted is automatically not O(N)...

王毅钧 · Accepted Answer · 2022-12-01 09:11:04Z

0

mark and then find the first one that didn't find

nums = [ 6, 4, 3, -5, 0, 2, -7, 1 ]
def check_min(nums):
    marks = [-1] * len(nums)
    for idx, num in enumerate(nums):
        if num >= 0:
            marks[num] = idx
    for idx, mark in enumerate(marks):
        if mark == -1:
            return idx
    return idx + 1

answered Dec 1, 2022 at 9:11

王毅钧

162 bronze badges

Comments

nirojshrestha019 · Accepted Answer · 2021-08-26 20:47:53Z

-1

I just modified the answer by @najeeb-jebreel and now the function gives an optimal solution.

def solution(A):
    sorted_set = set(sorted(A))
    sol = 1
    for x in sorted_set:
        if x == sol:
            sol += 1
        else:
            break
    return sol

edited Aug 26, 2021 at 20:47

answered Mar 4, 2021 at 19:37

nirojshrestha019

3,0691 gold badge14 silver badges16 bronze badges

Comments

Ankit Arora · Accepted Answer · 2021-09-19 11:05:38Z

-1

I reduced the length of set before comparing

a=[1,222,3,4,24,5,6,7,8,9,10,15,2,3,3,11,-1]
#a=[1,2,3,6,3]
def sol(a_array):
    a_set=set()
    b_set=set()
    cnt=1
    for i in a_array:

        #In order to get the greater performance
        #Checking if element is greater than length+1 
        #then it can't be output( our result in solution)
        
        if i<=len(a) and i >=1:
            
            a_set.add(i) # Adding array element in set 
            b_set.add(cnt) # Adding iterator in set
            cnt=cnt+1
    b_set=b_set.difference(a_set)
    if((len(b_set)) > 1): 
        return(min(b_set))
    else:
        return max(a_set)+1

sol(a)

answered Sep 19, 2021 at 11:05

Ankit Arora

1

Comments

Ravi · Accepted Answer · 2021-12-15 13:42:45Z

-1

def solution(A):
    nw_A = sorted(set(A))
    if all(i < 0 for i in nw_A):
        return 1
    else:
        ans = 1
        while ans in nw_A:
            ans += 1
            if ans not in nw_A:
                return ans

For better performance if there is a possibility to import numpy package.

def solution(A):
    import numpy as np
    nw_A = np.unique(np.array(A))
    if np.all((nw_A < 0)):
        return 1
    else:
        ans = 1
        while ans in nw_A:
            ans += 1
            if ans not in nw_A:
                return ans

edited Dec 15, 2021 at 13:42

answered Dec 13, 2021 at 6:32

Ravi

12 bronze badges

1 Comment

joanis Over a year ago

You're sorting the array up to N times, so your time complexity is O(n^2 * log n), a long way off the mark!

Olu · Accepted Answer · 2022-03-29 17:36:23Z

-1

def solution(A):
# write your code in Python 3.6
min_num = float("inf")
set_A = set(A)
# finding the smallest number
for num in set_A:
    if num < min_num:
        min_num = num
# print(min_num)

#if negative make positive 
if min_num < 0 or min_num == 0:
    min_num = 1
# print(min_num)

# if in set add 1 until not 
while min_num in set_A:  
    min_num += 1
return min_num

Not sure why this is not 100% in correctness. It is 100% performance

answered Mar 29, 2022 at 17:36

Olu

1

Comments

Tomerikoo · Accepted Answer · 2021-08-04 19:46:18Z

-2

def solution(A):
    arr = set(A)
    N = set(range(1, 100001))
    while N in arr:
       N += 1
    return min(N - arr)

solution([1, 2, 6, 4])
#returns 3

edited Aug 4, 2021 at 19:46

Tomerikoo

19.6k16 gold badges57 silver badges68 bronze badges

answered Aug 4, 2021 at 17:23

O. Falana

11 bronze badge

Collectives™ on Stack Overflow

Python given an array A of N integers, returns the smallest positive integer (greater than 0) that does not occur in A in O(n) time complexity

18 Answers 18

2 Comments

Comments

Comments

1 Comment

Comments

4 Comments

1 Comment

3 Comments

1 Comment

2 Comments

1 Comment

1 Comment

Comments

Comments

Comments

1 Comment

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

18 Answers 18

2 Comments

Comments

Comments

1 Comment

Comments

4 Comments

1 Comment

3 Comments

1 Comment

2 Comments

1 Comment

1 Comment

Comments

Comments

Comments

1 Comment

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related