
Sorry if this is a duplicate, first question here...

I want to operate on a large array of structs called notes, but I don't want to operate on every element of notes. I'm trying to use an int array (int[]) as a filter of indices to skip, as shown in the code below.

Note[] notes = new Note[]
{ 
   // Struct stuff ... 
};

int[] filter = new int[]{ 4,20,50,367... };

for (int i = 0; i < notes.Length; i++)
{
     bool flag = false;
     for (int j = 0; j < filter.Length; j++)
     {
          if (i == filter[j])
          {
               flag = true;
               break;
          }
      }

      if (flag) continue;
      // Do something on notes[i]
}

The problem is that this code will (I think) run really slowly as both the notes array and the filter array grow. So, is there a better and faster way to do this? Note that the size of filter can be anything, depending on other conditions.

2 Comments
  • Typo? Do you mean if (notes[i] == filter[j]) instead of if (i == filter[j])? Commented Apr 24, 2019 at 7:45
  • @DmitryBychenko No, not a typo; I'm using the index to filter notes[i]. But if it were if (notes[i] == filter[j]), shouldn't that cause an error, since it compares a struct to an int? Commented Apr 24, 2019 at 7:53

3 Answers


We can get rid of the inner loop with the help of a HashSet<int>, giving a better O(|filter| + |notes|) time complexity instead of the initial O(|filter| * |notes|):

Note[] notes = new Note[] { 
  ... //Struct stuff 
};

int[] filter = new int[] { 
  4, 20, 50, 367... 
};

HashSet<int> toExclude = new HashSet<int>(filter);

for (int i = 0; i < notes.Length; i++) {
  if (toExclude.Contains(i)) // O(1) time complexity 
    continue;

  //Do something on notes[i] 
}

3 Comments

Do you really claim HashSet.Contains() has a complexity of O(1)? It will only perform better on larger sets; it will still trend somewhere towards log n.
@Raul Sebastian: it depends on the GetHashCode implementation; Microsoft's is good enough if the items in filter are random. Sure, an adversary can craft items such that we get a lot of hash collisions and an inefficient toExclude.Contains.
@Dmitry Bychenko Thanks for making me look it up. It really seems that HashSet lookup complexity stays constant on average, while in the worst case it would be linear.

You could filter the notes using LINQ like this:

Note[] notes = new Note[]{ ...//Struct stuff };
int[] filter = new int[]{ 4,20,50,367... };

var filteredNotes = notes.Where((note, index) => !filter.Contains(index)).ToList();

foreach(var note in filteredNotes)
{
//Do something on note
}

You would need to test the performance though, as LINQ tends to be slow in specific circumstances; in particular, filter.Contains here is a linear scan over the array for every note, so this is still O(|notes| * |filter|).
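A minimal sketch (not from the original answer) of combining the same LINQ query with a HashSet<int>, assuming, as in the question, that filter holds indices to skip:

var toSkip = new HashSet<int>(filter);                 // O(|filter|) to build

var filteredNotes = notes
    .Where((note, index) => !toSkip.Contains(index))   // O(1) average lookup per index
    .ToList();

foreach (var note in filteredNotes)
{
    // Do something on note
}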

4 Comments

Ermm, actually, this is exactly the opposite of what I want to do, since my filter means "skip this"
Sorry, but it seems that we should iterate over all indexes except those in filter (i.e. 0, 1, 2, 3, 5, ..., 19, 21, ...). Please note the if (flag) continue; in the code provided
@TheSorrowRaven sorry, I misunderstood you then. Let me edit it
The final materialization .ToList() is redundant: foreach will do fine with an IEnumerable<Note>

You can loop over the filter array and create a new boolean array in which every index you want to skip is set to true.

bool[] filterArray = new bool[notes.Length];
foreach (var index in filter)
{
   if (index < filterArray.Length)
       filterArray[index] = true;
}

Then you just have to check this array at each index.

for (int i = 0; i < notes.Length; i++)
{
     if (!filterArray[i])
     {
          // Do something on notes[i]
     }
}

The complexity of this code will be O(m + n*X), where m is the length of the filter array, n is the length of the notes array, and X is the complexity of your operation on notes[i]. Assuming m < n, this is effectively O(n*X).

Your current complexity is O(m*n*X).
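
For completeness, here is a minimal self-contained sketch of this lookup-array approach, using a hypothetical Note struct and a placeholder operation (the real struct isn't shown in the question):

using System;

struct Note
{
    public int Pitch; // hypothetical field; the real Note struct is not shown
}

static class Program
{
    static void Main()
    {
        Note[] notes = { new Note { Pitch = 60 }, new Note { Pitch = 62 }, new Note { Pitch = 64 } };
        int[] filter = { 1 }; // skip notes[1]

        // Mark every index that should be skipped.
        bool[] skip = new bool[notes.Length];
        foreach (var index in filter)
        {
            if (index < skip.Length)
                skip[index] = true;
        }

        for (int i = 0; i < notes.Length; i++)
        {
            if (skip[i])
                continue;

            // Placeholder operation on notes[i].
            Console.WriteLine(notes[i].Pitch);
        }
    }
}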

Comments
