How to remove array elements equal to some element in a second array in Perl

Question

Given two arrays, A and B, how do I remove the elements of A that also occur in B? What is the most efficient way of doing this?

As a special case, what if B is the result of a grep on A? In that case we can of course grep on the negated condition. But is there something in Perl like taking the complement of one array with respect to another?

Thank you.

As a special case, if the two arrays are sorted, you can do a more efficient differencing operation. But it doesn't seem like that's what you're after.
– Mike Sokolov, Commented Sep 23, 2011 at 1:25
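
The sorted case mentioned in this comment might look like the sketch below: a single pass with a cursor into B, and no hash. The helper name sorted_minus is hypothetical, and the sketch assumes both arrays are already sorted in ascending numeric order.

```perl
use strict;
use warnings;

# Hypothetical helper: both inputs must already be sorted in ascending numeric order.
sub sorted_minus {
    my ($a_ref, $b_ref) = @_;
    my @out;
    my $j = 0;    # cursor into @$b_ref; it only ever moves forward
    for my $x (@$a_ref) {
        # Skip every element of B that is smaller than the current element of A.
        $j++ while $j < @$b_ref && $b_ref->[$j] < $x;
        # Keep $x unless B holds an equal element at the cursor.
        push @out, $x unless $j < @$b_ref && $b_ref->[$j] == $x;
    }
    return @out;
}

my @diff = sorted_minus([1 .. 9], [2, 4, 6, 8]);
print "@diff\n";    # 1 3 5 7 9
```

Because the cursor never moves backwards, each array is walked once.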

Eric Strom · Accepted Answer · 2011-09-23 01:13:45Z

8

Any time you are thinking of "found in", you are probably looking for a hash. In this case, you would create a hash of your B values. Then you would grep A, checking the hash for each element.

use feature 'say';   # say() needs Perl 5.10 or later

my @A = 1..9;
my @B = (2, 4, 6, 8);
my %B = map {$_ => 1} @B;   # lookup table: one key per element of @B

say join ' ' => grep {not $B{$_}} @A; # 1 3 5 7 9

As you can see, Perl does not maintain any kind of "found in" table by itself, so you have to provide one. The above code could easily be wrapped in a function, but for efficiency it is best done inline.
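
As a rough sketch of the wrapped-up version, the lookup hash and the grep can live in one subroutine. The name array_minus and the choice to pass array references are assumptions made for illustration; they are not part of the answer above.

```perl
use strict;
use warnings;

# Hypothetical helper: returns the elements of @$a_ref that do not occur in
# @$b_ref, preserving the order (and any duplicates) of @$a_ref.
sub array_minus {
    my ($a_ref, $b_ref) = @_;
    my %in_b = map { $_ => 1 } @$b_ref;    # lookup table, built once
    return grep { !$in_b{$_} } @$a_ref;    # keep what is not in the table
}

my @A = 1 .. 9;
my @B = (2, 4, 6, 8);
my @diff = array_minus(\@A, \@B);
print "@diff\n";    # 1 3 5 7 9
```

Hash keys are strings, so elements are compared by their string form; 1 and "1" count as the same element, while 1.0 written as the string "1.0" does not match "1".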

answered Sep 23, 2011 at 1:13

Eric Strom


6 Comments

David W. Over a year ago

mod +1. The map itself is pretty impressive, but the use of grep is amazing. It took me a while to realize what it was doing. I was wondering what you were grepping before realizing that you were not actually using grep to match the line. If the statement not $B{$_} is true (and it will be for all keys not in %B), the value of $_ is kept in the array that the grep command returns.

Qiang Li Over a year ago

I like this, but what if I also want to main the order in @A after deleting all elements in @B? Is there anything in perl like LinkedHashMap in java?

ikegami Over a year ago

@Qiang Li, His code does maintain the order of the elements in @A. (Assuming main = maintain)

ikegami Over a year ago

@Qiang Li, Tie::IxHash is one way of creating an ordered associative array (like LinkedHashMap), but I don't see what that has to do with your question or this solution.

Hynek -Pichi- Vychodil Over a year ago

@Zaid: There will not be any autovivification in this case. You should try it before you write your comments. You don't know Perl as much as you expect.


RET · Answer · 2011-09-22 23:19:46Z

3

Have a look at the none, all, part, and notall functions available via List::MoreUtils. You can perform pretty much any set operation using the functions in this module.

There's a good tutorial available at Perl Training Australia
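
One way to apply none to this question is sketched below. To avoid a non-core dependency, the sketch swaps in the none exported by the core List::Util module (available from List::Util 1.33), which is called the same way as the List::MoreUtils function: a block followed by a list.

```perl
use strict;
use warnings;
use List::Util qw(none);    # core module; none() needs List::Util 1.33 or later

my @A = 1 .. 9;
my @B = (2, 4, 6, 8);

# Inside none's block, $_ is the element of @B being tested, so the element
# of @A is copied into $x first.
my @diff = grep { my $x = $_; none { $_ == $x } @B } @A;
print "@diff\n";    # 1 3 5 7 9
```

Each call to none scans @B, so this does the work of a nested loop rather than a hash lookup.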

answered Sep 22, 2011 at 23:19

RET


Hynek -Pichi- Vychodil · Answer · edited 2017-05-23 12:30:55Z

1

If you are asking for the most efficient way:

my @A = 1..9;
my @B = (2, 4, 6, 8);

my %x;
@x{@B} = ();   # hash slice: one key per element of @B, all values undef
my @AminusB = grep !exists $x{$_}, @A;   # test for the key, not the value

But you will notice a difference between my solution and Eric Strom's only for bigger inputs.

You may find this functional approach handy:

sub complementer {
  my %x;
  @x{@_} = ();
  return sub { grep !exists $x{$_}, @_ };
}

my $c = complementer(2, 4, 6, 8);

print join(',', $c->(@$_)), "\n" for [1..9], [2..10], ...;

# you can use it directly of course
print join(' ', complementer(qw(a c e g))->('a'..'h')), "\n";

edited May 23, 2017 at 12:30


answered Sep 23, 2011 at 13:17

Hynek -Pichi- Vychodil


1 Comment

ikegami Over a year ago

It's not any more effective than the other working solutions (by definition). Maybe you meant efficient?

oylenshpeegul · Answer · edited 2017-05-23 11:44:38Z

0

You're probably better off with the hash, but you could also use smart matching. Stealing Eric Strom's example,

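use feature 'say';   # say() needs Perl 5.10 or later

my @A = 1..9;
my @B = (2, 4, 6, 8);

# ~~ (smartmatch) has been experimental since Perl 5.18 and deprecated since 5.38
say join ' ' => grep {not $_ ~~ @B } @A; # 1 3 5 7 9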

edited May 23, 2017 at 11:44


answered Sep 23, 2011 at 3:10

oylenshpeegul


1 Comment

ikegami Over a year ago

This doesn't scale nearly as well as Eric Strom's. His solution is worst case Θ(A+B), but yours is Θ(A*B)

oylenshpeegul · Answer · edited 2017-05-23 12:23:10Z

0

Again, you're probably better off with the hash, but you could also use Perl6::Junction. Again stealing Eric Strom's example,

use feature 'say';   # say() needs Perl 5.10 or later
use Perl6::Junction qw(none);

my @A = 1..9;
my @B = (2, 4, 6, 8);

say join ' ' => grep {none(@B) == $_} @A; # 1 3 5 7 9

edited May 23, 2017 at 12:23


answered Sep 23, 2011 at 12:08

oylenshpeegul


David W. · Answer · edited 2017-05-23 11:52:23Z

-1

As already mentioned by Eric Strom, whenever you need to search for something specific, it's always easier if you have a hash.

Eric has a nicer solution, but it can be difficult to understand. I hope mine is easier to follow.

# Create a B Hash

my %BHash;
foreach my $element (@B) {
   $BHash{$element} = 1;
}

# Go through @A element by element and delete duplicates

my $index = 0;
foreach my $element (@A) {
   if (exists $BHash{$element}) { 
      splice @A, $index, 1;    #Deletes $A[$index]
      $index = $index + 1;
   }
}

In the first loop, we simply create a hash that is keyed by the elements in @B.

In the second loop, we go through each element in @A, while keeping track of the index in @A.

edited May 23, 2017 at 11:52


answered Sep 23, 2011 at 3:06

David W.


3 Comments

ikegami Over a year ago

It fails because it modifies the array over which it iterates. And all that extra complexity detracts from its readability.

David W. Over a year ago

@ikegami: Would it make it more readable to copy the array over to a new array, then rename it back to the original one? This is not the way I'd do it. I was trying for readability.

ikegami Over a year ago

You should be worrying about making it work, first. my @C; for my $e (@A) { push @C, $e if !$B{$e}; } @A = @C; would make it work, but it's a really complicated way to do @A = grep { !$B{$_} } @A;.
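
For readers who prefer an explicit loop, here are two variants that avoid changing the array during a forward iteration. This is a sketch under stated assumptions: it reuses the %BHash name from the answer, while @kept and the reverse-index walk are illustrative choices, not something proposed in the thread.

```perl
use strict;
use warnings;

my @A = 1 .. 9;
my @B = (2, 4, 6, 8);

my %BHash;
$BHash{$_} = 1 for @B;

# Variant 1: collect the survivors in a new array and leave @A alone.
my @kept;
for my $element (@A) {
    push @kept, $element unless exists $BHash{$element};
}

# Variant 2: splice in place, walking the indices from the end so that a
# removal never shifts an element that has not been visited yet.
for my $index (reverse 0 .. $#A) {
    splice @A, $index, 1 if exists $BHash{ $A[$index] };
}

print "@kept\n";    # 1 3 5 7 9
print "@A\n";       # 1 3 5 7 9
```

Both variants keep the remaining elements in their original order.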
