Perl regex replace first name last name with first name last initial

Question

I want to have the output of $var below to be John D

my $var = "John Doe";

I have tried $var =~ s/(.+\b.).+\z],'\1.'//g;

and what do you want if it is "Lester del Rey"?

ysth
– ysth

2013-06-13 15:48:53 +00:00
Commented Jun 13, 2013 at 15:48 — ysth
– ysth, Commented Jun 13, 2013 at 15:48
Not really an issue here could simply be Lester D then

user1754493
– user1754493

2013-06-13 17:51:55 +00:00
Commented Jun 13, 2013 at 17:51 — user1754493
– user1754493, Commented Jun 13, 2013 at 17:51

Matt Ritter · Accepted Answer · 2013-06-13 20:17:48Z

3

Here's a general solution (feel free to swap in '\w' where I used '.', and add a \s where I used \s+)

my $var = "John Doe";
(my $fname, my $linitial) = $var =~ /(.*)\s+(.).*/

Then you have the values

$fname = 'John';
$linitial = 'D';

and you can do:

print "$fname $linitial";

to get

"John D"

EDIT Until you do your next match, each of the capture parentheses creates a variable ($1 and $2, respectively), so the whole thing can be shortened a bit as follows:

my $var = "John Doe";
$var =~ /(.*)\s+(.).*/
print "$1 $2";

edited Jun 13, 2013 at 20:17

answered Jun 13, 2013 at 15:41

Matt Ritter

315 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

user1754493 Over a year ago

is there anyway to do this in one statement without creating new variables?

Matt Ritter Over a year ago

Yep! Perl has a (mostly) awesome penchant for auto-creating variables, which are perfectly suited for this sort of situation. I've updated my answer to include this shortened alternative

Borodin · Accepted Answer · 2013-06-13 15:40:06Z

1

To replace the last sequence of non-whitespace characters with just the initial character, you could write this

use strict;
use warnings;

my $var = "John Doe";

$var =~ s/(\S)\S*\s*$/$1/;

print $var;

output

John D

answered Jun 13, 2013 at 15:40

Borodin

127k9 gold badges72 silver badges146 bronze badges

Comments

Kalyan02 · Accepted Answer · 2013-06-13 15:46:46Z

0

Assuming your string has ascii names this will work

$var =~ s/([a-zA-Z]+)\s([a-zA-Z]+)/$1." ".substr($2,0,1)/ge;

answered Jun 13, 2013 at 15:46

Kalyan02

1,43411 silver badges16 bronze badges

1 Comment

user1754493 Over a year ago

for some reason this just returns a 1 instead of John D

amon · Accepted Answer · 2013-06-14 07:25:38Z

0

$var = "John Doe";
s/^(\w+)\s+(\w)/$1 \u$2/ for $var;

edited Jun 14, 2013 at 7:25

amon

57.8k2 gold badges93 silver badges152 bronze badges

answered Jun 14, 2013 at 6:26

K Tatyana

1

2 Comments

amon Over a year ago

This won't work, and just titlecases the second word, i.e. John van Doe → John Van Doe.

amon Over a year ago

I just wrote an anwer with a similar regex that explains how it works. You should then be able to understand why your regex doesn't show the requested behaviour.

amon · Accepted Answer · 2013-06-14 07:41:21Z

A simple regex that solves this problem is the substitution

s/^\w+\s+\K(\w).*/\U$1/s

What does this do?

^ \w+ \s+ matches a word at the beginning of the string, plus whitespace towards the next word
\K is the keep escape. It keeps the currently matched part outside of that substring that is considered “matched” by the regex engine. This avoids an extra capture group, and is practically a look-behind.
(\w) matches and captures one “word” character. This is the leading character of the second word in the string.
.* matches the rest of the string. I do this to overwrite any other names that may come: you stated that Lester del Ray should be transformed to Lester D, not Lester D Ray as a solution with \w* instead of the .* part would have done. The /s modifier is relevant for this, as it enables . to match every character including newlines (who knows what's inside the string?).
The substitution uses the \U modifier to uppercase the rest of the string, which consists of the value of the capture.

Test:

$ perl -E'$_ = shift; s/^\w+\s+\K(\w).*/\U$1/s; say' "Lester del Ray"
Lester D
$ perl -E'$_ = shift; s/^\w+\s+\K(\w).*/\U$1/s; say' "John Doe"
John D

Zach Leighton · Accepted Answer · 2013-06-13 15:23:11Z

-1

Something like this might be a little more usable/reusable in the long run.

$initial = sub { return substr shift, 0, 1 ; };

make a get initial function

$var =~ s/(\w)\s+(\w)/&$initial($1) &$initial($2)/sge;

Then replace the first and second results using execute in the regex;

answered Jun 13, 2013 at 15:23

Zach Leighton

1,94114 silver badges25 bronze badges

1 Comment

user1754493 Over a year ago

I can't use a sub routine for this case.

Collectives™ on Stack Overflow

Perl regex replace first name last name with first name last initial

6 Answers 6

2 Comments

Comments

1 Comment

2 Comments

Comments

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

2 Comments

Comments

1 Comment

2 Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related