help using command line to extract snippets of data on stdout

Question

I would like the option of extracting the following string/data:

/work/foo/processed/25
/work/foo/processed/myproxy
/work/foo/processed/sample

=or=

25
myproxy
sample

But it would help if I see both.

From this output using cut or perl or anything else that would work:

Found 3 items
drwxr-xr-x   - foo_hd foo_users          0 2011-03-16 18:46 /work/foo/processed/25
drwxr-xr-x   - foo_hd foo_users          0 2011-04-05 07:10 /work/foo/processed/myproxy
drwxr-x---   - foo_hd testcont           0 2011-04-08 07:19 /work/foo/processed/sample

Doing a cut -d" " -f6 will get me foo_users, testcont. I tried increasing the field to higher values and I'm just not able to get what I want.

I'm not sure if cut is good for this or something like perl? The base directories will remain static /work/foo/processed.

Also, I need the first line Found Xn items removed. Thanks.

Why cut this? Why not just ls the directory you're interested in? i.e. ls /work/foo/processed/ — yan
– yan, Commented Apr 8, 2011 at 19:19

kurumi · Accepted Answer · 2011-04-09 00:17:45Z

1

You can do a substitution from beginning to the first occurrence of / , (non greedily)

$ your_command | ruby -ne  'print $_.sub(/.*?\/(.*)/,"/\\1") if /\//'
/work/foo/processed/25
/work/foo/processed/myproxy
/work/foo/processed/sample

Or you can find a unique separator (field delimiter) to split on. for example, the time portion is unique , so you can split on that and get the last element. (2nd element)

$ ruby -ne  'print $_.split(/\s+\d+:\d+\s+/)[-1] if /\//' file
/work/foo/processed/25
/work/foo/processed/myproxy
/work/foo/processed/sample

With awk,

$ awk -F"[0-9][0-9]:[0-9][0-9]" '/\//{print $NF}' file
 /work/foo/processed/25
 /work/foo/processed/myproxy
 /work/foo/processed/sample

answered Apr 9, 2011 at 0:17

kurumi

25.7k5 gold badges47 silver badges52 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

jdamae Over a year ago

thanks! it works great. I went with the awk option. I'm not familiar with ruby.

Sam Choukri · Accepted Answer · 2011-04-09 06:55:32Z

1

perl -lanF"\s+" -e 'print @F[-1] unless /^Found/' file

Here is an explanation of the command-line switches used:

-l: remove line break from each line of input, then add one back on print
-a: auto-split each line of input into an @F array
-n: loop through each line of input
-F: the regexp pattern to use for the auto-split (with -a)
-e: the perl code to execute (for each line of input if using -n or -p)

If you want to just output the last portion of your directory path, and the basedir is always '/work/foo/processed', I would do this:

perl -nle 'print $1 if m|/work/foo/processed/(\S+)|' file

edited Apr 9, 2011 at 6:55

answered Apr 9, 2011 at 0:55

Sam Choukri

1,90411 silver badges17 bronze badges

2 Comments

jdamae Over a year ago

C. - thanks for the perl version. Can you elaborate on the -lanF argument? this is interesting. thanks again.

jdamae Over a year ago

I was playing around with this. What if I want everything after /work/foo/processed? (Basically removing the directory). How would I tweak this? thanks again.

Shalini · Accepted Answer · 2011-04-09 00:52:28Z

0

Try this out :

<Your Command> | grep -P -o '[\/\.\w]+$' 

OR if the directory '/work/foo/processed' is always static then:

<Your Command>| grep -P -o '\/work\/foo\/processed\/.+$' 

-o : Show only the part of a matching line that matches PATTERN.
-P : Interpret PATTERN as a Perl regular expression.

In this example, the last word in the input will be matched . (The word can also contain dot(s)),so file names like 'text_file1.txt', can be matched). Ofcourse, you can change the pattern, as per your requirement.

edited Apr 9, 2011 at 0:52

answered Apr 8, 2011 at 21:05

Shalini

4551 gold badge3 silver badges5 bronze badges

4 Comments

jdamae Over a year ago

I like this. Although I had to edit my question. I did forget to that the output also had Found 3 items which I don't care for. Is there a way to remove that? thanks.

Shalini Over a year ago

Yes, it can be done by changing the pattern you pass with -P. I have edited my reply per your requirement.

jdamae Over a year ago

@Shalini- thanks for your input. How do I tweak his to just grab the values after the static directory? (i.e. 25, myproxy, sample).

Shalini Over a year ago

I thought you were looking for the dirpath. You can re-grep the output of the first grep, something like this : <Your Command>| grep '/work/foo/processed'| grep -P -o '[\.\w]+$' This should give you the filenames

MJB · Accepted Answer · 2011-04-08 19:18:51Z

0

If you know the columns will be the same, and you always list the full path name, you could try something like:

ls -l | cut -c79-

which would cut out the 79th character until the end. That might work in this exact case, but I think it would be better to find the basename of the last field. You could easily do this in awk or perl. Respond if this is not what you want and I'll add the awk and perl versions.

answered Apr 8, 2011 at 19:18

MJB

7,6862 gold badges34 silver badges41 bronze badges

Comments

Vijay · Accepted Answer · 2011-04-08 19:23:02Z

0

take the output of your ls command and pipe it to awk

your command|awk -F'/' '{print $NF}'

answered Apr 8, 2011 at 19:23

Vijay

67.7k94 gold badges238 silver badges327 bronze badges

Comments

tadmc · Accepted Answer · 2011-04-08 20:02:55Z

0

your_command | perl -pe 's#.*/##'

answered Apr 8, 2011 at 20:02

tadmc

3,74518 silver badges14 bronze badges

1 Comment

jdamae Over a year ago

thanks. that's working well. I actually would want to now add that base directory too. /work/foo/processed. Can you show me how to get that? I will change my question to reflect that. thank you.

Collectives™ on Stack Overflow

help using command line to extract snippets of data on stdout

6 Answers 6

1 Comment

2 Comments

4 Comments

Comments

Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

1 Comment

2 Comments

4 Comments

Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related