File sort by time issue

Question

I have a requirement to identify sequence gap in a set of files. Sequence starts at FILENAME_0001 and ends at FILENAME_9999. After this the sequence is restarted from 0001.

To implement a proper sequence check I used ls -rt to pick the files in order of modified time and the compared with the previous files sequence number. If the previous file was 9999 I check whether the next one is 0001 (to accommodate the sequence reset).

Recently I came across a scenario where files were listed in the below order:

FILENAME_0001 
FILENAME_0002
FILENAME_0005
FILENAME_0003
FILENAME_0004
FILENAME_0006
FILENAME_0007

This was because files 3, 4 & 5 had the same modified time to the second. Only the millisecond was different. So I am guessing ls -rt considers only upto the seconds. Could someone suggest a workaround?

If I understand correctly, you're creating the files ordered by the sequence no (which is also increasing time), but not all files are created - so there are missing values in the sequence, which you're trying to identify. If so, cant you just sort the files by their seq nos and look for the missing ones ? — jai_s
– jai_s, Commented Feb 11, 2016 at 10:08
Yes. But note that seq numbers reset at 9999. So the way you suggested it will fail when I have files FILENAME_9999, FILENAME_0001, FILENAME_0002 — Mathew Paret
– Mathew Paret, Commented Feb 11, 2016 at 10:19
What Unix variant are you running this on? Getting at sub-second timestamps isn't portable. — Gilles 'SO- stop being evil'
– Gilles 'SO- stop being evil', Commented Feb 11, 2016 at 22:24

Otheus · Accepted Answer · 2016-02-12 11:13:09Z

1

If your find has printf, print out the mtime in seconds followed by the filename, then use sort, and finally cut:

find . -type f -printf "%T@\t%f\n" |
sort -k 1n -k 2 |
cut -f 2-

The find outputs TIMESTAMP FILENAME on each line. The sort first sorts the timestamps in numerical order. If the timestamps are equal, it will use the filename as a last resort. The cut removes the timestamp from the output.

EDIT: Your perl solution works, but I would do it differently. Here's the simplest:

find . -type f -print | 
perl -lne 'print (((stat($_))[9]."\t".$_)' |
sort -k 1n -k 2 |
cut -f 2-

No need to convert the time to a string and back again. Just output stat's mtime as a numeric value as find would have done.

edited Feb 12, 2016 at 11:13

answered Feb 11, 2016 at 11:01

Otheus

6,4172 gold badges25 silver badges58 bronze badges

Sounds good. I'll test it tomorrow and let you know. I don't have access to the system now.

Mathew Paret
– Mathew Paret

2016-02-11 12:11:00 +00:00
Commented Feb 11, 2016 at 12:11
This work in my home. But the actual box where I needed it to run doesn't support it 😔. The version of find I have installed doesn't up port printf

Mathew Paret
– Mathew Paret

2016-02-12 02:59:18 +00:00
Commented Feb 12, 2016 at 2:59
What kind of system is it? (uname -a should be a clue.) Does it support the stat command with format control? (stat -c ) Can you install GNU find?

Otheus
– Otheus

2016-02-12 07:00:49 +00:00
Commented Feb 12, 2016 at 7:00
I use HP-UX system. However I am marking your answer also as correct since if worked on my home machine.

Mathew Paret
– Mathew Paret

2016-02-12 07:07:48 +00:00
Commented Feb 12, 2016 at 7:07

Add a comment |

Mathew Paret · Accepted Answer · 2016-02-12 07:14:06Z

1

Finally got it working. I used the below code:

for FILENAME in $(ls...); do
FILE_TIME=$(perl -e '@d=localtime ((stat(shift))[9]); printf "%4d%02d%02d%02d%02d%02d\n", $d[5]+1900,$d[4]+1,$d[3],$d[2],$d[1],$d[0]' $FILENAME)
echo "$FILE_TIME $FILENAME"
done | sort -k 1n -k 2 | cut -d" " -f2

I use HP-UX system.

edited Feb 12, 2016 at 7:14

answered Feb 12, 2016 at 7:06

Mathew Paret

934 silver badges13 bronze badges

This works, but for a slightly less cumbersome way, check out my edit to my answer :)

Otheus
– Otheus

2016-02-12 11:13:47 +00:00
Commented Feb 12, 2016 at 11:13

Add a comment |

Stack Exchange Network

File sort by time issue

2 Answers 2

You must log in to answer this question.

Hot Network Questions

File sort by time issue

2 Answers 2

You must log in to answer this question.

Related

Hot Network Questions