How to profile sort algorithms?

Question

I have coded a few sorting methods in C and I would like to find the input size at which the program is optimal (i.e.) profiling each algorithm. But how do I do this? I know to time each method, but I don't know how I can find the size at which it is 'optimal'.

dragon135 · Accepted Answer · 2011-06-06 16:27:21Z

3

It depends on some factors:

Data behaviour: is your data already partially sorted? or it is very random?
Data size: for a big input (say 1 thousand or more) you can assure that O(N^2) sorting methods will lose to O(N*log(N)) methods..
Data structure of the data: is it array or list or ?. Sorting method with non sequential access to data will be slower for something like list

So the answer is by empirically running your program with some real data you will likely handle combined by varying in the input size.

When a slower method (like O(N^2)) gets beaten by some faster method (like O(N*log(N))) when input size is > X then you can say that the slower method is 'empirically optimal' for input size <= X (the value depends on the characteristics of the input data).

edited Jun 6, 2011 at 16:27

answered Jun 6, 2011 at 16:19

dragon135

1,3868 silver badges19 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

R.. GitHub STOP HELPING ICE Over a year ago

One gap in your logic is that "O(n log n) on average" algorithms (i.e. quicksort) will deceptively look fast, but have corner cases that are hard to hit but very slow.

Voo Over a year ago

@R: Yeah I remember some DOS attacks (no idea where I heart that - maybe only hearsay, but well) that used specially crafted input data that caused the used quicksort to behave as a n^2 sort. So that's something to keep in mind! (although for most situations probably not important)

Alan Geleynse · Accepted Answer · 2011-06-06 16:23:31Z

Sort algorithms do not have a single number at which they are optimal.

For pure execution time, almost every sort algorithm will be fastest on a set of 2 numbers, but that it not useful in most cases.

Some sort algorithms may work more efficiently on smaller data sets, but that does not mean they are 'optimal' at that size.

Some sorts may also work better on other characteristics of the data. There are sorts that can be extremely efficient if the data is almost sorted already, but may be very slow if it is not. Others will run the same on any set of a given size.

It is more useful to look at the Big O of the sort (such as O(n^2), O(n log n) etc) and any special properties the sort has, such as operating on nearly sorted data.

maerics · Accepted Answer · 2011-06-06 16:16:14Z

0

To find the input size at which the program is optimal (by which I assume you mean the fastest, or for which the sorting algorithm requires the fewest comparisons) you will have to test it against various inputs and graph the independent axis (input size) against the dependent axis (runtime) and find the minimum.

answered Jun 6, 2011 at 16:16

maerics

158k47 gold badges277 silver badges299 bronze badges

Collectives™ on Stack Overflow

How to profile sort algorithms?

3 Answers 3

2 Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

2 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related