I'm trying to understand the practical value of "cache-friendly" design in lock-free queues. I often see people go to great lengths to pad structures, align data, and avoid false sharing — especially around head/tail pointers or buffer elements.
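To make concrete what I mean by padding/alignment, here's the kind of layout I keep seeing (my own sketch, C++17; `kCacheLine` and the struct names are mine, and the 64-byte fallback is an assumption about typical x86/ARM line sizes):

```cpp
#include <atomic>
#include <cstddef>
#include <new>

// Use the standard constant where available; 64 bytes is an assumed fallback.
#if defined(__cpp_lib_hardware_interference_size)
constexpr std::size_t kCacheLine = std::hardware_destructive_interference_size;
#else
constexpr std::size_t kCacheLine = 64;
#endif

// alignas pads each index out to a full cache line, so the producer's writes
// to tail never invalidate the line holding head (and vice versa).
struct alignas(kCacheLine) PaddedIndex {
    std::atomic<std::size_t> value{0};
};

struct RingBufferIndices {
    PaddedIndex head;  // consumer-owned
    PaddedIndex tail;  // producer-owned
};

static_assert(sizeof(PaddedIndex) == kCacheLine,
              "each index occupies exactly one cache line");
```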
However, in a real-world, high-throughput system with multiple threads (say 10+), each thread is doing a lot of processing after dequeuing a value. For example:
- A thread pops a value from the queue
- Looks up other data in unordered_maps
- Does string manipulations
- Allocates temporary memory
- Performs various calculations
- ...and so on
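A sketch of the per-item work I'm describing (all names here are hypothetical, just to show the cache-hostile access pattern):

```cpp
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical post-dequeue processing: a hash-map lookup (pointer chasing),
// string building, a temporary allocation, and some arithmetic. Each step
// pulls fresh data into L1/L2, evicting whatever lines the queue touched.
int process(int key, const std::unordered_map<int, std::string>& lookup) {
    std::string result;                  // string manipulation
    auto it = lookup.find(key);          // lookup in an unordered_map
    if (it != lookup.end()) result += it->second;
    std::vector<int> scratch(256, key);  // temporary memory allocation
    long sum = 0;
    for (int v : scratch) sum += v * v;  // various calculations
    return static_cast<int>(sum % 97) + static_cast<int>(result.size());
}
```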
All this downstream activity thrashes the thread's local L1/L2 caches anyway, evicting whatever lines the queue touched. So what’s the point of carefully optimizing the cache layout of the queue?
If thread A is constantly running and working with new data, and thread B is doing the same, doesn’t that mean any "cache locality" or "cache line isolation" will be short-lived or useless?
To be clear, I'm not questioning the theory behind false sharing — I understand that writing to the same cache line from multiple cores causes coherence traffic. But in practice, does padding and aligning in the queue really matter when everything gets evicted from cache almost immediately during downstream processing?
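For concreteness, the false-sharing effect I'm referring to is the kind shown by microbenchmarks like this one (my own sketch, C++11; the struct and function names are mine, and 64 bytes is an assumed line size). Timing `hammer` on `SharedLine` vs `PaddedLines` is what typically shows the coherence-traffic gap:

```cpp
#include <atomic>
#include <cstddef>
#include <thread>

constexpr std::size_t kCacheLine = 64;  // assumption: typical line size

// Unpadded: both counters sit on one cache line, so every increment on one
// core invalidates the other core's copy of that line (coherence ping-pong).
struct SharedLine {
    std::atomic<long> a{0};
    std::atomic<long> b{0};
};

// Padded: each counter gets its own line; the two threads never contend.
struct PaddedLines {
    alignas(kCacheLine) std::atomic<long> a{0};
    alignas(kCacheLine) std::atomic<long> b{0};
};

// Two threads hammer independent counters; only the layout differs.
template <typename Counters>
long hammer(Counters& c, long iters) {
    std::thread t1([&] {
        for (long i = 0; i < iters; ++i)
            c.a.fetch_add(1, std::memory_order_relaxed);
    });
    std::thread t2([&] {
        for (long i = 0; i < iters; ++i)
            c.b.fetch_add(1, std::memory_order_relaxed);
    });
    t1.join();
    t2.join();
    return c.a.load() + c.b.load();
}
```

Both versions compute the same result; only the wall-clock time differs, which is exactly why I'm unsure the gap survives once the real workload is cache-hostile anyway.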
Would love clarification from someone who has benchmarked or dealt with this in production.