Newest 'cache-locality' Questions

3 votes

3 answers

360 views

ArrayList vs LinkedList in terms of cache locality

How does cache locality impact the performance of ArrayList compared to LinkedList in Java? I've often heard that ArrayList has an advantage in terms of cache locality, but I don't fully understand ...

Marat Tim

73

asked Apr 26 at 19:50

1 vote

0 answers

82 views

Why does cache-friendly design matter in lock-free queues if threads trash their cache anyway?

I'm trying to understand the practical value of "cache-friendly" design in lock-free queues. I often see people go to great lengths to pad structures, align data, and avoid false sharing — ...

SpeakX

427

asked Apr 22 at 0:21

0 votes

0 answers

216 views

Pimpl idiom without pointer indirection?

I'm developing a C++ API and I want to hide private implementation details from the public interface. Currently, I'm employing the Pimpl idiom for this purpose. However, I'm also mindful of minimizing ...

ehopperdietzel

324

asked Feb 18, 2024 at 23:38

0 votes

2 answers

134 views

Do different ways of initializing a Vec<T> create different memory layout?

fn main() { let vec0 = vec![0; 10]; let mut vec1 = vec![]; for _ in 0..10 { vec1.push(0); } assert_eq!(vec0.len(), vec1.len()); } In this example, vec0 and vec1 are ...

Rahn

5,565

asked Aug 2, 2023 at 3:41

0 votes

2 answers

271 views

Can a custom allocator improve cache locality for lists?

This is a rather hypothetical question. I only have limited knowledge about how the cpu cache works. I know a cpu loads subsequent bytes into the cache. Since a list uses pointers/indirection into ...

Raildex

5,428

asked Feb 10, 2022 at 8:22

-1 votes

2 answers

1k views

Cache Locality - weight of TLB, Cache Lines, and ...?

From my understanding the constructs which give rise to the high level concept of "cache locality" are the following: Translation Lookaside Buffer (TLB) for virtual memory translation. ...

gmaggiol

15

asked Feb 23, 2021 at 22:48

3 votes

1 answer

92 views

Why is inserting sorted keys into std::set so much faster than inserting shuffled keys?

I was accidentally surprised to found that inserting sorted keys into std::set is much much faster than inserting shuffled keys. This is somewhat counterintuitive since a red-black tree (I verified ...

Leon Cruz

375

asked Feb 23, 2021 at 11:20

2 votes

2 answers

2k views

Cache misses when accessing an array in nested loop

So I have this question from my professor, and I can not figure out why vector2 is faster and has less cache misses than vector1. Assume that the code below is a valid compilable C code. Vector2: void ...

Pengibaby

373

asked Jan 27, 2021 at 19:47

0 votes

1 answer

1k views

Determining optimal block size for blocked matrix multiplication

I am trying to implement blocked (tiled) matrix multiplication on a single processor. I have read the literature on why blocking improves memory performance, but I just wanted to ask how to determine ...

user14634701

asked Jan 6, 2021 at 6:52

1 vote

0 answers

454 views

Improving cache locality of binary search by doing local linear search?

Binary search of a sorted array may have poor cache locality, due to random access of memory, but linear search is slow for a large array. Is it possible to design a hybrid algorithm? For example, you ...

felix

647

asked Oct 14, 2020 at 9:18

0 votes

0 answers

83 views

Cache locality considerations

I have been trying to get better awareness of cache locality. I produced the 2 code snippets to gain better understanding of the cache locality characteristics of both. vector<int> v1(1000, some ...

roulette01

2,502

asked Sep 7, 2020 at 16:58

1 vote

0 answers

48 views

Detect whether a cache line is reused due to spatial or temporal locality

Is there a practical tool to detect whether a cache line is reused (a cache miss is avoided) due to either spatial or temporal locality? I could not find a related discussion in cachegrind. I was able ...

Kadir

1,715

asked Aug 6, 2020 at 5:13

2 votes

0 answers

73 views

In Apache Spark, how to make a task to always execute on the same machine?

In its simplest form, RDD is merely a placeholder of chained computations that can be arbitrarily scheduled to be executed on any machine: val src = sc.parallelize(0 to 1000) val rdd = src....

tribbloid

3,822

asked Apr 24, 2020 at 21:33

2 votes

2 answers

1k views

Importance of padding in Dynamic Memory Allocation

I am trying to implement a heap (implicit free list with header/footer) and deciding on whether I should add padding to it. What are the tangible benefits of adding pads? I read that it somehow ...

Silver Flash

1,121

asked Nov 9, 2019 at 16:07

0 votes

1 answer

585 views

Understanding data cache locality in mips code

I have been browsing stackoverflow could not really find a example regarding to this one. I understand the concept of Temporal and Spatial locality for data cache: Temporarl locality: address ...

nihulus

1,495

asked May 26, 2019 at 10:01

2 votes

0 answers

240 views

How to benchmark random access with JMH?

I was trying to observe the effects of CPU cache spatial locality by benchmarking sequential/random reads to an array with JMH. Interestingly, the results are almost the same. So I wonder, is this ...

pistolPanties

1,890

asked Mar 18, 2019 at 22:05

6 votes

2 answers

6k views

Does _mm_clflush really flush the cache?

I'm trying to understand how the hardware cache works by writing and running a test program: #include <stdio.h> #include <stdint.h> #include <x86intrin.h> #define LINE_SIZE 64 #...

xiaogw

765

asked Sep 26, 2018 at 20:52

2 votes

1 answer

172 views

Why can't modern compilers optimize row-major order accesses in loops?

In the textbook Computer Systems: a Programmer's Perspective there are some impressive benchmarks for optimizing row-major order access. I created a small program to test for myself if a simple ...

Adam Thompson

3,546

asked Aug 8, 2018 at 23:19

1 vote

0 answers

109 views

Java mechanical sympathy through thread pinning

Given we have an application that is heavily polluted with concurrency constructs, multiple techniques are used (different people worked without clear architecture in mind), multiple questionable ...

vach

11.5k

asked Jun 26, 2018 at 4:32

0 votes

3 answers

392 views

What is the difference in the Code Snippets?

I am a beginner in operating systems, and I am trying to understand some code snippets. Can you please explain to me the difference between these code snippets?? int sum_array_rows(int a[M][N]) { ...

Agapi

107

asked Jun 4, 2018 at 7:36

3 votes

2 answers

2k views

Understanding spatial and temporal locality

I was studying for my architecture final and came across the following lines of code: for(i = 0; i <= N ;i++){ a[i] = b[i] + c[i]; } The question is: "How does this code snippet demonstrate ...

Carlos Romero

181

asked May 3, 2018 at 22:22

0 votes

1 answer

1k views

Cache effects and importance of locality

I have read this blog and I am still unsure about the importance of locality. Why is locality important for cache performance? Is it because it leads to fewer cache misses? Furthermore, how is a ...

Bab

443

asked Apr 19, 2018 at 13:56

5 votes

2 answers

1k views

CPU spatial cache locality in array iteration

My understanding of the L1 cache was that a memory fetch loads a cache line. Assuming the cache line size is 64 bytes, if I access memory at address p, it will load the entire block from p to p + 64 ...

user1413793

9,427

asked Feb 14, 2018 at 6:58

0 votes

1 answer

164 views

Ranges of nested for-loops when locality is improved (C++)

I have the following nested for loop: int n = 8; int counter = 0; for (int i = 0; i < n; i++) { for (int j = i + 1; j < n; j++) { printf("(%d, %d)\n", i, j); counter++; ...

BodneyC

110

asked Oct 22, 2017 at 18:14

0 votes

1 answer

176 views

Apache Drill database and data locality

I have two servers. The first server (A) contains the zookeeper, a mongodb database and a drillbit. The second server (B) contains a hadoop distribution with several hive tables, a postgresql database ...

Ivan

1

asked Jun 10, 2017 at 17:07

1 vote

5 answers

2k views

data locality for implementing 2d array in c/c++

Long time ago, inspired by "Numerical recipes in C", I started to use the following construct for storing matrices (2D-arrays). double **allocate_matrix(int NumRows, int NumCol) { double **x; int ...

John Smith

1,109

asked May 17, 2017 at 16:20

0 votes

1 answer

2k views

C++ : vector of vector and cache locality

I am looking for a library/solution that will alleviate the rather important number of cache miss I am experiencing in my program class Foo{ std::vector<Foo*> myVec; // Rest of the ...

B. D

7,818

asked Feb 14, 2017 at 15:36

17 votes

4 answers

43k views

Confused between Temporal and Spatial locality in real life code

I was reading this question, I wanted to ask more about the code that he showed i.e for(i = 0; i < 20; i++) for(j = 0; j < 10; j++) a[i] = a[i]*j; The questions are, I understand ...

user379888

asked Oct 18, 2011 at 18:14

Collectives™ on Stack Overflow

ArrayList vs LinkedList in terms of cache locality

Why does cache-friendly design matter in lock-free queues if threads trash their cache anyway?

Pimpl idiom without pointer indirection?

Do different ways of initializing a Vec<T> create different memory layout?

Can a custom allocator improve cache locality for lists?

Cache Locality - weight of TLB, Cache Lines, and ...?

Why is inserting sorted keys into std::set so much faster than inserting shuffled keys?

Cache misses when accessing an array in nested loop

Determining optimal block size for blocked matrix multiplication

Improving cache locality of binary search by doing local linear search?

Cache locality considerations

Detect whether a cache line is reused due to spatial or temporal locality

In Apache Spark, how to make a task to always execute on the same machine?

Importance of padding in Dynamic Memory Allocation

Understanding data cache locality in mips code

How to benchmark random access with JMH?

Does _mm_clflush really flush the cache?

Why can't modern compilers optimize row-major order accesses in loops?

Java mechanical sympathy through thread pinning

What is the difference in the Code Snippets?

Understanding spatial and temporal locality

Cache effects and importance of locality

CPU spatial cache locality in array iteration

Ranges of nested for-loops when locality is improved (C++)

Apache Drill database and data locality

data locality for implementing 2d array in c/c++

C++ : vector of vector and cache locality

Confused between Temporal and Spatial locality in real life code

Hot Network Questions