Skip to main content
Filter by
Sorted by
Tagged with
-5 votes
1 answer
207 views

I have an existing c++ project that compiles fine with msvc, and I'm trying to get it to compile in Visual Studio 2022 with LLVM (clang-cl). At the moment I'm using C++ Language Standard Preview ISO C+...
dts's user avatar
  • 223
28 votes
1 answer
2k views

I've run the same binaries compiled with gcc-13 (https://godbolt.org/z/qq5WrE8qx) on Intel i3-N305 3.8GHz and AMD Ryzen 7 3800X 3.9GHz PCs. This code uses VCL library (https://github.com/vectorclass/...
Paul Jurczak's user avatar
  • 8,630
3 votes
1 answer
1k views

There are similar older questions, but they are using intrinsics and old instruction sets. I have a function f written with C++ vector class library (https://www.agner.org/optimize/#vectorclass): int ...
Paul Jurczak's user avatar
  • 8,630
1 vote
0 answers
351 views

I tried to test original SSE/AVX functions compared with Agner Fog's vector class(My CPU supports both SSE and AVX instructions). Everything was in line with forecast on GNU compiler. Then I switched ...
hamster_watcher's user avatar
1 vote
1 answer
101 views

My goal is to separate all vectorclass-library typenames to a separate namespace, so that vcl::Vec4i will compile, but Vec4i won't. I tried to use example from manual, however it's not working. Failed ...
Vladislav Kogan's user avatar
0 votes
3 answers
231 views

I want to vectorize following code using gather instructions in VCL. Some operations should be performed on the indexes of vSource defined by other vector VInd: vector<int> vSource; vector<...
Vladislav Kogan's user avatar
2 votes
2 answers
480 views

I'm using OpenMP to parallelize the loop, that is internally using AVX-512 with Agner Fog's VCL Vector Class Library. Here is the code: double HarmonicSeries(const unsigned long long int N) { ...
Jirka's user avatar
  • 405
1 vote
0 answers
143 views

I have been trying to speed up a binary tree evaluation algo using AVX2. Actually, I'm using Agner's VCL lib since the difference between hand-coding the algo and using vcl was small for big gain in ...
David Jobet's user avatar
1 vote
0 answers
117 views

I have tried my best to get the vectorclass library to generate AVX2 instructions, but can't get it to. I'm using MSVC2019. Here are the compile options: /permissive- /ifcOutput "x64\Release"...
blmckinley's user avatar
0 votes
0 answers
139 views

here a piece of simplified code that causes me a problem/error (Vec4d is coming from the Agner Fog library VCL) #define AVX256_ALIGNED_MALLOC(type,size) (type *)_aligned_malloc(size * sizeof(type),32)...
Luc's user avatar
  • 11
4 votes
2 answers
7k views

I need to install a software which requires SSE2 on my macbook air with M1 chip (os Monterey). When I am trying to compile the project I receive the following error: /libRootFftwWrapper/vectorclass/...
Alisa Nozdrina's user avatar
0 votes
0 answers
500 views

I have been making a custom vector class, however I have been bumping into a problem. The problem being that my vector just won't resize, the size stays 0. Anyone know what the problem is? Thanks in ...
Robert's user avatar
  • 1
-1 votes
2 answers
735 views

Does anyone know of a fix for an MSVC compiler bug/annoyance where SIMD Extension settings get "stuck" on AVX? The context of this question is coding up SIMD CPU dispatchers, closely ...
dts's user avatar
  • 223
1 vote
1 answer
452 views

I have a legacy Windows DLL (written in c++) for which I need to maintain a 32-bit version, along with the 64-bit version. I'm updating the heavy math code with simd using Agner's vector class library,...
dts's user avatar
  • 223
2 votes
3 answers
978 views

I am using Agner Fog's vectorclass library to use SIMD instructions (AVX specifically) in my application. Since it is best to use struct-of-array datastructures for easily employing SIMD, I quite ...
wvc's user avatar
  • 91
1 vote
3 answers
2k views

I am looking at parallel processing algorithm for processing speed improvement. I want to test Agner Fog's vector class library, VCL. I am wondering how to select different vector classes for example ...
batuman's user avatar
  • 7,346
6 votes
3 answers
810 views

How can I create a library that will dynamically switch between SSE, AVX, and AVX2 code paths depending on the host processor/OS? I am using Agner Fog's VCL (Vector Class Library) and compiling with ...
Aleksandr Dubinsky's user avatar