Thanks for the comments. I will go with pthread for now, keeping Open MPI as a fallback if needed.
I created a threaded initialization of a 20,000×20,000 matrix in a small test program:
https://github.com/Hermann-SW/RR/blob/m ... /20K.D.cpp
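For readers who do not want to open the link, here is a minimal sketch of how such a row-partitioned pthread initialization could look. It is not the code from 20K.D.cpp: the names InitTask, cell_value, init_rows and threaded_init, the uint32_t element type and the fill rule are all assumptions made for illustration only.

```cpp
#include <pthread.h>
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical work description for one thread: fill rows [row_begin, row_end).
struct InitTask {
    std::vector<std::uint32_t>* matrix;  // n*n entries, row-major
    std::size_t n;
    std::size_t row_begin;
    std::size_t row_end;
};

// Placeholder fill rule (an assumption, not the real distance computation).
static std::uint32_t cell_value(std::size_t i, std::size_t j) {
    return static_cast<std::uint32_t>(i > j ? i - j : j - i);
}

// Thread entry point: each thread writes only its own block of rows,
// so no locking is needed.
static void* init_rows(void* arg) {
    InitTask* t = static_cast<InitTask*>(arg);
    for (std::size_t i = t->row_begin; i < t->row_end; ++i)
        for (std::size_t j = 0; j < t->n; ++j)
            (*t->matrix)[i * t->n + j] = cell_value(i, j);
    return nullptr;
}

// Fills the n x n matrix using num_threads pthreads and
// returns the elapsed wall-clock time in microseconds.
long long threaded_init(std::vector<std::uint32_t>& matrix, std::size_t n,
                        unsigned num_threads) {
    assert(num_threads >= 1 && matrix.size() == n * n);
    std::vector<pthread_t> threads(num_threads);
    std::vector<InitTask> tasks(num_threads);

    auto start = std::chrono::steady_clock::now();
    for (unsigned k = 0; k < num_threads; ++k) {
        // Contiguous row blocks; the integer division spreads any remainder.
        tasks[k] = {&matrix, n, n * k / num_threads, n * (k + 1) / num_threads};
        int rc = pthread_create(&threads[k], nullptr, init_rows, &tasks[k]);
        assert(rc == 0);
        (void)rc;  // keeps -Wall quiet when asserts are compiled out
    }
    for (unsigned k = 0; k < num_threads; ++k)
        pthread_join(threads[k], nullptr);
    auto stop = std::chrono::steady_clock::now();

    return std::chrono::duration_cast<std::chrono::microseconds>(stop - start).count();
}
```

With this sketch, a call like threaded_init(m, 20000, 4) on a vector of 20000*20000 entries would need about 1.6 GB for the assumed 4-byte element type, so a smaller n is advisable for a first try.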
If compiling without optimization
Code:
pi@raspberrypi5:~/RR/tsp/pthread $ g++ -Wall -Wextra -pedantic 20K.D.cpp
pi@raspberrypi5:~/RR/tsp/pthread $
reported runtimes in microseconds show the desired speedups.
I did this on a Pi5, so the number of threads is between 1 and 4.
No explicit allocation of threads to a specific core yet.
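If pinning turns out to be useful later, one possible approach on Linux is pthread_setaffinity_np, a GNU extension and not portable POSIX. The following is only a sketch under that assumption; the helper names pin_self_to_cpu and first_allowed_cpu are made up here and are not part of the test code.

```cpp
#include <pthread.h>
#include <sched.h>
#include <cassert>

// Pins the calling thread to a single CPU.
// Returns 0 on success, otherwise an error number.
// Relies on glibc's pthread_setaffinity_np; g++ defines _GNU_SOURCE by
// default, which makes the declaration and the CPU_* macros visible.
int pin_self_to_cpu(int cpu) {
    assert(cpu >= 0 && cpu < CPU_SETSIZE);
    cpu_set_t set;
    CPU_ZERO(&set);
    CPU_SET(cpu, &set);
    return pthread_setaffinity_np(pthread_self(), sizeof(set), &set);
}

// Returns the lowest CPU index the calling thread is currently allowed
// to run on, or -1 if the affinity mask cannot be queried.
int first_allowed_cpu() {
    cpu_set_t set;
    CPU_ZERO(&set);
    if (pthread_getaffinity_np(pthread_self(), sizeof(set), &set) != 0)
        return -1;
    for (int cpu = 0; cpu < CPU_SETSIZE; ++cpu)
        if (CPU_ISSET(cpu, &set))
            return cpu;
    return -1;
}
```

In a worker thread, calling pin_self_to_cpu(k) as the first statement would tie worker k to core k, which on a Pi5 means k between 0 and 3. Whether that changes the measured runtimes at all would have to be tested.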
Code:
pi@raspberrypi5:~/RR/tsp/pthread $ ./a.out 1
4860146us
pi@raspberrypi5:~/RR/tsp/pthread $ ./a.out 2
2450638us
pi@raspberrypi5:~/RR/tsp/pthread $ ./a.out 3
1642796us
pi@raspberrypi5:~/RR/tsp/pthread $ ./a.out 4
1244822us
pi@raspberrypi5:~/RR/tsp/pthread $ freq
min=cur=3000000=max
pi@raspberrypi5:~/RR/tsp/pthread $
P.S:
With a small commit made earlier
https://github.com/Hermann-SW/RR/commit ... 6e60fc0791
the time for running "init_dist()" is now reported. That will allow verifying the intended speedups.
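To put numbers on the speedups, the four runtimes reported for the matrix test (4860146us, 2450638us, 1642796us and 1244822us for 1 to 4 threads) can be turned into speedup and parallel efficiency relative to the single-thread run. A minimal sketch, where Scaling and scaling_from_runtimes are illustrative names and not part of the repository:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// One row of the scaling table for a given thread count.
struct Scaling {
    unsigned threads;
    double speedup;     // T(1) / T(threads)
    double efficiency;  // speedup / threads
};

// runtimes_us[k] is the runtime measured with k+1 threads, in microseconds.
std::vector<Scaling> scaling_from_runtimes(const std::vector<long long>& runtimes_us) {
    assert(!runtimes_us.empty() && runtimes_us[0] > 0);
    std::vector<Scaling> out;
    for (std::size_t k = 0; k < runtimes_us.size(); ++k) {
        unsigned threads = static_cast<unsigned>(k + 1);
        double speedup = static_cast<double>(runtimes_us[0]) /
                         static_cast<double>(runtimes_us[k]);
        out.push_back({threads, speedup, speedup / threads});
    }
    return out;
}
```

For the four runtimes above this gives speedups of roughly 1.98, 2.96 and 3.90 for 2, 3 and 4 threads, so the efficiency stays above 97%. The same helper could be fed with the "init_dist()" times once those are measured.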
Statistics: Posted by HermannSW — Tue Aug 19, 2025 10:03 pm