Spaces:
Runtime error
Runtime error
| Directions for compiling and running the benchmark with Ubuntu Linux: | |
| Install Intel's Threading Building Blocks library (TBB): | |
| $ sudo apt-get install libtbb-dev | |
| Compile the benchmark: | |
| $ nvcc -O3 -arch=sm_20 bench.cu -ltbb -o bench | |
| Run the benchmark: | |
| $ ./bench | |
| Typical output (Tesla C2050): | |
| Benchmarking with input size 33554432 | |
| Core Primitive Performance (elements per second) | |
| Algorithm, STL, TBB, Thrust | |
| reduce, 3121746688, 3739585536, 26134038528 | |
| transform, 1869492736, 2347719424, 13804681216 | |
| scan, 1394143744, 1439394816, 5039195648 | |
| sort, 11070660, 34622352, 673543168 | |
| Sorting Performance (keys per second) | |
| Type, STL, TBB, Thrust | |
| char, 24050078, 62987040, 2798874368 | |
| short, 15644141, 41275164, 1428603008 | |
| int, 11062616, 33478628, 682295744 | |
| long, 11249874, 33972564, 219719184 | |
| float, 9850043, 29011806, 692407232 | |
| double, 9700181, 27153626, 224345568 | |
| The reported numbers are performance rates in "elements per second" (higher is better). | |