Integrated Omni-Path GPU * 11 Intel Xeon E5-2680 v3 @2.50 GHz (2 CPUs/node, 24 cores/node) Nvidia Tesla K80 (2 accelerators/node) 2133 MT/s, Dual Rank, x4 Data Width RDIMM (128 GiB/node) 200 GB SSD (1 ...
All benchmarks were run on bare-metal box with the following specifications: 2 x Xeon E5-2680 2.50GHz (HT enabled, 24 cores, 48 threads) 128 GB of RAM. 1 Samsung PM863 960 GB SSD with XFS file system.
test.py <size of matrices> performance comparison between cython+numpy version, numpy+normal loop version and numpy.dot() version (with intel xeon E5-2680, 16 cores): $ ./test.py 256 matrix size: 256 ...