Performance
Revision as of 13:33, 14 July 2010 by 128.164.237.205 (talk)
Carver
Tester: Ben Gamari
Test date: 14 Jul 2010
Commit: e3e4ffafd158abd004c483694a27f4f6bc7d2185
Hardware: CUDA version 3.0
| Kernel | Configuration | Bandwidth | FLOPs |
|---|---|---|---|
| Dslash_cuda | | 73 GB/s | 32 GFLOP/s |
| | | 74 GB/s | 34 GFLOP/s |
| Dslash_multi_gpu (double) | | 79 GB/s | 35 GFLOP/s |
| | | 145 GB/s | 64 GFLOP/s |
| | | 256 GB/s | 114 GFLOP/s |
| Dslash_multi_gpu (double) | | 79 GB/s | 76 GFLOP/s |
| | | 156 GB/s | 140 GFLOP/s |
| | | 283 GB/s | 252 GFLOP/s |
| | | 82 GB/s | 3.4 GFLOP/s |
| | | 88 GB/s | N/A |
| | | 84 GB/s | N/A |
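
The Bandwidth and FLOPs columns can be compared row by row by dividing one by the other, which gives the floating-point operations performed per byte of memory traffic. The sketch below is only an illustration of that arithmetic: the helper name `flops_per_byte` and the "first group" / "second group" labels are assumptions introduced here, and the numbers are copied from three rows of the table.

```python
# Hypothetical helper (not part of the benchmarked code): FLOPs delivered
# per byte of memory traffic, from a row's two measured columns.
def flops_per_byte(gflops, bandwidth_gb_s):
    return gflops / bandwidth_gb_s

# (label, bandwidth in GB/s, throughput in GFLOP/s), copied from the table.
# The group labels are assumptions used only to tell the two
# Dslash_multi_gpu blocks apart.
rows = [
    ("Dslash_cuda", 73, 32),
    ("Dslash_multi_gpu (double), first group", 79, 35),
    ("Dslash_multi_gpu (double), second group", 79, 76),
]

for name, bandwidth, gflops in rows:
    print(f"{name}: {flops_per_byte(gflops, bandwidth):.2f} FLOP/byte")
```

For these rows the ratio is roughly 0.44 for the first two and roughly 0.96 for the third, so the second Dslash_multi_gpu block performs about twice as many operations per byte moved at the same measured bandwidth.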