Serial vs Parallel Evaluation

Serial vs Parallel Evaluation

We evaluate the performance of both serial and parallel evaluation strategies across systems.

System Tesla_K80

BatchSize 1 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System Tesla_K80

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System Tesla_M60

BatchSize 1 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System Tesla_M60

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System TITAN_Xp

BatchSize 1 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System TITAN_Xp

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System TITAN_V

BatchSize 1 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System TITAN_V

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System Tesla_V100-SXM2-16GB

BatchSize 1 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System Tesla_V100-SXM2-16GB

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System Quadro_RTX_6000

BatchSize 1 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System Quadro_RTX_6000

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

System Tesla_T4

BatchSize 1 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 2 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 4 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 8 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 16 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime

BatchSize 32 on System Tesla_T4

Normalized Against MXNet End-to-End Runtime

Comparing Latency of Parallel, Serial, and MXNet End-to-End Runtime