Benanza: Automatic Benchmark Generation to Characterize “Lower-Bound” Latency of ML Models and Inform Optimizations on GPUs

Benanza: Automatic Benchmark Generation to Characterize “Lower-Bound” Latency of ML Models and Inform Optimizations on GPUs