Accelerating Bandwidth-Bound Deep Learning Inference with Main-Memory Accelerators