Balancing efficiency and flexibility for DNN acceleration via temporal GPU-systolic array integration

Published in DAC 20, 2020

Paper