Apr 4, 2021
Speaker · 0 followers
Speaker · 0 followers
Speaker · 0 followers
Speaker · 0 followers
There is growing interest in multi-branch CNN architectures, due to the higher accuracy compared to single branch architectures under the same computation budget. With recent advances in high-performance hardware, single operator can no longer fully utilize the available parallelism. However, to perform CNN computations on a modern GPU, existing deep learning frameworks focus on optimizing intra-operator parallelization, resulting in a large gap between the peak performance and the real performance. This performance gap is more severe under smaller batch sizes. In this work, we extensively study the parallelism between DL operators and propose Inter-Operator Scheduler (IOS) to automatically schedule the execution of multiple operators in parallel. IOS utilizes dynamic programming to find a scheduling policy specialized for the target hardware. IOS consistently outperforms state-of-the-art libraries (e.g., TensorRT) by 1.1 to 1.5x on modern CNN benchmarks.There is growing interest in multi-branch CNN architectures, due to the higher accuracy compared to single branch architectures under the same computation budget. With recent advances in high-performance hardware, single operator can no longer fully utilize the available parallelism. However, to perform CNN computations on a modern GPU, existing deep learning frameworks focus on optimizing intra-operator parallelization, resulting in a large gap between the peak performance and the real performa…
Account · 159 followers
Category · 10.8k presentations
The Conference on Machine Learning and Systems targets research at the intersection of machine learning and systems. The conference aims to elicit new connections amongst these fields, including identifying best practices and design principles for learning systems, as well as developing novel learning methods and theory tailored to practical machine learning workflows.
Professional recording and live streaming, delivered globally.
Presentations on similar topic, category or speaker
Kiwan Maeng, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Hongyi Wang, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%
Akhil Mathur, …
Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%