Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior

by · Nov 16, 2020 · 27 views ·

EMNLP 2020