Jul 24, 2023
Speaker · 1 follower
Speaker · 1 follower
Speaker · 0 followers
Speaker · 0 followers
The goal of data attribution is to trace model predictions back to the training data. Prior approaches to this task exhibit a strict tradeoff between computational demand and efficacy—e.g., methods that are effective for deep neural networks require training thousands of models, making them impractical for large models or datasets.In this work, we introduce TRAK (Tracing by Rewinding the After Kernel), a data attribution method for overparameterized models that brings us closer to the best of both worlds. Using only a handful of model checkpoints, TRAK matches the performance of attribution methods that use thousands of trained models, reducing costs by up to three orders of magnitude. We demonstrate the utility of TRAK by applying it to a variety of large-scale settings: to study CLIP models; to study large language models (MT5-small); and to accelerate model comparison algorithms.The goal of data attribution is to trace model predictions back to the training data. Prior approaches to this task exhibit a strict tradeoff between computational demand and efficacy—e.g., methods that are effective for deep neural networks require training thousands of models, making them impractical for large models or datasets.In this work, we introduce TRAK (Tracing by Rewinding the After Kernel), a data attribution method for overparameterized models that brings us closer to the best of bo…
Professional recording and live streaming, delivered globally.
Presentations on similar topic, category or speaker
Zhiqi Bu, …
Jakwang Kim, …
Qi Zhang, …
Ziqiao Wang, …