Mayee F. Chen, Nicholas Roberts, Kush Bhatia, Jue Wang, Ce Zhang, Fred Scala, Christopher Ré · Skill-it! A data-driven skills framework for understanding and training language models · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: Skill-it! A data-driven skills framework for understanding and training language models

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-008-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-008-alpha.b-cdn.net
sl-yoda-v2-stream-008-beta.b-cdn.net
1159783934.rsc.cdn77.org
1511376917.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

Skill-it! A data-driven skills framework for understanding and training language models

Skill-it! A data-driven skills framework for understanding and training language models

10. prosince 2023

Řečníci

Mayee F. Chen

Řečník · 0 sledujících

Nicholas Roberts

Řečník · 1 sledující

Kush Bhatia

Řečník · 0 sledujících

O prezentaci

The quality of training data impacts the performance of pre-trained large language models (LMs). Given a fixed budget of tokens, it is unclear what data to best select for the model’s performance across tasks. To study this, we develop a new framework based on a simple hypothesis: similar to how humans acquire interdependent skills in a deliberate order, there exists a natural order in how the LM best learns a set of skills from its training data. If such order exists, it can be exploited for im…

Organizátor

NeurIPS 2023

Účet · 646 sledujících

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Guiding Large Language Models via Directional Stimulus Prompting

04:59

Guiding Large Language Models via Directional Stimulus Prompting

Zhlédnout později

Oblíbené

NeurIPS 2023 16 months ago

Divergence at the Interpolation Threshold: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

05:07

Divergence at the Interpolation Threshold: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

Zhlédnout později

Oblíbené

Rylan Schaeffer, …

NeurIPS 2023 16 months ago

SmoothHess: ReLU Network Feature Interactions via Stein's Lemma

05:04

SmoothHess: ReLU Network Feature Interactions via Stein's Lemma

Zhlédnout později

Oblíbené

NeurIPS 2023 16 months ago

Datasets and Benchmarks for Nanophotonic Structure and Parametric Design Simulations

04:55

Datasets and Benchmarks for Nanophotonic Structure and Parametric Design Simulations

Zhlédnout později

Oblíbené

Jungtaek Kim, …

NeurIPS 2023 16 months ago

MiliPoint: A Point Cloud Dataset for mmWave Radar

03:46

MiliPoint: A Point Cloud Dataset for mmWave Radar

Zhlédnout později

Oblíbené

NeurIPS 2023 16 months ago

46:14

Round Table

Zhlédnout později

Oblíbené

Donato Crisostomi, …

NeurIPS 2023 16 months ago