Nikhil Kandpal, Haikang Deng, Adam Roberts, Eric Wallace, Colin Raffel · Large Language Models Struggle to Learn Long-Tail Knowledge · SlidesLive

Large Language Models Struggle to Learn Long-Tail Knowledge

Jul 24, 2023

Speakers

Nikhil Kandpal

Haikang Deng

Adam Roberts

About

The internet contains a wealth of knowledge—from the birthdays of historical figures to tutorials on how to code—all of which may be learned by language models. However, there is a huge variability in the number of times a piece of information appears on the web. In this paper, we study the relationship between the knowledge memorized by large language models and the information in their pre-training datasets. In particular, we show that a language model's ability to answer a fact-based question…

Organizer

ICML 2023
