Peter Shaw, Mandar Joshi, James Cohan, Jonathan Berant, Panupong Pasupat, Hexiang Hu, Urvashi Khandelwal, Kenton Lee, Kristina Toutanova · From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

Dec 10, 2023

Speakers

Peter Shaw

Mandar Joshi

James Cohan

About

Much of the previous work towards digital agents for graphical user interfaces (GUIs) has relied on text-based representations (derived from HTML or other structured data sources), which are not always readily available. These input representations have often been coupled with custom, task-specific action spaces. This paper focuses on creating agents that interact with the digital world using the same conceptual interface that humans commonly use, via pixel-based screenshots and a generic actio…

Organizer

NeurIPS 2023
