Next
Livestream will start soon!
Livestream has already ended.
Presentation has not been recorded yet!
  • title: FELM: Benchmarking Factuality Evaluation of Large Language Models
      0:00 / 0:00
      • Report Issue
      • Settings
      • Playlists
      • Bookmarks
      • Subtitles Off
      • Playback rate
      • Quality
      • Settings
      • Debug information
      • Server sl-yoda-v2-stream-004-alpha.b-cdn.net
      • Subtitles size Medium
      • Bookmarks
      • Server
      • sl-yoda-v2-stream-004-alpha.b-cdn.net
      • sl-yoda-v2-stream-004-beta.b-cdn.net
      • 1685195716.rsc.cdn77.org
      • 1239898752.rsc.cdn77.org
      • Subtitles
      • Off
      • English
      • Playback rate
      • Quality
      • Subtitles size
      • Large
      • Medium
      • Small
      • Mode
      • Video Slideshow
      • Audio Slideshow
      • Slideshow
      • Video
      My playlists
        Bookmarks
          00:00:00
            FELM: Benchmarking Factuality Evaluation of Large Language Models
            • Settings
            • Sync diff
            • Quality
            • Settings
            • Server
            • Quality
            • Server

            FELM: Benchmarking Factuality Evaluation of Large Language Models

            Dec 10, 2023

            Speakers

            SC

            Shiqi Chen

            Speaker · 0 followers

            YZ

            Yiran Zhao

            Speaker · 0 followers

            JZ

            Jinghan Zhang

            Speaker · 0 followers

            About

            Assessing factuality of text generated by large language models (LLMs) is an emerging yet crucial research area, aimed at alerting users to potential errors and guiding the development of more reliable LLMs. Nonetheless, the evaluators assessing factuality necessitate suitable evaluation themselves to gauge progress and foster advancements. This direction remains under-explored, resulting in substantial impediments to the progress of factuality evaluators. To mitigate this issue, we introduce a…

            Organizer

            N2
            N2

            NeurIPS 2023

            Account · 645 followers

            Like the format? Trust SlidesLive to capture your next event!

            Professional recording and live streaming, delivered globally.

            Sharing

            Recommended Videos

            Presentations on similar topic, category or speaker

            Agnostically Learning Single-Index Models using Omnipredictors
            04:43

            Agnostically Learning Single-Index Models using Omnipredictors

            Aravind Gollakota, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Loss Dynamics of Temporal Difference Reinforcement Learning
            04:27

            Loss Dynamics of Temporal Difference Reinforcement Learning

            Blake Bordelon, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Norm-guided latent space exploration for text-to-image generation
            04:27

            Norm-guided latent space exploration for text-to-image generation

            Dvir Samuel, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes
            04:31

            The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes

            David Recasens, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            On the Role of Randomization in Adversarially Robust Classification
            04:30

            On the Role of Randomization in Adversarially Robust Classification

            Lucas Gnecco Heredia, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Exploring Loss Functions for Time-based Training Strategy in Spiking Neural Networks
            05:12

            Exploring Loss Functions for Time-based Training Strategy in Spiking Neural Networks

            Yaoyu Zhu, …

            N2
            N2
            NeurIPS 2023 16 months ago

            Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

            Interested in talks like this? Follow NeurIPS 2023