Building complex ML pipelines to tackle business document understanding

May 29, 2022

Speakers

About

Milan Šulc, Rossum.ai Petr Baudiš, Rossum.ai Solving real-world problems at scale often requires more than direct application of straightforward ML models. Let's journey together through architecting a complex ML pipeline and show how a challenging high-level task can be decomposed into a series of trainable sub-tasks while not compromising on a pure machine learning approach. We will demonstrate this on the problem of document information extraction that we are solving at Rossum, and look at how it can be decomposed to (still hard, but attackable) sub-tasks such as named entity (field) localisation, tables recognition, key-value detection, few-shot learning via similar document retrieval, and, of course, OCR. And perhaps we will manage to show why building an AI system capable of understanding document content is so much more than

Organizer

Categories

About Machine Learning Prague

Machines can learn. Incredibly fast. Faster than you. They are getting smarter and smarter every day. They are already changing your world, your business and your life. Artificial intelligence revolution is here. Come and learn how to turn this threat into your biggest opportunity. This is not another academic conference. Our goal is to foster discussion between machine learning practitioners and all people who are interested in applications of modern trends in artificial intelligence. You can look forward to inspiring people, algorithms, data, applications, workshops and a lot of fun during three days as well as at two great parties.

Store presentation

Should this presentation be stored for 1000 years?

How do we store presentations

Total of 0 viewers voted for saving the presentation to eternal vault which is 0.0%

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Interested in talks like this? Follow Machine Learning Prague