Dec 2, 2022
Dialogue state tracking (DST) is an important step in dialogue management to keep track of users' beliefs. Existing works fine-tune all language model (LM) parameters to tackle the DST task, which requires significant data and computing resources for training and hosting. The cost grows exponentially in the real-world deployment where dozens of fine-tuned LM are used for different domains and tasks. To develop domain-specific models that better utilize slot-related information with less training data and fewer parameters, we propose to use soft prompt tokens to learn task properties, incorporate segment information and reiterate the task before predicting value. Without tuning LM parameters, our method drastically reduces the number of parameters needed to less than 0.5
Professional recording and live streaming, delivered globally.
Presentations on similar topic, category or speaker