Local-to-global learning for iterative training of production SLU models on new features

Yulia Grishina; Daniil Sorokin

2022

In production spoken language understanding (SLU) systems, new training data becomes available over time, so ML models need to be updated on a regular basis. Specifically, releasing new features adds new classes of data while the old data remains constant. However, retraining the full model from scratch for each release is computationally expensive. To address this problem, we propose to view production releases from a curriculum learning perspective and to adapt the local-to-global learning (LGL) schedule (Cheng et al., 2019) for a neural model that starts with fewer output classes and adds more classes with each iteration. We report experiments for the tasks of intent classification and slot filling in the context of a production voice assistant. First, we apply the original LGL schedule to our data; we then adapt LGL to the production setting, where the full data is not available at the initial training iterations. We demonstrate that our method reduces model error rates by 7.3% and saves up to 25% of training time for individual iterations.
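The idea of a model that "starts with fewer output classes and adds more classes with each iteration" can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the output layer is grown, so the warm-start strategy below (copy the existing class weights, append small randomly initialized rows for the new classes) is an assumption, and the names `expand_output_layer` and `logits`, the layer sizes, and the release schedule are all hypothetical.

```python
import random


def expand_output_layer(weights, biases, num_new_classes, seed=0, scale=0.01):
    """Return a copy of the output layer with rows appended for new classes.

    weights: one weight vector (list of floats) per existing class.
    biases:  one bias per existing class.
    Existing rows are copied unchanged (assumed warm start); new rows get
    small random weights and a zero bias.
    """
    rng = random.Random(seed)
    hidden_size = len(weights[0])
    new_weights = [row[:] for row in weights]
    new_biases = biases[:]
    for _ in range(num_new_classes):
        new_weights.append([rng.gauss(0.0, scale) for _ in range(hidden_size)])
        new_biases.append(0.0)
    return new_weights, new_biases


def logits(weights, biases, features):
    """Compute one unnormalized score per class for a feature vector."""
    return [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, biases)
    ]


# Hypothetical schedule: 2 intents at first, then releases adding 1 and 2 more.
weights = [[0.5, -0.2, 0.1], [-0.3, 0.4, 0.0]]
biases = [0.1, -0.1]
features = [1.0, 2.0, 3.0]
old_logits = logits(weights, biases, features)

for added_classes in (1, 2):
    weights, biases = expand_output_layer(weights, biases, added_classes)
    # In a real setup, the model would be fine-tuned here on the data
    # available at this release; that step is omitted from the sketch.

new_logits = logits(weights, biases, features)
```

Because the existing rows are copied unchanged, the scores for the original classes are identical before and after expansion, so each iteration can continue from the previous model rather than restart from scratch.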
