Speaker-Sensitive Dual Memory Networks for Multi-Turn Slot Tagging

Young-Bum Kim; Sungjin Lee; Ruhi Sarikaya

Speaker-Sensitive Dual Memory Networks for Multi-Turn Slot Tagging

Young-Bum Kim ,
Sungjin Lee ,
Ruhi Sarikaya

2017 IEEE Automatic Speech Recognition and Understanding Workshop | December 2017

Download BibTex

In multi-turn dialogs, natural language understanding models can introduce obvious errors by being blind to contextual information. To incorporate dialog history, we present a neural architecture with Speaker-Sensitive Dual Memory Networks which encode utterances differently depending on the speaker. This addresses the different extents of information available to the system – the system knows only the surface form of user utterances while it has the exact semantics of system output. We performed experiments on real user data from Microsoft Cortana, a commercial personal assistant. The result showed a significant performance improvement over the state-of-the-art slot tagging models using contextual information.