Speaker-Sensitive Dual Memory Networks for Multi-Turn Slot Tagging

  • Young-Bum Kim ,
  • Sungjin Lee ,
  • Ruhi Sarikaya

2017 IEEE Automatic Speech Recognition and Understanding Workshop |

In multi-turn dialogs, natural language understanding models can introduce obvious errors by being blind to contextual information. To incorporate dialog history, we present a neural architecture with Speaker-Sensitive Dual Memory Networks which encode utterances differently depending on the speaker. This addresses the different extents of information available to the system – the system knows only the surface form of user utterances while it has the exact semantics of system output. We performed experiments on real user data from Microsoft Cortana, a commercial personal assistant. The result showed a significant performance improvement over the state-of-the-art slot tagging models using contextual information.