A Short Survey of Pre-trained Language Models for Conversational AI-A New Age in NLP

@article{Zaib2020ASS,
  title   = {A Short Survey of Pre-trained Language Models for Conversational AI-A New Age in NLP},
  author  = {Munazza Zaib and Quan Z. Sheng and W. Zhang},
  journal = {Proceedings of the Australasian Computer Science Week Multiconference},
  year    = {2020},
  url     = {https://api.semanticscholar.org/CorpusID:211040895}
}
This paper intends to establish whether pre-trained language models can address the challenges pertinent to dialogue systems, and how their architecture could be exploited to overcome these challenges.


78 Citations

Advances in Multi-turn Dialogue Comprehension: A Survey

This survey summarizes the characteristics and challenges of dialogue comprehension in contrast to plain-text reading comprehension, and categorizes the dialogue-related pre-training techniques employed to enhance PrLMs in dialogue scenarios.

Technical Review on Knowledge Intensive NLP for Pre-trained Language Development

The present progress of pre-trained language model-based knowledge-enhanced models (PLMKEs) is described by deconstructing their three key elements: information sources, knowledge-intensive NLP tasks, and knowledge fusion methods.

Conversational question answering: a survey

There has been a trend shift from single-turn to multi-turn QA, which empowers the field of Conversational AI from different perspectives; this survey is intended to provide an epitome for the research community, with the hope of laying a strong foundation for the field of CQA.

Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models

Proposes an LSTM-based autoregressive language model that uses prefix embeddings via fusion (e.g., concatenation) to obtain a richer context representation for language modelling, and finds that fusion reliably lowers perplexity; a minimal sketch of this idea follows.
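
The snippet below is a minimal sketch (not the authors' code) of concatenation-based fusion: a pre-computed sentence or prefix embedding is concatenated with each token embedding before the LSTM. Dimensions, names, and the fusion point are illustrative assumptions.

```python
import torch
import torch.nn as nn

class FusionLSTMLM(nn.Module):
    def __init__(self, vocab_size, emb_dim=256, sent_dim=384, hidden_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # The LSTM consumes the token embedding concatenated with the
        # sentence/prefix embedding at every time step (concatenation fusion).
        self.lstm = nn.LSTM(emb_dim + sent_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, token_ids, sent_emb):
        # token_ids: (batch, seq_len); sent_emb: (batch, sent_dim)
        tok = self.embed(token_ids)                              # (B, T, emb_dim)
        ctx = sent_emb.unsqueeze(1).expand(-1, tok.size(1), -1)  # broadcast context
        fused = torch.cat([tok, ctx], dim=-1)                    # fuse by concatenation
        hidden, _ = self.lstm(fused)
        return self.out(hidden)                                  # next-token logits

# Usage: sent_emb could come from any frozen sentence encoder.
model = FusionLSTMLM(vocab_size=10000)
logits = model(torch.randint(0, 10000, (2, 12)), torch.randn(2, 384))
```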

Towards a Universal NLG for Dialogue Systems and Simulators with Future Bridging

A prototype FBNLG is evaluated to show that future bridging can be a viable approach to a universal few-shot NLG for task-oriented and chit-chat dialogues.

Keeping the Questions Conversational: Using Structured Representations to Resolve Dependency in Conversational Question Answering

A novel framework, CONVSR (CONVQA using Structured Representations) is proposed for capturing and generating intermediate representations as conversational cues to enhance the capability of the QA model to better interpret the incomplete questions.

Towards End-to-End Open Conversational Machine Reading

This work models OR-CMR as a unified text-to-text task in a fully end-to-end style and shows the effectiveness of the proposed end-to-end framework on both sub-tasks by a large margin, achieving new state-of-the-art results.

Pretrained Language Models for Text Generation: A Survey

This paper presents an overview of the major advances achieved in the topic of pretrained language models for text generation, and discusses how to adapt existing PLMs to model different input data and satisfy special properties in the generated text.

Pre-Trained Language Models for Text Generation: A Survey

A survey of the utilization of PLMs in text generation, intended to help researchers interested in text generation problems learn the core concepts, main techniques, and latest developments in this area.

Effective Sequence-to-Sequence Dialogue State Tracking

It is demonstrated that the choice of pre-training objective makes a significant difference to the state tracking quality and it is found that masked span prediction is more effective than auto-regressive language modeling for dialogue state tracking.
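
As a rough illustration of the sequence-to-sequence formulation, the sketch below serializes a dialogue history into a source string and the dialogue state into a target string, trained with a span-prediction pre-trained model (T5). The serialization format and checkpoint are assumptions for the example, not the paper's exact setup.

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Dialogue history flattened into a single source string.
history = ("user: I need a cheap hotel in the north. "
           "system: Okay, for how many nights? "
           "user: Three nights, starting Friday.")
# Target: the dialogue state rendered as slot=value pairs (illustrative format).
target = "hotel-price=cheap; hotel-area=north; hotel-stay=3; hotel-day=friday"

inputs = tokenizer(history, return_tensors="pt")
labels = tokenizer(target, return_tensors="pt").input_ids

# Training step: standard cross-entropy over the serialized state string.
loss = model(input_ids=inputs.input_ids,
             attention_mask=inputs.attention_mask,
             labels=labels).loss

# Inference: decode the predicted state string.
pred = model.generate(inputs.input_ids, max_length=64)
print(tokenizer.decode(pred[0], skip_special_tokens=True))
```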
...

20 References

Language Models are Unsupervised Multitask Learners

It is demonstrated that language models begin to learn these tasks without any explicit supervision when trained on a new dataset of millions of webpages called WebText, suggesting a promising path towards building language processing systems which learn to perform tasks from their naturally occurring demonstrations.

Hello, It’s GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

This paper proposes a task-oriented dialogue model that operates solely on text input: it effectively bypasses explicit policy and language generation modules and holds promise to mitigate the data scarcity problem, and to support the construction of more engaging and more eloquent task-oriented conversational agents.

Improving Language Understanding by Generative Pre-Training

The general task-agnostic model outperforms discriminatively trained models that use architectures specifically crafted for each task, improving upon the state of the art in 9 out of the 12 tasks studied.

Deep Contextualized Word Representations

A new type of deep contextualized word representation is introduced that models both complex characteristics of word use and how these uses vary across linguistic contexts, allowing downstream models to mix different types of semi-supervision signals.

QuAC: Question Answering in Context

QuAC introduces challenges not found in existing machine comprehension datasets: its questions are often more open-ended, unanswerable, or only meaningful within the dialog context, as shown in a detailed qualitative evaluation.

MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

The Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a fully-labeled collection of human-human written conversations spanning multiple domains and topics, is introduced; at 10k dialogues, it is at least one order of magnitude larger than all previous annotated task-oriented corpora.

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
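
The "one additional output layer" recipe is easy to picture with a small, hedged sketch: a classification head on top of pre-trained BERT via the Hugging Face Transformers API. The checkpoint, label count, and example input are assumptions chosen for illustration.

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
# BertForSequenceClassification adds a single linear output layer on top of
# the pre-trained bidirectional encoder.
model = BertForSequenceClassification.from_pretrained("bert-base-uncased",
                                                      num_labels=2)

inputs = tokenizer("Is this utterance a question?", return_tensors="pt")
labels = torch.tensor([1])

# Forward pass returns the classification loss and logits; all BERT layers
# plus the new output layer are updated during fine-tuning.
outputs = model(**inputs, labels=labels)
outputs.loss.backward()
print(outputs.logits)
```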

CoQA: A Conversational Question Answering Challenge

CoQA is introduced, a novel dataset for building Conversational Question Answering systems and it is shown that conversational questions have challenging phenomena not present in existing reading comprehension datasets (e.g., coreference and pragmatic reasoning).

Semantics-aware BERT for Language Understanding

This work proposes to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduces an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone.

A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension

A simple but effective method for conversational machine comprehension (CMC) that uses BERT to encode the paragraph independently conditioned on each question and each answer in the multi-turn context, finding that the gold answer history contributes most to model performance on both datasets; a sketch of this conditioning follows.
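
The sketch below shows one common way to condition BERT on multi-turn history for CMC: previous question/answer pairs are prepended to the current question, and the (history+question, paragraph) pair is encoded jointly. The marker-free flattening and truncation settings are assumptions, not necessarily the paper's exact input format.

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

paragraph = "The Amazon is the largest rainforest on Earth ..."
history = [("Where is it located?", "In South America."),
           ("How large is it?", "About 5.5 million square kilometres.")]
question = "What countries does it span?"

# Flatten the dialogue history into the question segment.
history_text = " ".join(q + " " + a for q, a in history)
encoded = tokenizer(history_text + " " + question, paragraph,
                    truncation=True, max_length=384, return_tensors="pt")

# encoded.input_ids now holds [CLS] history+question [SEP] paragraph [SEP],
# ready for a BERT span-prediction head over the paragraph tokens.
print(encoded.input_ids.shape)
```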
