Paper-Reading-ConvAI

Paper reading list in Conversational AI, mainly encompassing 💬 dialogue systems and 📝 natural language generation. This repository is constantly updating 🤗 ...

Deep Learning in NLP
Dialogue Systems
Natural Language Generation

Deep Learning in NLP

iNLP: "Interactive Natural Language Processing". arXiv(2023) [paper] :star::star::star::star:
Data Augmentation: "A Survey of Data Augmentation Approaches for NLP". ACL-Findings(2021) [paper]
Prompting: "Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing". arXiv(2021) [paper] :star::star::star::star::star:
NLP World Scope: "Experience Grounds Language". EMNLP(2020) [paper] :star::star::star::star::star:
Transformer-XL: "Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context". ACL(2019) [paper] [code]
Transformer: "Attention is All you Need". NeurIPS(2017) [paper] [code-official] [code-tf] [code-py] :star::star::star::star::star:
VAE: "An Introduction to Variational Autoencoders". arXiv(2019) [paper]
Survey on Attention: "An Introductory Survey on Attention Mechanisms in NLP Problems". arXiv(2018) [paper] :star::star::star::star::star:
Additive Attention: "Neural Machine Translation by Jointly Learning to Align and Translate". ICLR(2015) [paper]
Multiplicative Attention: "Effective Approaches to Attention-based Neural Machine Translation". EMNLP(2015) [paper]
Memory Net: "End-To-End Memory Networks". NeurIPS(2015) [paper]
Copy Mechanism (PGN): "Get To The Point: Summarization with Pointer-Generator Networks". ACL(2017) [paper] [code] :star::star::star::star::star:
Copy Mechanism: "Incorporating Copying Mechanism in Sequence-to-Sequence Learning". ACL(2016) [paper]
ELMo: "Deep contextualized word representations". NAACL(2018) [paper] [code]
Glove: "GloVe: Global Vectors for Word Representation". EMNLP(2014) [paper] [code]
Word2Vec Tutorial: "word2vec Parameter Learning Explained". arXiv(2016) [paper] :star::star::star::star::star:
Multi-task Learning: "An Overview of Multi-Task Learning in Deep Neural Networks". arXiv(2017) [paper]
Gradient Descent: "An Overview of Gradient Descent Optimization Algorithms". arXiv(2016) [paper] :star::star::star::star::star:

👆 Back to Top

Dialogue Systems

Survey on Dialogue

Data Generation: "A Survey on Recent Advances in Conversational Data Generation". arXiv(2024) [paper]
Proactive Dialogue: "A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects". IJCAI(2023) [paper]
Responsible Dialogue: "Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey". arXiv(2023) [paper]
Negotiation Dialogue: "Let's Negotiate! A Survey of Negotiation Dialogue Systems". arXiv(2022) [paper]
DL-based Dialogue: "Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey". arXiv(2021) [paper] :star::star::star::star:
Open-domain Dialogue: "Challenges in Building Intelligent Open-domain Dialog Systems". TOIS(2020) [paper]
Dialogue Systems: "A Survey on Dialogue Systems: Recent Advances and New Frontiers". SIGKDD Explorations(2017) [paper]
Dialogue Corpora: "A Survey of Available Corpora For Building Data-Driven Dialogue Systems". arXiv(2017) [paper] [data]

👆 Back to Top

Conversational LLMs

Parrot: "Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions". arXiv(2023) [paper]
MemoChat: "MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation". arXiv(2023) [paper]
Llama 2-Chat: "Llama 2: Open Foundation and Fine-Tuned Chat Models". Meta(2023) [paper] [code]
ChatGLM3: "ChatGLM3 Series: Open Bilingual Chat LLMs". Tsinghua(2023) [code]
ChatGLM2-6B: "ChatGLM2-6B: An Open Bilingual Chat LLM". Tsinghua(2023) [code]
MPC: "Prompted LLMs as Chatbot Modules for Long Open-domain Conversation". ACL-Findings(2023) [paper] [code]
MemoryBank-SiliconFriend: "MemoryBank: Enhancing Large Language Models with Long-Term Memory". arXiv(2023) [paper] [code]
UltraChat: "Enhancing Chat Language Models by Scaling High-quality Instructional Conversations". arXiv(2023) [paper] [data]
ChatAlpaca: "ChatAlpaca: A Multi-Turn Dialogue Corpus based on Alpaca Instructions". Github(2023) [data]
Phoenix: "Phoenix: Democratizing ChatGPT across Languages". arXiv(2023) [paper] [code]
Dolly: "Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM". Databricks(2023) [code]
Baize: "Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data". arXiv(2023) [paper] [code]
Vicuna: "Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90% ChatGPT Quality". LMSYS Org(2023) [Blog] [code]
Koala: "Koala: A Dialogue Model for Academic Research". UC Berkeley(2023) [Blog] [code]
BELLE: "BELLE: Be Everyone's Large Language model Engine". LianjiaTech(2023) [code]
Alpaca: "Alpaca: A Strong, Replicable Instruction-Following Model". Stanford(2023) [Blog] [code] [alpaca-lora]
ChatGLM-6B: "An Open Bilingual Dialogue Language Model". Tsinghua(2023) [code]
Open-Assistant: "Open Assistant: Conversational AI for everyone". Github(2023) [project] [code]
ChatGPT: "ChatGPT: Optimizing Language Models for Dialogue". OpenAI(2022) [Blog] :star::star::star::star::star:
Sparrow: "Improving alignment of dialogue agents via targeted human judgements". arXiv(2022) [paper] [data]
BlenderBot3: "BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage". arXiv(2022) [paper]
LaMDA: "LaMDA: Language Models for Dialog Applications". arXiv(2022) [paper]
GODEL: "GODEL: Large-Scale Pre-Training for Goal-Directed Dialog". arXiv(2022) [paper] [code]
Anthropic Assistant-v2: "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback". arXiv(2022) [paper]
Anthropic Assistant: "A General Language Assistant as a Laboratory for Alignment". arXiv(2021) [paper]

👆 Back to Top

Multimodal Dialogue

Situated and Embodied Dialogue

SLL: "Large Language Model based Situational Dialogues for Second Language Learning". arXiv(2024) [paper]
Emb-Plan: "Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue". EMNLP(2023) [paper]
WTaG: "Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?". EMNLP-Findings(2023) [paper] [code]
SIMMC-VR: "SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams". ACL(2023) [paper] :star::star::star::star:
SURE: "Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark". ACL(2023) [paper] [data]
SUGAR: "A Textual Dataset for Situated Proactive Response Selection". ACL(2023) [paper] [data]
MindDial: "MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation". arXiv(2023) [paper]
HoloAssist: "HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World". ICCV(2023) [paper] [data] :star::star::star::star:
Collab: "Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue". IJCAI(2023) [paper] [code]
Alexa Arena: "Alexa