Paper-Reading-ConvAI
A paper reading list for Conversational AI, mainly covering 💬 dialogue systems and 📝 natural language generation. This repository is updated continually 🤗 ...
- Deep Learning in NLP
- Dialogue Systems
- Natural Language Generation
Deep Learning in NLP
- iNLP: "Interactive Natural Language Processing". arXiv(2023) [paper] :star::star::star::star:
- Data Augmentation: "A Survey of Data Augmentation Approaches for NLP". ACL-Findings(2021) [paper]
- Prompting: "Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing". arXiv(2021) [paper] :star::star::star::star::star:
- NLP World Scope: "Experience Grounds Language". EMNLP(2020) [paper] :star::star::star::star::star:
- Transformer-XL: "Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context". ACL(2019) [paper] [code]
- Transformer: "Attention is All you Need". NeurIPS(2017) [paper] [code-official] [code-tf] [code-py] :star::star::star::star::star:
- VAE: "An Introduction to Variational Autoencoders". arXiv(2019) [paper]
- Survey on Attention: "An Introductory Survey on Attention Mechanisms in NLP Problems". arXiv(2018) [paper] :star::star::star::star::star:
- Additive Attention: "Neural Machine Translation by Jointly Learning to Align and Translate". ICLR(2015) [paper]
- Multiplicative Attention: "Effective Approaches to Attention-based Neural Machine Translation". EMNLP(2015) [paper]
- Memory Net: "End-To-End Memory Networks". NeurIPS(2015) [paper]
- Copy Mechanism (PGN): "Get To The Point: Summarization with Pointer-Generator Networks". ACL(2017) [paper] [code] :star::star::star::star::star:
- Copy Mechanism: "Incorporating Copying Mechanism in Sequence-to-Sequence Learning". ACL(2016) [paper]
- ELMo: "Deep contextualized word representations". NAACL(2018) [paper] [code]
- GloVe: "GloVe: Global Vectors for Word Representation". EMNLP(2014) [paper] [code]
- Word2Vec Tutorial: "word2vec Parameter Learning Explained". arXiv(2016) [paper] :star::star::star::star::star:
- Multi-task Learning: "An Overview of Multi-Task Learning in Deep Neural Networks". arXiv(2017) [paper]
- Gradient Descent: "An Overview of Gradient Descent Optimization Algorithms". arXiv(2016) [paper] :star::star::star::star::star:
Dialogue Systems
Survey on Dialogue
- Data Generation: "A Survey on Recent Advances in Conversational Data Generation". arXiv(2024) [paper]
- Proactive Dialogue: "A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects". IJCAI(2023) [paper]
- Responsible Dialogue: "Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey". arXiv(2023) [paper]
- Negotiation Dialogue: "Let's Negotiate! A Survey of Negotiation Dialogue Systems". arXiv(2022) [paper]
- DL-based Dialogue: "Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey". arXiv(2021) [paper] :star::star::star::star:
- Open-domain Dialogue: "Challenges in Building Intelligent Open-domain Dialog Systems". TOIS(2020) [paper]
- Dialogue Systems: "A Survey on Dialogue Systems: Recent Advances and New Frontiers". SIGKDD Explorations(2017) [paper]
- Dialogue Corpora: "A Survey of Available Corpora For Building Data-Driven Dialogue Systems". arXiv(2017) [paper] [data]
Conversational LLMs
- Parrot: "Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions". arXiv(2023) [paper]
- MemoChat: "MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation". arXiv(2023) [paper]
- Llama 2-Chat: "Llama 2: Open Foundation and Fine-Tuned Chat Models". Meta(2023) [paper] [code]
- ChatGLM3: "ChatGLM3 Series: Open Bilingual Chat LLMs". Tsinghua(2023) [code]
- ChatGLM2-6B: "ChatGLM2-6B: An Open Bilingual Chat LLM". Tsinghua(2023) [code]
- MPC: "Prompted LLMs as Chatbot Modules for Long Open-domain Conversation". ACL-Findings(2023) [paper] [code]
- MemoryBank-SiliconFriend: "MemoryBank: Enhancing Large Language Models with Long-Term Memory". arXiv(2023) [paper] [code]
- UltraChat: "Enhancing Chat Language Models by Scaling High-quality Instructional Conversations". arXiv(2023) [paper] [data]
- ChatAlpaca: "ChatAlpaca: A Multi-Turn Dialogue Corpus based on Alpaca Instructions". Github(2023) [data]
- Phoenix: "Phoenix: Democratizing ChatGPT across Languages". arXiv(2023) [paper] [code]
- Dolly: "Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM". Databricks(2023) [code]
- Baize: "Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data". arXiv(2023) [paper] [code]
- Vicuna: "Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90% ChatGPT Quality". LMSYS Org(2023) [Blog] [code]
- Koala: "Koala: A Dialogue Model for Academic Research". UC Berkeley(2023) [Blog] [code]
- BELLE: "BELLE: Be Everyone's Large Language model Engine". LianjiaTech(2023) [code]
- Alpaca: "Alpaca: A Strong, Replicable Instruction-Following Model". Stanford(2023) [Blog] [code] [alpaca-lora]
- ChatGLM-6B: "An Open Bilingual Dialogue Language Model". Tsinghua(2023) [code]
- Open-Assistant: "Open Assistant: Conversational AI for everyone". Github(2023) [project] [code]
- ChatGPT: "ChatGPT: Optimizing Language Models for Dialogue". OpenAI(2022) [Blog] :star::star::star::star::star:
- Sparrow: "Improving alignment of dialogue agents via targeted human judgements". arXiv(2022) [paper] [data]
- BlenderBot3: "BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage". arXiv(2022) [paper]
- LaMDA: "LaMDA: Language Models for Dialog Applications". arXiv(2022) [paper]
- GODEL: "GODEL: Large-Scale Pre-Training for Goal-Directed Dialog". arXiv(2022) [paper] [code]
- Anthropic Assistant-v2: "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback". arXiv(2022) [paper]
- Anthropic Assistant: "A General Language Assistant as a Laboratory for Alignment". arXiv(2021) [paper]
Multimodal Dialogue
Situated and Embodied Dialogue
- SLL: "Large Language Model based Situational Dialogues for Second Language Learning". arXiv(2024) [paper]
- Emb-Plan: "Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue". EMNLP(2023) [paper]
- WTaG: "Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?". EMNLP-Findings(2023) [paper] [code]
- SIMMC-VR: "SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams". ACL(2023) [paper] :star::star::star::star:
- SURE: "Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark". ACL(2023) [paper] [data]
- SUGAR: "A Textual Dataset for Situated Proactive Response Selection". ACL(2023) [paper] [data]
- MindDial: "MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation". arXiv(2023) [paper]
- HoloAssist: "HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World". ICCV(2023) [paper] [data] :star::star::star::star:
- Collab: "Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue". IJCAI(2023) [paper] [code]
- Alexa Arena: "Alexa