Research Topics in Knowledge Graph
Papers and Materials from All Areas
- Note 1: The papers from the database/data science communities are marked with 🌟.
- Note 2: In Nov 2023, I start to add a new session related to LLM in each topic. 🔥
0. Overview
- Survey of Knowledge Graphs - General Concepts, Scientific Problems, Applications, Challenges, and Directions) [link]
1. Knowledge Extraction and Integration to Construct KGs
-
Knowledge Base Construction (Demo or System) [link]
-
About Domain-Specific Knowledge Bases [link]
-
About Multi-Modal Knowledge Graph (MMKG) [link]
-
Named Entity Recognition, Entity Extraction and Entity Typing [link]
-
Coreference Resolution [link]
-
Entity Linking and Entity Disambiguation [link]
-
Entity Resolution, Entity Matching and Entity Alignment [link]
-
General Relation Extraction [link]
-
General Information Extraction and Open Information Extraction [link]
-
Relation Linking and Relation Disambiguation [link]
2. Mining and Refinement of KGs
-
Knowledge Graph Embedding, Learning, Reasoning, Rule Mining, and Path Finding [link]
-
Knowledge Base Refinement (Incompleteness, Incorrectness, and Freshness) [link]
-
Knowledge Fusion, Cleaning, Evaluation and Truth Discovery [link]
3. Applications Supported by KGs
-
Knowledge Graph Question Answering (KGQA) [link]
-
Knowledge Graph Recommendation [link]
-
Knowledge Graph Enhanced Machine Learning [link]
-
Knowledge Graphs and Large Language Models (LLMs) [link] 🔥🔥🔥
4. Schema and Query of KGs
- Knowledge Graph Representation (RDF and Property Graph), Schema and Query [link]
- Knowledge Graph Taxonomy Construction and Improvement [link] (TBD)
5. Others
Papers and Materials from the Database Communities
Note: Papers from SIGMOD/VLDB/ICDE/KDD/TKDE/VLDBJ
[2018] [2019] [2020] [2021] [2022] [2023] [2024]
Tutorials and Notes from Talented People
Tutorials, Discussion, Courses and Communities
- An introduction to knowledge graph and knowledge extraction from unstructured text. [Link]
- Information Extraction by Niranjan Balasubramanian {Slides in my Mac}
- CS 520 - Knowledge Graphs (seminar) - provided by Stanford
- OpenKG.cn
GitHub Repos that Summarize the Papers/Projects/Data related to Knowledge Graphs
- A Collection of KG Surveys, Papers (WWW+ACL+AAAI) and Data [GitHub]
- KG SOTA [GitHub]
- Awesome KG tutorials/papers/projects/communities [GitHub]
- Knowledge Graph Construction (from zero to everything, in Chinese) [GitHub]
- KG SOTA (Chinese) [Zhihu]
- Tracking Progress in Natural Language Processing [GitHub]
- KG Embedding SOTA [GitHub]
- Entity Related Papers [GitHub]
- Information Extraction Resources [GitHub]
- KGQA [Giters]
- Open-Environment Knowledge Graph Construction and Reasoning: Challenges, Approaches, and Opportunities [GitHub]
- KG-LLM-Papers [Link]
- Awesome LLM-KGs [Link]
Tutorials and Notes of Other Related Insightful Topics
- Probabilistic Graphical Models: Lagrangian Relaxation Algorithms for Natural Language Processing [Slides]
- Introduction to Conditional Random Fields [Blog]
- Network Community Detection: A Review and Visual Survey [Paper]
- Section 2.3. Community Detection Techniques
- Fast unfolding of communities in large networks [Paper]
- [A discussion of the Louvain method], [wiki of the Louvein Modularity]
- How do they design the function Q: Finding and evaluating community structure in networks [Paper]
- A compendium of NP optimization problems [Paper]
- [Notes about LSH]
- [Survey about Min Hash Sketch]
- MinHash Tutorial with Python Code: [Notes] [Code]
- Must-read papers on GNN [GitHub]
- Graph-based deap learning literatures [GitHub]
- Data Management for Machine Learning Applications [Course site]
- Stanford CS224W: Machine Learning with Graphs [Course site]
- Explainability for Natural Language Processing (AAAI 2020 tutorial) [Link] [Video]
- Graph Mining & Learning (Neurips 2020 tutorial) [Link]
- Discussion about GNN (Chinese) [Link]
- Stanford CS224n: Natural Language Processing with Deep Learning [Course site]
- Clique Relaxation Models in Networks: Theory, Algorithms, and Applications [Slides]
- KG Applications in Baidu (Chinese) [Link]
- Paper Digest (Database area) [Link]
- Complex Network (Collection of Notes and Tutorials) [GitHub]
- INTRODUCTION TO GENERATIVE AI (NTU, Prof HUNG-YI LEE) [Syllabus and Course Materials of Spring 2024]
- A very good and brief overview of Genearative AI! After listening to this course, you are able to clarify the important concepts in generative AI. For example, you can at least tell the difference between prompt engineering and fine-tuning (I think this is a very common mistake made by most people and even papers XD)
Useful Tools or APIs
Named Entity Recogntion and Entity Linking
- TagMe [Python API] [API] [GitHub1] [GitHub2]
- Stanford NER [Link]
- DBpedia Spotlight [Link]
- NLTK Tagger [Link]
- SpaCy [Link1] [Link2]
- spacy-llm [Link]
- EARL (including Relation Linking) [Link]
- Falcon (including Relatoin Linking) [DBpedia version] [Wikidata version]
- MonkeyLearn [Link]
- GERBIL - General Entity Annotator Benchmark [Link]
- PIKES [Link]
Benchmark Datasets
- Entity Disambiguation:
- MSNBC and ACE2004 [Link]
- QA:
- WebQuestions
- QA datasets summary [GitHub]
- Entity Resolution [GitHub]
- KGE, KBC and KG Reasoning
Other Useflul Tools
- From Freebase to Wikidata: The Great Migration [Paper and useful links]
- SPARQL tutorial [Link]
- Installing and running ElasticSearch [Link]
- Open KG on COVID-19 [Link]
- BOOKNLP [Link] (Pronominal Coreference Resolution, a natural language processing pipeline that scales to books and other long documents (in English))
- Wikidata Integrator [GitHub]
- OpenTapioca [Link]
- Grakn KGLIB (Knowledge Graph Library) [GitHub]
- SPASQL server on Freebase