Entity Matching Theory

Background: This blog post discusses the reasoning process I went through while addressing a particular problem in entity matching. Although libraries exist to address this task, my focus here is on the conceptual questions that arise during problem solving. Entity Ambiguity: In principle, entity matching should not involve ambiguity. Ambiguity may occur in relations (triplets), but not in the entities themselves. The following discussion focuses on the notion of semantic ambiguity. ...

September 2, 2025 · 5 min · 973 words · Anton

Entity Matching: Solving the Problem of Duplicate Entities in Knowledge Graphs

In this post, I'll walk through what entity matching is, why it matters in knowledge graphs, and how I designed an efficient solution, available on my GitHub repo. Note that the repo has major known issues (it cannot handle a large number of entities), so the ideas below are for reference only. Background: When working with knowledge graphs (KGs), one of the biggest challenges is that the same entity can appear under different names. ...

September 1, 2025 · 5 min · 892 words · Anton