THE SORREK APPROACH
Standard definition
Why this fails
Actual duplicates that don’t look the same are overlooked (changed names or abbreviations).
Non-duplicates that look the same are flagged.
Sorrek's definition
Benefits of our approach
We uncover all potential duplicates
More accuracy. No more false positives.
Entity-based resolution solves more than duplicates: we update names, flag defunct companies, highlight acquisition activity and link related entities.
ENTITY RESOLUTION IN PRACTICE
Our process
Crawl a global index of domains
Did the site load? Did it redirect somewhere else?
Parse to extract company data
What is the company name?
Use AI to fill-in the gaps
Did this company change its name? Was it acquired?
Refresh our source of truth database
Index updated with current information