Different databases store data entries in different formats, so the same individual often appears as several entries across these databases, and those entries frequently disagree in the data they contain. Our objective is therefore to devise an efficient algorithm that resolves these entities by determining which entries correspond to the same real-world entity.
Sudipta Chattopadhyay
Bernard Tan Chee Seng
Entity Resolution is a technique that uses blocking and matching algorithms to identify data entries across multiple datasets that refer to the same real-world entity.
The blocking algorithm serves to reduce the search space by grouping similar entries together into blocks.
Matching is then performed only within the resulting blocks, rather than across the whole dataset, to find the entries that refer to the desired entity.
During blocking, relevant attributes of each entity are selected to create blocking keys, which are combinations of informative attribute values. Entities with identical or similar blocking keys are grouped into the same block for comparison during the matching phase.
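The blocking step described above can be sketched as follows. This is a minimal illustration, not the project's actual implementation: the attribute names (`surname`, `birth_year`), the key format, and the helper names are all assumptions chosen for the example, and only identical keys are grouped (similar keys are not handled).

```python
from collections import defaultdict
from itertools import combinations


def blocking_key(record):
    """Hypothetical blocking key: first three letters of the surname plus birth year."""
    surname = record["surname"].strip().lower()
    return f"{surname[:3]}|{record['birth_year']}"


def build_blocks(records):
    """Group record ids by blocking key; records sharing a key land in one block."""
    blocks = defaultdict(list)
    for rec in records:
        blocks[blocking_key(rec)].append(rec["id"])
    return dict(blocks)


def candidate_pairs(blocks):
    """Emit every pair of ids that share a block; only these go on to matching."""
    pairs = set()
    for ids in blocks.values():
        for a, b in combinations(sorted(ids), 2):
            pairs.add((a, b))
    return pairs


# Illustrative records (made up for this sketch).
records = [
    {"id": 1, "surname": "Tan", "birth_year": 1990},
    {"id": 2, "surname": "TAN ", "birth_year": 1990},
    {"id": 3, "surname": "Lim", "birth_year": 1985},
    {"id": 4, "surname": "Tang", "birth_year": 1990},
]

blocks = build_blocks(records)
pairs = candidate_pairs(blocks)
```

With these four records, comparing everything against everything would take six comparisons, whereas blocking leaves only the three pairs inside the shared block. The trade-off is that a key that is too strict can separate true matches into different blocks, which is why the choice of attributes matters.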
Matching assesses the similarity between the pairs of entities that the blocking process identifies as potential matches. Taking these candidate pairs as input, the matching model calculates a similarity score for each pair.
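As a rough sketch of how candidate pairs might be scored, the example below averages a per-attribute string similarity. It is an assumption-laden stand-in for the matching model, which the text does not specify: the attributes (`name`, `email`), the use of Python's `difflib.SequenceMatcher`, the simple averaging, and the 0.8 threshold are all illustrative choices.

```python
from difflib import SequenceMatcher


def field_similarity(a, b):
    """Similarity in [0, 1] between two strings, ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def pair_score(rec_a, rec_b, fields=("name", "email")):
    """Average the per-field similarities (hypothetical, unweighted scoring rule)."""
    scores = [field_similarity(str(rec_a[f]), str(rec_b[f])) for f in fields]
    return sum(scores) / len(scores)


def score_candidates(records, pairs, threshold=0.8):
    """Score each candidate pair and flag those at or above the assumed threshold."""
    results = []
    for a, b in sorted(pairs):
        score = pair_score(records[a], records[b])
        results.append((a, b, score, score >= threshold))
    return results


# Illustrative records keyed by id (made up for this sketch).
records = {
    1: {"name": "Jonathan Lee", "email": "jon.lee@example.com"},
    2: {"name": "Jonathon Lee", "email": "jon.lee@example.com"},
    3: {"name": "Joanna Leong", "email": "j.leong@example.org"},
}

# Candidate pairs as they might arrive from the blocking step.
results = score_candidates(records, {(1, 2), (1, 3)})
```

Here the pair (1, 2) differs by a single letter in the name and scores higher than the pair (1, 3). A real matching model would likely weight attributes differently or learn the scoring from labelled pairs.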
Special thanks to KLASS Engineering and Solutions for their continuous support and for giving us the opportunity to work on this project.
Vote for our project at the exhibition! Your support is vital in recognizing our creativity. Join us in celebrating innovation and contributing to our success. Thank you for being part of our journey!
At Singapore University of Technology and Design (SUTD), we believe that the power of design is rooted in the understanding of human experiences and needs, to create innovation that enhances and transforms the way we live. This is why we develop a multi-disciplinary curriculum delivered via a hands-on, collaborative learning pedagogy and environment that culminates in a Capstone project.
The Capstone project is a collaboration between companies and senior-year students. Students of different majors come together to work in teams and contribute their technology and design expertise to solve real-world challenges faced by companies. The Capstone project will culminate with a design showcase, unveiling the innovative solutions from the graduating cohort.
The Capstone Design Showcase is held annually to celebrate the success of our graduating students and the enthralling multi-disciplinary projects they have developed.