R Govys: Fellegi-Sunter Decisions beyond Pairwise Record Linkage


Details
Fellegi and Sunter (1969) formalize the probabilistic theory for linking pairs of records. Their theory gave rise to a powerful methodology based on expanding exhaustive classifications of record pairs. This approach is powerful and widely used but has implicit limitations. Sadinle and Fienberg (2013) expose explicitly the limitations of linking general configurations of records proceeding strictly from classified pairs. They propose an expanded universe of linkage structures transcending the limitations of pair-based linkage. I discuss simple instances of such linkage structures -record triples. I show how binary Fellegi-Sunter decisions -"match or "nonmatch"- can be applied successively to take advantage of these more complex structures to construct principled sets of linked records. Tools and approaches for record linkage in R will be used to convey the concepts.
Presented by: Yves Thibaudeau, Center for Statistical Research and Methodology, U.S. Census Bureau ~ After completing a PhD at Carnegie Mellon University in 1988, Yves joined the record-linkage group headed by Bill Winkler in the Statistical Research Division. Over his career Yves has worked on problems involving missing data and classification, in addition to record linkage.
Register here: https://amstat.zoom.us/webinar/register/WN_emap57YPSkGWxZ8muurqKA


R Govys: Fellegi-Sunter Decisions beyond Pairwise Record Linkage