Sequential Coverage Algorithm (SCA) in Record Linkage

CDC’s National Center for Health Statistics (NCHS) Data Linkage Program has implemented a supervised machine learning algorithm, known as the Sequential Coverage Algorithm (SCA) in their linkage programs. The SCA was used to develop joining methods (or blocking groups) when working with very large datasets. The SCA method improved the efficiency of
blocking.