This project is a data processing and matching pipeline designed to handle multiple CSV files, perform data preprocessing, and execute various matching algorithms to find similarities between records.