Overview: The goal of this challenge will be to identify properties throughout Croatia owned by public officials. To do this, we will use lists of Croatian and foreign PEPs to search across data collected from Croatia’s property registry.
C4ADS has provided PEP lists from Croatia, Hungary, and Russia. Participants can use these lists to programmatically search and identify name matches or name similarities. We suggest that participants develop a method of fuzzy matching (Levenshtein distance) in order to account for unknown variations.
For the Croatian public officials, participants may attempt to compare real estate records to official asset declarations records to identify any assets that public officials did not declare.
Data provided:
- Croatian property data collected from državna geodetska uprava
- Croatian asset declarations data collected from registar dužnosnika
- Dataset provided here
- List of Croatian PEPs
- List of Hungarian PEPs
- List of Russian PEPs
Outcome: Develop a method for fuzzing searching a list of names against the Owners Details CSV file. Create a dataset of possible matches and the details of those properties.
Bonus: In the case of Croatian PEPs, identify any discrepancies between declared data and data found in real estate database