Skip to content

Latest commit

 

History

History
 
 

ofacdata

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

How we process OFAC lists

Data Collection

  • We collect the data from the official website of the U.S. Department of the Treasury's Office of Foreign Assets Control (OFAC) and download the data in the form of a CSV file from here

  • The SDN list contains the names of individuals, entities and groups designated by OFAC as well as the listing of maritime vessels and aircraft that are blocked by OFAC.

ofacdata/original

  • The data is stored in the form of 2 CSV files named sdn.csv and add.csv. dataspec.txt explains the data specification for the CSV data files.
  • The data is cleaned to obtain the required information for individuals from sdn.csv file. A ballpark number of 6917 individuals (at the time of writing this document) entries are present in sdn.csv. Remaining entries are entities, vessels, and aircrafts.

Data Processing

ofacdata/scripts

  • The ofac.ipynb script extracts the data from both the csv's and parses them in json format.
  • We parse all ethereum addresses, regardless of individual or entity in eth_addresses.json.
  • For individuals, we parse:
    • full name (first name, last name), dob(day, month, year) in names.json
    • passports and passport issuing country in passport.json
  • The jsons are stored at ofacdata/inputs to be used by SMT's.

Data Usage

These jsons are later used to create sparse merkle trees for non-membership proofs. We provide 3 levels of proofs.

  • Match through Passport Number: level 3 (Absolute Match)
  • Match through Names and Dob combo tree: level 2 (High Probability Match)
  • Match only through Names: level 1 (Partial Match) The merkle tree is also exported as json in ofacdata/outputs for time constrained export and import.

Check out src/utils/smtTree.ts for more details.