1. How do data cleaning tools like Google Refine actually work? How do these tools streamline inputs that mean the same thing but are labeled differently (Ex. from the reading – inputs that read “attorney,” “counsel”, “lawyer,” etc.)? Is it necessary to spot-check the data for things Refine may have missed?
  2. How useful is it for journalists to learn basic coding skills? While a few contributors to the chapter said it was not necessary, a lot of the journalists cited tools that require a basic fluency in some coding language, like Python.