- How do data cleaning tools like Google Refine actually work? How do these tools streamline inputs that mean the same thing but are labeled differently (e.g., from the reading, inputs that read “attorney,” “counsel,” “lawyer,” etc.)? Is it necessary to spot-check the data for things Refine may have missed?
- How useful is it for journalists to learn basic coding skills? While a few contributors to the chapter said it was not necessary, a lot of the journalists cited tools that require a basic fluency in some coding language, like Python.
Google Refine mostly does find/replace and text conversion, just extremely fast and across a whole column of data at once.
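To make the find/replace idea concrete, here is a minimal Python sketch. It is not Refine's actual code; it is only loosely modeled on the fingerprint-style keying that Refine's clustering feature offers. The function names and the `SYNONYMS` table are assumptions made up for illustration. Normalizing text catches differences in case, spacing, and punctuation, but true synonyms like "lawyer" and "counsel" only merge because a person listed them by hand.

```python
import string
from collections import defaultdict

# Hypothetical synonym table, written by hand for this example.
# A tool cannot guess these; someone has to decide they mean the same thing.
SYNONYMS = {"counsel": "attorney", "lawyer": "attorney"}


def fingerprint(value: str) -> str:
    """Normalize a label so trivially different spellings share one key."""
    cleaned = value.strip().lower()
    # Drop ASCII punctuation such as trailing periods.
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    # Sort unique tokens so word order and repeats do not matter.
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def canonical(value: str) -> str:
    """Map a label to its canonical form, applying the synonym table."""
    key = fingerprint(value)
    return SYNONYMS.get(key, key)


def cluster(values):
    """Group the original labels under their canonical form."""
    groups = defaultdict(list)
    for v in values:
        groups[canonical(v)].append(v)
    return dict(groups)


labels = ["Attorney", "attorney ", "Lawyer.", "counsel", "COUNSEL", "paralegal"]
clusters = cluster(labels)
```

Anything missing from the synonym table stays in its own group, which is one reason spot-checking the cleaned data is still worthwhile.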
Coding: yes, I think it is important to get some coding experience.