My input data arrived as a database dump. From some light research, it looks like it *is* technically possible to read a dump directly into Python and a pandas DataFrame, but I don't want to do that, because it seems pretty hairy.
So, first things first: I decided to convert it to CSV.
Based on https://blog.twineworks.com/converting-a-mysql-dump-to-csv-files-b5e92d7cc5dd
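
To give a feel for what the conversion involves, here is a minimal Python sketch. It is not necessarily the method from the linked post. It assumes a mysqldump-style file where each data line looks like ``INSERT INTO `table` VALUES (...),(...);`` with no column list, and the names `parse_values`, `dump_to_rows`, and `rows_to_csv` are made up for this illustration. It turns NULL into an empty cell and writes no header row, since column names live in the `CREATE TABLE` statements, which this sketch ignores.

```python
import csv
import io
import re

# Assumed statement shape: INSERT INTO `table` VALUES (...),(...);
INSERT_RE = re.compile(r"^INSERT INTO `(?P<table>[^`]+)` VALUES\s*", re.IGNORECASE)

# Common backslash escapes inside MySQL string literals.
ESCAPES = {"n": "\n", "t": "\t", "r": "\r", "0": "\0"}


def parse_values(text):
    """Yield one list of string fields per (...) tuple in a VALUES clause."""
    row, field = [], []
    in_row = in_str = quoted = False
    i = 0
    while i < len(text):
        ch = text[i]
        if in_str:
            if ch == "\\" and i + 1 < len(text):
                # Backslash escape: keep the escaped character.
                nxt = text[i + 1]
                field.append(ESCAPES.get(nxt, nxt))
                i += 1
            elif ch == "'":
                if i + 1 < len(text) and text[i + 1] == "'":
                    # A doubled quote is a literal quote.
                    field.append("'")
                    i += 1
                else:
                    in_str = False
            else:
                field.append(ch)
        elif ch == "'":
            in_str = quoted = True
        elif ch == "(" and not in_row:
            in_row = True
        elif ch in ",)" and in_row:
            value = "".join(field)
            if not quoted:
                value = value.strip()
                if value.upper() == "NULL":
                    value = ""  # unquoted NULL becomes an empty cell
            row.append(value)
            field, quoted = [], False
            if ch == ")":
                yield row
                row, in_row = [], False
        elif in_row:
            field.append(ch)
        i += 1


def dump_to_rows(lines):
    """Group parsed rows by table name; non-INSERT lines are skipped."""
    tables = {}
    for line in lines:
        match = INSERT_RE.match(line)
        if match:
            rows = parse_values(line[match.end():])
            tables.setdefault(match["table"], []).extend(rows)
    return tables


def rows_to_csv(rows):
    """Render rows as CSV text, quoting fields only where needed."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()
```

To use it, something like `dump_to_rows(open("dump.sql", encoding="utf-8"))` gives one list of rows per table (the filename is a placeholder), and each list can go through `rows_to_csv` and out to a file. From there pandas can pick it up with `pd.read_csv(path, header=None, names=[...])`. Dumps made with different options (column lists in the INSERT, hex blobs, and so on) would need more handling than this.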