Machine learning is taking hold in all kinds of applications, from self-driving cars to image recognition to online recommendation engines. But unless you’re a Google or a Facebook,, it’s hard to get your hands on the kind of massive, real world data sets required to test and validate machine learning programs.