Machine learning is taking hold in all kinds of applications, from self-driving cars to image recognition to online recommendation engines. But unless you’re a Google or a Facebook, it’s hard to get your hands on the kind of massive, real-world data sets required to test and validate machine learning programs.
Yahoo has helped rectify that with the release Thursday of what it called the “largest ever” data set made available to machine learning scientists. It’s a collection of anonymized user interactions with the news streams on sites like Yahoo News and Yahoo Sports.
Read more at: PC World