This is the sixth post in a project on collaborative filtering, based on the MovieLens 100K dataset.

Previously, I showed how to do matrix factorization

Previously, I showed how to use similarity-based approaches

Now that we've established some simple baseline models

This is the first post in a project that uses the MovieLens dataset to learn about collaborative filtering algorithms. Here, I perform an exploratory data analysis to see what the data looks like.