Author: Colin Pham

  • Introduction In this project, I decided to discover patterns in the housing prices of California. Using a dataset of the housing prices in California’s districts, I aimed to use clustering to identify if there were any separating features within the dataset. I used k-means clustering to group the districts based on their population, prices, and…

  • Introducing the Problem The data set I chose to look at is Hudl Statsbomb’s free release of their match record for the 2020 UEFA Euros. The “problem” that I chose to tackle with this data is to discover the key players for the top teams in the tournament. The definition of key player I chose…

  • Welcome to WordPress! This is your first post. Edit or delete it to take the first step in your blogging journey.