For this blog post I want to go over a topic that I found slightly confusing when it was first introduced to me, and hopefully help others who might still be perplexed.

JOIN clauses are used in SQL when we want to combine information from multiple tables in a single query. Below are the four types of JOINs and their definitions according to :

  • (INNER) JOIN: Returns records that have matching values in both tables
  • LEFT (OUTER) JOIN: Returns all records from the left table, and the matched records from the right table
  • RIGHT (OUTER) JOIN: Returns all records…
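The INNER and LEFT JOIN definitions above can be sketched with Python's built-in `sqlite3` module. This is a minimal illustration, not taken from the original post: the `customers` and `orders` tables, their columns, and the sample rows are all hypothetical. RIGHT JOIN is left out because older SQLite versions do not support it.

```python
import sqlite3

# In-memory database with two small, made-up tables (hypothetical data).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, item TEXT);
    INSERT INTO customers VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cy');
    INSERT INTO orders VALUES (10, 1, 'book'), (11, 1, 'pen'), (12, 2, 'lamp');
""")

# INNER JOIN: only customers that have at least one matching order.
inner_rows = conn.execute("""
    SELECT c.name, o.item
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.id
    ORDER BY o.id
""").fetchall()

# LEFT JOIN: every customer, with NULL (None in Python) where no order matches.
left_rows = conn.execute("""
    SELECT c.name, o.item
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.id
    ORDER BY c.id, o.id
""").fetchall()

print(inner_rows)
print(left_rows)
conn.close()
```

In this sketch the customer with no orders is dropped by the INNER JOIN but kept by the LEFT JOIN, paired with `None` for the missing item.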

As I conclude my data science bootcamp with General Assembly, I wanted to take a moment to share some project ideas that I hope to undertake in the coming weeks.

As a lifelong football player and fan, I am particularly interested in football analytics and wanted to narrow the scope of my ideas to this subject. Over the last 10 years, the amount of data available for NFL teams to analyze has grown exponentially. Amazon Web Services now powers the NFL’s “Next Gen Stats.”

By placing radio frequency identification tags in each player’s shoulder pads, as well as inside the balls used…

Manuel Sainz de la Pena

For my second blog post I will be diving into one of my favorite machine learning models: Random Forest. Before I go any further, I want to give credit to General Assembly and their lesson on this topic, written by Matt Brems, Riley Dallas, and Patrick Wales-Dinan, for providing much of the content I will be discussing.

Random Forest models are an attempt to remedy one of the main problems with bagged decision tree models, namely their tendency to overfit and have high variance. This is due to the fact that the…


This is the first question I receive when I tell a family member or friend that I am currently studying to become a data scientist.

The truth is that I didn’t really know the answer to that question when I began my data science bootcamp with General Assembly. Sure, I had a rough idea. I was aware of the prevalence of data in all of our lives. And I knew that companies would certainly pay for someone to analyze that data and provide actionable insights.

However, I definitely did not have a complete…

Manuel Sainz de la Pena

I am a Data Scientist at General Assembly. I hope to help others entering this field by sharing the wisdom, tips, and best practices I learned along the way.
