Sport Analytics in R

Emre Usta 9/18/2019

Quick Introduction

I’m working as Data Warehouse and Business Intelligence developer for three years. So, I’ve been working with huge data sets, I need to learn how to extract, transform and load the data with big data tools. Which means in the short term, data engineer side would be more useful for me. Besides I would like to improve my data analytics skills in order to add predictive analytics knowledge into my skillset and make my client’s decision making more effective and more precise which is almost everything.

Also I am a huge fan of sports. I enjoy watching nearly every branch of it. I would like to work on data analytics in sport industry. So, I decided to put some materials about data analytics in sport.

Python vs R: Head to Head Data Analysis

There have been dozens of articles written comparing Python vs R from a subjective standpoint. This article aims to look at the languages more objectively. We’ll analyze a data set side by side in Python and R, and show what code is needed in both languages to achieve the same result. This will let us understand the strengths and weaknesses of each language without the conjecture. You can read more here

Python vs R

Sport Analytics in R

Statistical Models for Sport in R


Formula 1 Race Data Analysis

Formula One (also Formula 1 or F1 and officially the FIA Formula One World Championship) is the highest class of single-seat auto racing that is sanctioned by the Fédération Internationale de l’Automobile (FIA). The FIA Formula One World Championship has been one of the premier forms of racing around the world since its inaugural season in 1950.

This dataset contains data from 1950 all the way through the 2017 season, and consists of tables describing constructors, race drivers, lap times, pit stops and more.

head(circuits,5)
##   circuitId  circuitRef                           name     location   country       lat       lng
## 1         1 albert_park Albert Park Grand Prix Circuit    Melbourne Australia -37.84970 144.96800
## 2         2      sepang   Sepang International Circuit Kuala Lumpur  Malaysia   2.76083 101.73800
## 3         3     bahrain  Bahrain International Circuit       Sakhir   Bahrain  26.03250  50.51060
## 4         4   catalunya Circuit de Barcelona-Catalunya    MontmelÌ_     Spain  41.57000   2.26111
## 5         5    istanbul                  Istanbul Park     Istanbul    Turkey  40.95170  29.40500

You can explore this dataset here


The role of a Performance Analyst in Sports

Performance analysis is the process of assessing performance in a sport to develop an understanding of actions that can inform decision-making, optimize performance and support coaches and players in their journey towards optimal results. In many team sports this would consist on tactical assessment, movement analysis, video and statistical databasing and modeling and coach and player data presentations.

Click here to read more