1  Assignment 1

Author

Nida Donmez

Published

November 10, 2023

1.1 | ABOUT ME

I am Nida Donmez, currently studying in MEF University Big Data Analytics Master’s Program 2023-24. I work as an Indirect Online Channels Account Manager in Turkish Airlines. In my field, there is big bulk of data that requires presentation, visualisation and interpretation, which surely will bring up more sophisticated insight via deeper analysis. I aim to enrich these skills and put them in practice by making use of various tools and approaches learnt during this course.

1.2 | POSIT PBC YOUTUBE CHANNEL

This is a Youtube channel created by RStudio community aiming to inform users about open-source tools for R language and data science updates, with interviews and presentations from data science professionals.

1.2.1 | Leveraging R & Python in Tableau with RStudio Connect

I chose to watch the video titled “Leveraging R & Python in Tableau with RStudio Connect | James Blair”.

James Blair is a Product Manager in Posit PBC. In this video, he presents the useful extensions of R, Python and Shiny functions for their use in Tableau. He starts with an example of implementing a machine learning model, random forest, via extensions from R and Python, but then explains that Python and R extensions might be useful not only for ML models, and also for more advanced statistical computations that Tableau is not capable of. All these extensions are possible by installing specific Tableau API packages in R and Python, and integrating them to Tableau with the extension paths created in R Studio. At the end of the video, he also presents how to provide Shiny extension. Its use is simpler, just a file version is downloaded to then be uploaded to Tableau app that is being used. At the end of the video, he illustrates an example of how the graphs created with four different tools can be displayed together in a Tableau dashboard.

In order to check for further resources, these extension names are:

  • plumbertableau for R
  • fastapitableau for Python
  • shinytableau for Shiny

1.4 | RELEVANT R POSTS

1.4.1 | Exploration for R Built-in Datasets

This post outlines that R-built-in datasets do not require any installation and can directly be called and functioned, as below:

head(mtcars, 6)
                   mpg cyl disp  hp drat    wt  qsec vs am gear carb
Mazda RX4         21.0   6  160 110 3.90 2.620 16.46  0  1    4    4
Mazda RX4 Wag     21.0   6  160 110 3.90 2.875 17.02  0  1    4    4
Datsun 710        22.8   4  108  93 3.85 2.320 18.61  1  1    4    1
Hornet 4 Drive    21.4   6  258 110 3.08 3.215 19.44  1  0    3    1
Hornet Sportabout 18.7   8  360 175 3.15 3.440 17.02  0  0    3    2
Valiant           18.1   6  225 105 2.76 3.460 20.22  1  0    3    1

It is also easy to visualise the data with a histogram, below example to see the frequency of hp types:

hist(mtcars$hp)

Post source

1.4.2 | Graphics in R

This post here creates a simple dataset and briefly explains the types of useful graphs that can be built out of a dataset in R, by introducing the code for each graph type. It also provides links to more detailed content and tutorials for different graph types, and a rich content of video tutorials for the package that can be used to create more advanced formats of these graphs (ggplot2).

I hereby share the code for building a density graph as an example from the post:

set.seed(123)
x <- rnorm(30)
y <- x + rnorm(30)  
plot(density(x))   

Post source