I am Nida Donmez, currently studying in MEF University Big Data Analytics Master’s Program 2023-24. I work as an Indirect Online Channels Account Manager in Turkish Airlines. In my field, there is big bulk of data that requires presentation, visualisation and interpretation, which surely will bring up more sophisticated insight via deeper analysis. I aim to enrich these skills and put them in practice by making use of various tools and approaches learnt during this course.
1.2 | POSIT PBC YOUTUBE CHANNEL
This is a Youtube channel created by RStudio community aiming to inform users about open-source tools for R language and data science updates, with interviews and presentations from data science professionals.
1.2.1 | Leveraging R & Python in Tableau with RStudio Connect
I chose to watch the video titled “Leveraging R & Python in Tableau with RStudio Connect | James Blair”.
James Blair is a Product Manager in Posit PBC. In this video, he presents the useful extensions of R, Python and Shiny functions for their use in Tableau. He starts with an example of implementing a machine learning model, random forest, via extensions from R and Python, but then explains that Python and R extensions might be useful not only for ML models, and also for more advanced statistical computations that Tableau is not capable of. All these extensions are possible by installing specific Tableau API packages in R and Python, and integrating them to Tableau with the extension paths created in R Studio. At the end of the video, he also presents how to provide Shiny extension. Its use is simpler, just a file version is downloaded to then be uploaded to Tableau app that is being used. At the end of the video, he illustrates an example of how the graphs created with four different tools can be displayed together in a Tableau dashboard.
In order to check for further resources, these extension names are:
plumbertableau for R
fastapitableau for Python
shinytableau for Shiny
1.3 | DAILY TRENDING YOUTUBE VIDEOS DATASET
This dataset in Kaggle contains top 50 latest trending videos on YouTube across 113 countries, and is updated daily. This dataset provides comprehensive information about the top trending videos, including daily rankings, movement trends, view counts, likes, comments, and more
Below, I imported the library “dplyr”, then imported the file in RStudio, and applied a basic formula to list the top 10 most watched Youtube channels in Turkey today (10.11.2023):
library(dplyr)
Attaching package: 'dplyr'
The following objects are masked from 'package:stats':
filter, lag
The following objects are masked from 'package:base':
intersect, setdiff, setequal, union
This post here creates a simple dataset and briefly explains the types of useful graphs that can be built out of a dataset in R, by introducing the code for each graph type. It also provides links to more detailed content and tutorials for different graph types, and a rich content of video tutorials for the package that can be used to create more advanced formats of these graphs (ggplot2).
I hereby share the code for building a density graph as an example from the post:
set.seed(123)x <-rnorm(30)y <- x +rnorm(30) plot(density(x))