About Me

Hi there, I’m Cem. I hold a bachelor’s degree in Tourism Administration from Istanbul University. After graduating, I started my career at Jolly Tour, where I planned and prepared corporate events and trips for the dealers of large global companies such as LG Electronics, Michelin, Huawei and Goodyear. I also organised trips to major trade fairs in different sectors. During my time at Jolly Tour, I had the chance to visit more than 40 countries.

I moved to the United States at the end of 2019 to begin an MBA program, but because of the difficult conditions that arose at the beginning of 2020, I left my master’s program halfway through and had to come back in July 2020. During my time in the USA, I lived in Ashburn, Virginia. While spending most of my day on the internet because of the pandemic, I learned that Loudoun County, where I lived, is known as the data center capital of the world and hosts the huge data centers of companies such as Google, Amazon and Facebook. I wanted to visit one of the data centers to learn more about this subject, which interested me. When I visited Amazon’s data center to see what was going on, the conversation I had with the staff there was very impressive.

After returning to Turkey, I decided to complete my postgraduate studies here and applied to several schools that offer master’s programs in data analytics. I earned the Big Data and Business Analytics Specialization certificate from Istanbul Technical University and the Business Analysis Methodology, Modelling Techniques & Test Management certificate from Bahcesehir University. I also studied on my own through DataCamp during lockdown. In addition, I am studying MIS at Istanbul University as a second bachelor’s degree.

I am planning to use my data science skills to help save the world. I want to take part in research on global warming, ocean pollution, animal habitats and so on. Alongside my studies, I work as a Business Analyst at Global Kapital Group. I love playing video games, and I have been playing Dota since I was 14. I have a cat; she is 7. That’s all for now.

useR 2021 Tutorials: Introduction to TileDB for R

Lecturers: Aaron Wolen - Senior Software Engineer at TileDB Inc. & Dirk Eddelbuettel - Statistician & Principal Software Engineer at TileDB Inc.

TileDB is a serverless data management tool designed to help enterprise data analytics professionals and data scientists collaborate and work faster. TileDB’s universal data engine can handle the storage and analysis of any data, including complex formats, using multi-dimensional arrays.

Introduction to TileDB for R was one of the tutorial sessions of useR 2021. I am personally interested in big data analysis and would like to analyze big data sets using tools of this kind. Aaron Wolen and Dirk Eddelbuettel, software engineers at TileDB Inc., explained with examples how TileDB integrates with R. The talk introduces participants to the ‘tiledb’ R package on CRAN, which allows users to operate efficiently on large arrays using familiar R techniques and data structures.

In addition to the talk, I came across the Medium page of Stavros Papadopoulos, Founder & CEO of TileDB Inc., and read two of his articles about TileDB. The links are below if anyone is interested.

TileDB 2.0 and The Future of Data Science & TileDB: A Database for Data Scientists

Track COVID-19 Data Yourself Using R

I can honestly say that the most important contribution of the pandemic, which had a great impact on our lives, is that it taught us how much we need science and that we cannot exist without it. In my opinion, the data announced at the beginning of the pandemic were not reliable. But at least we managed to track the pandemic by integrating the HES application into different fields. This article is about how to write a script in R to pull and analyse current coronavirus data. Once the script is written, you can save it and rerun it whenever you like. So, get your vaccine shot quickly, but stay skeptical of the government’s stats.

Exploratory Data Analysis in R for Beginners

When I first dove into my data studies, one of the first things I heard was that 70-80% of my time would be spent cleaning and processing data. They were so right! EDA (Exploratory Data Analysis) is the process of analyzing and visualizing data to get a better understanding of it and glean insight from it. You can find useful examples of the tidyverse and ggplot libraries in this article, as both libraries will be in our lives from now on.

August 2021: “Top 40” New R Packages

In this article, the author presents 40 new R packages published in August 2021. I think the most interesting one may be the taylor package under the heading “Data”, if you are a fan of Taylor Swift. The package provides access to a curated data set of Taylor Swift songs, including lyrics and audio characteristics.