Data Science

Tracking ggplot2 Extensions

The purpose of this blog post is to inform R users of a website that I created to track and list ggplot2 extensions. The site is available at: http://ggplot2-exts.github.io. The purpose of this site is to help other R users easily find ggplot2 extensions that are coming in “fast and furious” from the R community

Using Apache SparkR to Power Shiny Applications: Part I

The objective of this blog post is demonstrate how to use Apache SparkR to power Shiny applications. I have been curious about what the use cases for a “Shiny-SparkR” application would be and how to develop and deploy such an app.

Launch Apache Spark on AWS EC2 and Initialize SparkR Using RStudio

In this blog post, we shall learn how to launch a Spark stand alone cluster on Amazon Web Services (AWS) Elastic Compute Cloud (EC2) for analysis of Big Data. This is a continuation from our previous blog, which showed us how to download Apache Spark and start SparkR locally on windows OS and RStudio