Posts

Window Functions and Parameterized SQL Queries

SQL
PostgreSQL
DBI
ggplot2

Practice with SQL window functions and parameterized queries from R using Olympics data.

Modern Big Data Analysis with SQL

SQL
Hive
Impala
S3
HDFS

An introduction to distributed SQL query engines like Hive and Impala.

Functional Dependencies and Relational Database Design

SQL
relational database theory

Determine functional dependencies to structure relations in third normal form.

Text Data in PostgreSQL

SQL
PostgreSQL
DBI
RPostgres
NLP

Use regular expressions and text analysis functions on India's Independence day speeches.

PostgreSQL Database Design

SQL
PostgreSQL
DBI
RPostgres

Create a normalized SQL database from a CSV file of India's Independence Day speeches.

An R Data Package for Indian Independence Day Speeches

NLP
open-data
r-packages
shiny

An R package including a dataset of full-text English renderings of Indian Independence Day speeches, delivered annually on 15 August since 1947.

An R Data Package for Preparing and Visualizing India-tagged Data from the NYT Article Search API

open-data
r-packages
shiny
leaflet
tidyverse
ggiraph
ggplot2
dygraphs
DT
gt

An R package to query the Article Search API of The New York Times for articles with an “India” location keyword. It also includes functions to prepare this data to be ready for analysis, as well as a shiny app to visualize the output dataset.

Working with Databases in R

SQL
dbplyr
sf
PostGIS
rpostgis

Use R to structure and query database tables of Indian census data

Dashboards with shinydashboard & ggiraph

shiny
shinydashboard
sf
ggiraph
geospatial

Explore how access to electricity in India varies with respect to latrine access at state and district levels through a scatterplot, dumbbell plot and a bivariate bubble map.

Comparative Thematic Mapping with Mapdeck

shiny
mapdeck
sf
geospatial

Compare mapping styles like a choropleth, dot density map, proportional symbols map, and 3D choropleth using Indian electricity and latrine access data

Animated and Interactive Maps in R

tmap
sf
gganimate
mapview
leaflet
plotly
ggiraph
crosstalk
shiny

Tutorial for creating animated maps using packages like {tmap} and {gganimate} and interactive maps using packages like {ggiraph}, {mapview}, {leaflet} and {plotly}

Static Maps in R

tmap
sf
ggplot2
cartogram
geogrid
geofacet

Tutorial for creating static choropleths and cartograms using packages like {tmap}, {ggplot2}, {cartogram}, {geogrid} and {geofacet}

Spatial Subsetting in R

sf
leaflet
ggplot2
tidycensus
tigris
geospatial

Learn different topological relations to spatially subset data via the {sf} package

Interactive Choropleths with Shiny and Leaflet

sf
shiny
leaflet
ggplot2
geospatial

Explore electricity, latrine and water access data from the Indian Census

Zoom-Triggered Actions in Leaflet and Shiny

sf
leaflet
shiny
ggplot2
tigris
tidycensus
geospatial

Explore median household income data in the Delaware Valley at various levels of scope

Generate Choropleths and Cartograms in Shiny

sf
ggiraph
shiny
cartogram
geogrid
geospatial

Visualize India's states through a range of geospatial representations

Plotting with Pygal

python
pygal

An introduction to Python’s Pygal plotting library

Mapping Walmart's Growth

gganimate

Recreating a D3 visualization of Walmart’s US growth in R with {gganimate}

Text Mining: TF-IDF & Sentiment Analysis

tidytext
shiny
ggplot2
rvest
NLP

Tidy text analysis of India PM’s radio addresses in a shiny app

Animating Dosas

gganimate
leaflet
tidyverse
rvest

Scraping Saravana Bhavan’s web site to map restaurant locations in time and space

Exploring Campaign Contribution Data

tidyverse
ggiraph

Investigation and visualization of 2016 Presidential election campaign contributions in PA

Next Word Prediction

shiny
tidyverse
data.table
NLP

Next word prediction app in support of JHU Data Science Capstone on Coursera

Supervised Learning: Exercise Classification

caret
ML

Exploring the effectiveness of different ML models to classify motion data into exercise categories

Comparing Fuel Efficiency via Linear Regression

regression
mtcars

Using linear regression models to quantify the difference in fuel efficiency among automatic and manual transmission cars

Predicting Titanic Survival

caret
ML

My attempt at the classic Kaggle competition to predict survival on the Titanic

More articles »

Posts

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY-NC 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".