An R Data Package for Indian Independence Day Speeches

An R package including a dataset of full-text English renderings of Indian Independence Day speeches, delivered annually on 15 August since 1947.

NLP
open-data
r-packages
shiny
Published

September 5, 2021

While living in Delhi, I made trips to various libraries to access the English renderings of Indian Independence Day speeches in the volumes of collected speeches from every Prime Minister. I digitized them and collected the results, along with those already accessible, into an R package.

Since 1947, I’m only missing two years.

You can find the details of what’s included, how to install it (or just download the final CSV), some basic analysis, and a shiny app in the package’s GitHub repository.

Words per speech plot