Time Series Data in R
Time Series Data in R
Join the DZone community and get the full member experience.Join For Free
Learn how to stop testing everything every sprint and only test the code you’ve changed. Brought to you by Parasoft.
There is no shortage of time series data available on the web for use in student projects, or self-learning, or to test out new forecasting algorithms. It is now relatively easy to access these data sets directly in R.
M Competition data
The 1001 series from the M-competition and the 3003 series from the M3-competition are available as part of the Mcomp package in R.
DataMarket and Quandl
Both DataMarket and Quandl contain many thousands of time series that can be downloaded directly into R. A search for “Australian Real GDP per capita” on both sites returned many variants. The version from the Federal Reserve Bank in 2010 US dollars was available on both sites (Datamarket and Quandl). These data can be downloaded to R using the rdatamarket and Quandl packages respectively:
library(rdatamarket) library(Quandl) ausgdp <- as.ts(dmseries("http://data.is/1jDQwpr")[,1]) ausgdp2 <- ts(rev(Quandl("FRED/AUSRGDPC", type="ts")), end=2011)
The two series should be identical. For some bizarre reason, the Quandl data comes in reverse time order so
rev needs to be used, and then the time series attributes applied. The
Quandl function will also generate a warning that no authentication token has been used. Unauthenticated users are limited to 50 downloads per day. See the help page for details.
dmseries function from the rdatamarket package is simpler to use. The short URL is provided on the “Export” tab of the page for the data set on Datamarket. The data come in
zoo format, but can easily be converted to a
ts object using
For many years, I maintained the Time Series Data Library consisting of about 800 time series including many from well-known textbooks. These were transferred to DataMarket in June 2012 and are now available here.
A number of other R packages contain time series data. The following packages are listed in the Time Series Analysis Task View
- Data from Makridakis, Wheelwright and Hyndman (1998) Forecasting: methods and applications are provided in the fma package.
- Data from Hyndman, Koehler, Ord and Snyder (2008) Forecasting with exponential smoothing are in the expsmooth package.
- Data from Hyndman and Athanasopoulos (2013) Forecasting: principles and practice are in the fpp package.
- Data from Cryer and Chan (2010) Time series analysis with applications in R are in the TSA package.
- Data from Shumway and Stoffer (2011) Time series analysis and its applications are in the astsa package.
- Data from Tsay (2005) Analysis of financial time series are in the FinTS package, along with some functions and script files required to work some of the examples.
- TSdbi provides a common interface to time series databases.
- fame provides an interface for FAME time series databases
- AER and Ecdat both contain many data sets (including time series data) from many econometrics text books
Published at DZone with permission of Rob J Hyndman , DZone MVB. See the original article here.
Opinions expressed by DZone contributors are their own.