WarpDLM Reproducibility Guide

This repository contains the code necessary to reproduce the results in the paper "Warped Dynamic Linear Models for Time Series of Counts". There are three main folders:

Data: Contains raw and cleaned data
Code: Contains all R scripts needed to run analysis
Outputs: Contains any model outputs as well as paper figures

Dependencies and Environments

All analysis was run in R, version 4.0.0 or higher. The bulk of the analysis was performed on a Windows laptop, with the exception of the simulations, which were run on an Rstudio Server instance hosted on a multi-core Linux server.

There are a variety of required packages, which can be installed using the following lines of code.

#Install packages from Github
install.packages("remotes")
remotes::install_github("drkowal/rSTAR")

#Install other 
install.packages(c("doParallel", "foreach", "KFAS", "truncdist", "doSNOW", 
                    "tscount", "VGAM", "tidyverse", "dlm", "mc2d",
                    "bayesplot", "TruncatedNormal", "mvnfast", "magrittr",
                    "lubridate", "spatstat", "wesanderson", "ddst"))

Data

In this article, we use the warpDLM methodology to analyze counts of overdose calls due to heroin and other drugs in the the city of Cincinnati. This is derived from the full set of incident reports to the Cincinnati Fire Department, publicly available at this link. In addition to the data set itself, data dictionaries and descriptions of coding are also provided. This information was used to determine which calls corresponded to overdoses.

The raw data used for this project was downloaded in February 2021 and can be found here.

Application

There are two main scripts for the application. The first file is a script which inputs the raw data file and outputs the formatted count time series of drug overdoses that we are analyzing. That formatted data can be found in the Data folder as well. The second file has the code to run the offline Gibbs sampler and online particle filter, as well as produce the figures from the paper. The intermediate model outputs are stored here. Also note that this second file requires the helper functions script, a collection of functions used in different parts of the analysis.

There are four associated figures

Simulations

The simulation forecasts are generated in four different scripts, designed to be run in a multi-core environment as they are quite computationally intensive.

The forecasts are stored here, and these files are used in the simulation analysis script to compute various metrics and create figures. In its current form, the analysis script produces figures for the INGARCH forecasts, but you can simply change one line at the beginning to use the zero-inflated Poisson forecasts instead.

There are four associated figures

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Code		Code
Data		Data
Outputs		Outputs
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WarpDLM Reproducibility Guide

Dependencies and Environments

Data

Application

Simulations

About

Releases

Packages

Languages

bking124/warpDLM-reproducible-code

Folders and files

Latest commit

History

Repository files navigation

WarpDLM Reproducibility Guide

Dependencies and Environments

Data

Application

Simulations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages