This repository contains the final project for PSTAT 234 at UCSB (Spring 2021).
This project should be run with RStudio. For a smoother experience, please follow these instructions:
- Download and install R for your platform;
- Download and install RStudio;
- Git clone the repository to your local machine;
- Download the mobility data file from Google Drive and store it in the data/ directory of the cloned repository (create the directory if it does not exist), so that the full path to the file is data/mobility_subset.zip. Please note that you must be logged in to your UCSB Google account to access the file. Because of GitHub's file size limits and privacy concerns, the file is not hosted in the public GitHub repository. The data directory is therefore listed in the .gitignore file and is not synced by Git;
- Open the *.Rproj file in RStudio;
- Open and run the *.R script you are interested in.
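
Before running any script, it can help to confirm that the data file is where the steps above expect it. The following is a minimal sketch in base R, assuming the working directory is the project root (which is normally the case when the project is opened through its RStudio project file). The helper name `check_mobility_data` is hypothetical and is not part of this repository.

```r
# Hypothetical helper (not part of this repository): checks that the
# mobility archive is present and lists its contents without extracting it.
check_mobility_data <- function(path = file.path("data", "mobility_subset.zip")) {
  if (!file.exists(path)) {
    message("Data file not found at: ", path)
    return(FALSE)
  }
  # unzip(list = TRUE) returns a data frame with Name, Length and Date columns
  contents <- unzip(path, list = TRUE)
  print(contents[, c("Name", "Length")])
  TRUE
}

check_mobility_data()
```

If the function reports that the file is missing, re-check the download step and the location of the data/ directory.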
This project tests five different algorithms for imputation of missing values in mobility data:
- Amelia
- missMDA
- missForest
- softImpute
- MICE
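
A common way to compare imputation methods such as those listed above is to hide values that are actually known, impute them, and compute the root-mean-square error (RMSE) on the hidden entries only. The sketch below illustrates that idea in base R on simulated data. It is not the evaluation code of this project: the column-mean imputer is a deliberately simple stand-in for the packages above, and the function names, the 10% masking rate, and the simulated matrix are all assumptions made for illustration.

```r
# RMSE restricted to the entries that were artificially hidden.
rmse_on_masked <- function(truth, imputed, mask) {
  sqrt(mean((truth[mask] - imputed[mask])^2))
}

# Stand-in imputer (assumption): replace each NA with its column mean.
impute_col_mean <- function(x) {
  for (j in seq_len(ncol(x))) {
    miss <- is.na(x[, j])
    x[miss, j] <- mean(x[, j], na.rm = TRUE)
  }
  x
}

set.seed(1)
# Simulated complete data (assumption): 50 rows, 4 numeric columns.
truth <- matrix(rnorm(200), nrow = 50, ncol = 4)

# Hide roughly 10% of the entries at random.
mask <- matrix(runif(200) < 0.1, nrow = 50, ncol = 4)
with_na <- truth
with_na[mask] <- NA

# Impute, then score only the entries that were hidden.
imputed <- impute_col_mean(with_na)
score <- rmse_on_masked(truth, imputed, mask)
print(score)
```

Swapping `impute_col_mean` for a wrapper around one of the packages above would let the same masking and scoring steps be reused for each method.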
Accuracy is assessed via RMSE(D), and the results are shown in the figure below: