Topic Modeling using BERTopic with Llama Integration on Crash Narratives of SUV involved Bicycle Crashes
This repository addresses the critical concern of safety in transportation, particularly focusing on the vulnerability of road users such as cyclists, pedestrians, and motorcyclists. The aim is to highlight the underreporting of injuries, especially those involving cyclists, in official road crash statistics. The significance of considering all road users in policy decisions is emphasized throughout. Below is a plot showing the latent topics within the crash narratives recieved from the Ohio Department of Transportation
All of these are installed within the notebook, but are included below for local environments
- llama-cpp-python
- bertopic
- datamapplot
- cudf-cu12
- dask-cudf-cu12
- cuml-cu12
- cugraph-cu12
- cupy-cuda12x
- wget
- huggingface.co/TheBloke/OpenHermes-2.5-Mistral-7B-GGUF
- octis
- genism
- datasets