Data and Code for Joshi et al paper identifying non-random patterns of serotype switching and exploring biological correlates of these patterns. The preprint for the article is here https://www.biorxiv.org/content/10.1101/811406v1 To use the repository: download the files or clone the repository, maintaining the file structure. The file "diversity and co_occurrence analysis.Rmd' contains most of the analyses. The file "microreact_data.Rmd" was used to download and format the GPSC data available on microreact.