Cosmetic updates to prepare for public release
In README:
* Fixes for typos and consistent formatting
* Added "Getting Started" Section to help with reproducibility

Initial commit for README files for "code", "data", and "figures" subdirectories
* Added to help with reproducibility

Updates to .gitignore:
* Unignore "data", and "figures" subdirectories
Ian Buller, PhD, MA committed Jun 7, 2022
1 parent 519e978 commit 9bb32ef
Showing 5 changed files with 79 additions and 11 deletions.
9 changes: 7 additions & 2 deletions .gitignore
@@ -40,10 +40,15 @@ rsconnect/
*.DS_Store

# Data
-/data
+data/*
+!data/*.md

# Figures
-/figures
+figures/*
+!figures/*.md

# HTML
*.html
+
+# Sandbox
+sandbox.R
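
The switch from `/data` to `data/*` matters because Git cannot re-include a file when its parent directory is itself excluded. Ignoring the directory's contents instead lets `!data/*.md` keep the new README visible to Git. The following is a minimal sketch for checking this in a throwaway repository, assuming `git` and `mktemp` are available; `example.csv` is a hypothetical file name, not one used by the project.

```shell
#!/bin/sh
# Sketch: verify the data-directory ignore rules in a temporary repository.
set -eu

tmp=$(mktemp -d)
cd "$tmp"
git init -q .

# The two rules this commit introduces for the data directory.
printf '%s\n' 'data/*' '!data/*.md' > .gitignore

mkdir data
: > data/README.md
: > data/example.csv   # hypothetical data file

# `git check-ignore -q` exits 0 when the path is ignored and 1 when it is not.
if git check-ignore -q data/example.csv; then
  csv_state="ignored"
else
  csv_state="not ignored"
fi

if git check-ignore -q data/README.md; then
  md_state="ignored"
else
  md_state="not ignored"
fi

echo "data/example.csv: $csv_state"
echo "data/README.md: $md_state"
```

The same check applies to the `figures/*` and `!figures/*.md` pair.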
24 changes: 15 additions & 9 deletions README.md
@@ -3,25 +3,25 @@ Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking <img src

![license](https://img.shields.io/badge/license-apache-yellow)

-**Date repository last updated**: May 17, 2022
+**Date repository last updated**: June 07, 2022

### Authors

-* **Alaina H. Shreves**<sup>1,2</sup> [ORCID](https://orcid.org/0000-0002-0127-4391)
-* **Ian D. Buller**<sup>3,4</sup> [ORCID](https://orcid.org/0000-0001-9477-8582)
+* **Alaina H. Shreves**<sup>1,2</sup> - [ORCID](https://orcid.org/0000-0002-0127-4391)
+* **Ian D. Buller**<sup>3,4</sup> - [ORCID](https://orcid.org/0000-0001-9477-8582)
* **Elizabeth Chase**<sup>5,6</sup> - [ORCID](https://orcid.org/0000-0003-0452-2976)
* **Hannah Creutzfeldt**<sup>3,7</sup>
-* **Jared A. Fisher**<sup>3</sup> [ORCID](https://orcid.org/0000-0001-9203-5742)
-* **Barry I. Graubard**<sup>6</sup> [ORCID](https://orcid.org/0000-0002-6787-1105)
+* **Jared A. Fisher**<sup>3</sup> - [ORCID](https://orcid.org/0000-0001-9203-5742)
+* **Barry I. Graubard**<sup>6</sup> - [ORCID](https://orcid.org/0000-0002-6787-1105)
* **Robert N. Hoover**<sup>8</sup>
* **Debra T. Silverman**<sup>3</sup>
* **Susan S. Devesa**<sup>5</sup> - *Co-Senior Author*
* **Rena R. Jones**<sup>3</sup> - *Co-Senior Author* & *Corresponding Author* - [ORCID](https://orcid.org/0000-0003-1294-1679)

1. Department of Epidemiology, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, 02115, USA
2. Trans-Divisional Research Program, Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NCI), National Institutes of Health (NIH), Rockville, MD, 20850, USA
-3. Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, 20850, USA, Rockville, MD, 20850, USA
-4. Cancer Prevention Fellowship Program, Division of Cancer Prevention, National Cancer Institute, Rockville, MD, 20850, USA
+3. Occupational and Environmental Epidemiology Branch, DCEG, NCI, Rockville, MD, 20850, USA
+4. Cancer Prevention Fellowship Program, Division of Cancer Prevention, NCI, Rockville, MD, 20850, USA
5. Infections and Immunology Branch, DCEG, NCI, NIH, Rockville, MD, 20850, USA
6. Department of Biostatistics, University of Michigan School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
7. Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA, 90095, USA
@@ -68,7 +68,7 @@ Lung cancer is the leading cause of cancer death in the United States (US) and v

### R Scripts Included In This Repository

-This repository includes R-scripts use to calculate the geospatial techniques and render the figures found in the following peer-reviewed manuscript:
+This repository includes R scripts used to calculate the geospatial techniques and render the figures found in the following peer-reviewed manuscript:

Shreves AH, Buller ID, Chase E, Creutzfeldt H, Fisher JA, Graubard BI, Hoover RN, Silverman DT, Devesa SS, Jones RR. Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking. _Cancer Epidemiology, Biomarkers & Prevention_ (Submitted)

@@ -79,7 +79,7 @@ Shreves AH, Buller ID, Chase E, Creutzfeldt H, Fisher JA, Graubard BI, Hoover RN,
</colgroup>
<thead>
<tr class="header">
-<th>R-Script</th>
+<th>R Script</th>
<th>Description</th>
</tr>
</thead>
@@ -107,6 +107,12 @@ Shreves AH, Buller ID, Chase E, Creutzfeldt H, Fisher JA, Graubard BI, Hoover RN,

The repository also includes the code to create the project hexsticker.

+### Getting Started
+
+* Step 1: Download the data (see the Data Availability section).
+* Step 2: Save the data set to the `data` directory in this repository. The `preparation.R` file currently expects a CSV file; if your file location or name differs, modify the path on Line 58 of `preparation.R`.
+* Step 3: Run the R scripts for the figures. The `preparation.R` file will source the `functions.R` file.
+
### Data Availability

County-level U.S. lung cancer mortality rates and smoking prevalences are downloadable from [Model-based Small Area Estimates of Cancer-Related Measures](https://sae.cancer.gov/nhis-brfss/) from the [Surveillance Research Program](https://surveillance.cancer.gov/) within the [Division of Cancer Control and Population Sciences](https://cancercontrol.cancer.gov/) of the [National Cancer Institute](https://www.cancer.gov/) and the [National Vital Statistics System](https://www.cdc.gov/nchs/nvss/index.htm) from the [National Center for Health Statistics](https://www.cdc.gov/nchs/index.htm) of the [Centers for Disease Control and Prevention](https://www.cdc.gov/).
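
Before running the scripts, it can help to confirm that the layout described in Getting Started is in place. The following is a sketch only: `check_layout` is a hypothetical helper, not part of the repository. It assumes the R scripts sit in `code/` and that the CSV keeps the file name given in `data/README.md`; if the path in `preparation.R` has been modified, the list needs to change to match.

```shell
#!/bin/sh
# Preflight sketch: report which expected files are absent.
# check_layout is a hypothetical helper; the paths are assumptions
# based on the README files added in this commit.

check_layout() {
  root=$1
  missing=0
  for path in \
    "data/lung_cancer_mortality_and_smoking_prevalence.csv" \
    "code/functions.R" \
    "code/preparation.R" \
    "code/figure1.R" \
    "code/figure2.R" \
    "code/supplemental1.R" \
    "code/supplemental2.R"
  do
    if [ ! -f "$root/$path" ]; then
      echo "missing: $path"
      missing=$((missing + 1))
    fi
  done
  # Succeed only when every expected file was found.
  [ "$missing" -eq 0 ]
}
```

From the repository root, `check_layout . && Rscript code/figure1.R` would start the first figure script only when every file is present. The README does not state which working directory the figure scripts expect, so that part may need adjusting.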
41 changes: 41 additions & 0 deletions code/README.md
@@ -0,0 +1,41 @@
Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking <img src="../hex/hex.png" width="120" align="right" />
===================================================

### R Scripts Included In This Repository

This repository includes R scripts used to calculate the geospatial techniques and render the figures found in the following peer-reviewed manuscript:

Shreves AH, Buller ID, Chase E, Creutzfeldt H, Fisher JA, Graubard BI, Hoover RN, Silverman DT, Devesa SS, Jones RR. Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking. _Cancer Epidemiology, Biomarkers & Prevention_ (Submitted)

<table>
<colgroup>
<col width="20%" />
<col width="80%" />
</colgroup>
<thead>
<tr class="header">
<th>R Script</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td><p align="center"><code>functions.R</code></p></td>
<td>Custom functions to calculate the local Lee's L statistic with correction for multiple testing</td>
</tr>
<tr>
<td><p align="center"><code>preparation.R</code></p></td>
<td>Calculate the local Lee's L statistics for the four comparisons. Requires a data set to run (not included; see notes within).</td>
</tr>
<tr>
<td><p align="center"><code>figure1.R</code></p></td>
<td>Generate Figure 1</td>
</tr>
<tr>
<td><p align="center"><code>figure2.R</code></p></td>
<td>Generate Figure 2</td>
</tr>
<tr>
<td><p align="center"><code>supplemental1.R</code></p></td>
<td>Generate Supplemental Figure 1</td>
</tr>
<tr>
<td><p align="center"><code>supplemental2.R</code></p></td>
<td>Generate Supplemental Figure 2</td>
</tr>
</tbody>
</table>
10 changes: 10 additions & 0 deletions data/README.md
@@ -0,0 +1,10 @@
Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking <img src="../hex/hex.png" width="120" align="right" />
===================================================

Place data here (data not included).

The `preparation.R` file reads a CSV file named "lung_cancer_mortality_and_smoking_prevalence.csv".

### Data Availability

County-level U.S. lung cancer mortality rates and smoking prevalences are downloadable from [Model-based Small Area Estimates of Cancer-Related Measures](https://sae.cancer.gov/nhis-brfss/) from the [Surveillance Research Program](https://surveillance.cancer.gov/) within the [Division of Cancer Control and Population Sciences](https://cancercontrol.cancer.gov/) of the [National Cancer Institute](https://www.cancer.gov/) and the [National Vital Statistics System](https://www.cdc.gov/nchs/nvss/index.htm) from the [National Center for Health Statistics](https://www.cdc.gov/nchs/index.htm) of the [Centers for Disease Control and Prevention](https://www.cdc.gov/).
6 changes: 6 additions & 0 deletions figures/README.md
@@ -0,0 +1,6 @@
Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking <img src="../hex/hex.png" width="120" align="right" />
===================================================

Save generated figures here (figures not included). See the figures in the following peer-reviewed manuscript:

Shreves AH, Buller ID, Chase E, Creutzfeldt H, Fisher JA, Graubard BI, Hoover RN, Silverman DT, Devesa SS, Jones RR. Geographic Patterns in U.S. Lung Cancer Mortality and Cigarette Smoking. _Cancer Epidemiology, Biomarkers & Prevention_ (Submitted)
