Legionnaire’s Disease in the USA
Setting up R Packages
Plot Theme
Show the Code
# https://stackoverflow.com/questions/74491138/ggplot-custom-fonts-not-working-in-quarto
# Chunk options
knitr::opts_chunk$set(
fig.width = 7,
fig.asp = 0.618, # Golden Ratio
# out.width = "80%",
fig.align = "center"
)
### Ggplot Theme
### https://rpubs.com/mclaire19/ggplot2-custom-themes
theme_custom <- function() {
font <- "Roboto Condensed" # assign font family up front
theme_classic(base_size = 14) %+replace% # replace elements we want to change
theme(
panel.grid.minor = element_blank(), # strip minor gridlines
text = element_text(family = font),
# text elements
plot.title = element_text( # title
family = font, # set font family
size = 20, # set font size
face = "bold", # bold typeface
hjust = 0, # left align
# vjust = 2 #raise slightly
margin = margin(0, 0, 10, 0)
),
plot.subtitle = element_text( # subtitle
family = font, # font family
size = 14, # font size
hjust = 0,
margin = margin(2, 0, 5, 0)
),
plot.caption = element_text( # caption
family = font, # font family
size = 8, # font size
hjust = 1
), # right align
axis.title = element_text( # axis titles
family = font, # font family
size = 10 # font size
),
axis.text = element_text( # axis text
family = font, # axis family
size = 8
) # font size
)
}
# Set graph theme
theme_set(new = theme_custom())
#
Introduction
Legionnaires’ disease (LD) is a severe form of pneumonia (∼10–25% fatality rate) caused by inhalation of aerosols containing Legionella
, a pathogenic gram-negative bacteria. These bacteria can grow, spread, and aerosolize through building water systems. A recent dramatic increase in LD incidence has been observed globally, with a 9-fold increase in the United States from 2000 to 2018,
Records were also maintained of atmospheric Sulphur Dioxide (SO2) and the acidity i.e. pH of the atmosphere around building water systems such as Cooling Towers (CT) and in Rainwater.
This data is from this paper: Yu F, Nair AA, Lauper U (2024), https://doi.org/10.6084/m9.figshare.25157852.v2
Read the Modified Data
Inspect the Data
```{r}
#| label: inspect-skim-glimpse
# Write in
```
Data Dictionary
Write in.
Write in.
Write in.
Describe how you may plan to transform the data.
Research Question
Write in! Look first at the Charts below!
Join the Data
```{r}
#| label: data-preprocessing
#
# Write in your code here
# to prepare this data as shown below
# to generate the plot that follows
```
Here is the plot-ready data:
Plot the Data
Two plots were generated by the researchers with this data. Can you reproduce these? Do these graphs prove/disprove any of your hypotheses? What might have been the Hypotheses that led the creating of these graphs?
Tasks and Discussion
- Complete the Data Dictionary.
- Select and Transform the variables as shown. Combine the multiple datasets into one if needed!
- Create the graphs shown and discuss the following questions:
- Identify the type of charts
- Identify the variables used for various geometrical aspects (x, y, fill…). Name the variables appropriately.
- What is a peculiar feature of these graphs?
- What might have been the Hypothesis/Research Question to which the response was Chart?
- What data gathering / research activity might have been carried out to obtain the data graphed here? Provide some details.
- Write a short story based on the chart, describing your inference/surprise.
- Is there a paradox in this case study? Hint: SO2 is caused by cars/busses running on fossil fuels.
- What Statistical Tests might you run to confirm what the charts are saying?