Session 5: Data Viz

# .title-wrap[Intro to Programming with R for Political Scientists]

<br />

## .header-fancy[Session 5: Data Visualization]

### Markus Freitag

### Geschwister Scholl Institute of Political Science, LMU

### [<svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;fill:#415564;" xmlns="http://www.w3.org/2000/svg">  <path d="M459.37 151.716c.325 4.548.325 9.097.325 13.645 0 138.72-105.583 298.558-298.558 298.558-59.452 0-114.68-17.219-161.137-47.106 8.447.974 16.568 1.299 25.34 1.299 49.055 0 94.213-16.568 130.274-44.832-46.132-.975-84.792-31.188-98.112-72.772 6.498.974 12.995 1.624 19.818 1.624 9.421 0 18.843-1.3 27.614-3.573-48.081-9.747-84.143-51.98-84.143-102.985v-1.299c13.969 7.797 30.214 12.67 47.431 13.319-28.264-18.843-46.781-51.005-46.781-87.391 0-19.492 5.197-37.36 14.294-52.954 51.655 63.675 129.3 105.258 216.365 109.807-1.624-7.797-2.599-15.918-2.599-24.04 0-57.828 46.782-104.934 104.934-104.934 30.213 0 57.502 12.67 76.67 33.137 23.715-4.548 46.456-13.32 66.599-25.34-7.798 24.366-24.366 44.833-46.132 57.827 21.117-2.273 41.584-8.122 60.426-16.243-14.292 20.791-32.161 39.308-52.628 54.253z"></path></svg>](https://twitter.com/MarkusGFreitag) [<svg viewBox="0 0 496 512" style="height:1em;position:relative;display:inline-block;top:.1em;fill:#415564;" xmlns="http://www.w3.org/2000/svg">  <path d="M336.5 160C322 70.7 287.8 8 248 8s-74 62.7-88.5 152h177zM152 256c0 22.2 1.2 43.5 3.3 64h185.3c2.1-20.5 3.3-41.8 3.3-64s-1.2-43.5-3.3-64H155.3c-2.1 20.5-3.3 41.8-3.3 64zm324.7-96c-28.6-67.9-86.5-120.4-158-141.6 24.4 33.8 41.2 84.7 50 141.6h108zM177.2 18.4C105.8 39.6 47.8 92.1 19.3 160h108c8.7-56.9 25.5-107.8 49.9-141.6zM487.4 192H372.7c2.1 21 3.3 42.5 3.3 64s-1.2 43-3.3 64h114.6c5.5-20.5 8.6-41.8 8.6-64s-3.1-43.5-8.5-64zM120 256c0-21.5 1.2-43 3.3-64H8.6C3.2 212.5 0 233.8 0 256s3.2 43.5 8.6 64h114.6c-2-21-3.2-42.5-3.2-64zm39.5 96c14.5 89.3 48.7 152 88.5 152s74-62.7 88.5-152h-177zm159.3 141.6c71.4-21.2 129.4-73.7 158-141.6h-108c-8.8 56.9-25.6 107.8-50 141.6zM19.3 352c28.6 67.9 86.5 120.4 158 141.6-24.4-33.8-41.2-84.7-50-141.6h-108z"></path></svg>](https://markusfreitag.netlify.app/)

### 19.07.2021

<a href="https://github.com/m-freitag" class="github-corner" aria-label="View
source on Github"><svg width="80" height="80" viewBox="0 0 250 250"
style="fill:#415564; color:#f6f3f2; position: absolute; top: 0; border: 0;
right: 0;" aria-hidden="true"><path d="M0,0 L115,115 L130,115 L142,142 L250,250
L250,0 Z"></path><path d="M128.3,109.0 C113.8,99.7 119.0,89.6 119.0,89.6
C122.0,82.7 120.5,78.6 120.5,78.6 C119.2,72.0 123.4,76.3 123.4,76.3 C127.3,80.9
125.5,87.3 125.5,87.3 C122.9,97.6 130.6,101.9 134.4,103.2" fill="currentColor"
style="transform-origin: 130px 106px;" class="octo-arm"></path><path
d="M115.0,115.0 C114.9,115.1 118.7,116.5 119.8,115.4 L133.7,101.6 C136.9,99.2
139.9,98.4 142.2,98.6 C133.8,88.0 127.5,74.4 143.8,58.0 C148.5,53.4 154.0,51.2
159.7,51.0 C160.3,49.4 163.2,43.6 171.4,40.1 C171.4,40.1 176.1,42.5 178.8,56.2
C183.1,58.6 187.2,61.8 190.9,65.4 C194.5,69.0 197.7,73.2 200.1,77.6 C213.8,80.2
216.3,84.9 216.3,84.9 C212.7,93.1 206.9,96.0 205.4,96.6 C205.1,102.4
203.0,107.8 198.3,112.5 C181.9,128.9 168.3,122.5 157.7,114.1 C157.9,116.9
156.7,120.9 152.7,124.9 L141.0,136.5 C139.8,137.7 141.6,141.9 141.8,141.8 Z"
fill="currentColor"
class="octo-body"></path></svg></a><style>.github-corner:hover
.octo-arm{animation:octocat-wave 560ms ease-in-out}@keyframes
octocat-wave{0%,100%{transform:rotate(0)}20%,60%{transform:rotate(-25deg)}40%,80%{transform:rotate(10deg)}}@media
(max-width:500px){.github-corner:hover .octo-arm{animation:none}.github-corner
.octo-arm{animation:octocat-wave 560ms ease-in-out}}</style>

---

# Overview

1. Intro + R-Studio and (Git)Hub

2. Base R & Tidyverse Basics

3. Data Wrangling I

4. Data Wrangling II

5. .hl[Data Viz]

6. Writing Functions

---

# Workflow

- Navigate to `Session Scripts` and open `Session_5_script.R`.

- You will see a pre-formatted Script with all the steps I do on the slides.

- Explore as you follow.

- If you have a second monitor, great! If not, split your screen.

---

# Why is data viz important?

---

# Why is data viz important?

- We have always tried to represent information in an (abstract) visual form...

[The Babylonian Map of the World.](https://en.wikipedia.org/wiki/Babylonian_Map_of_the_World) Oldest known world map.

---

# Why is data viz important?

- Useful for...
    - exploring data structure/cleaning data (e.g., spotting missing or outliers)
    - identifying trends, clusters, descriptive patterns
    - descriptive as well as for statistical and causal inference

.hl2[BUT:] Visual Inferences can also misguide in several ways (for one perspective on this, 
see, e.g., [here](https://www.richardtraunmueller.com/wp-content/uploads/2019/01/Traunmueller-Visual-Inference-CCCP.pdf)).

- More generally, graphical abstractions can aid in other aspects when working with data 
(e.g., visual representations of database structures or DAGs as a tool to see conditions for causal identification more easily).

---

# Why is data viz important?

<img src="data:image/png;base64,#Figs/traunmuelller.png" width="813" style="display: block; margin: auto;" />
Source: [Traunmueller, 2018](https://www.richardtraunmueller.com/wp-content/uploads/2019/01/Traunmueller-Visual-Inference-CCCP.pdf).

---

# Useful for Description...

- We observe (see) some data and have various ways to transport summarising information
to our eyes, e.g.:

---

# Useful for Statistical Inference...

.font80[
- Inferring the unknown (some population quantity/parameter, e.g. `$E[Y|X]$`) from the known (the data at hand).

- Does what we see in the data accurately represent what we would see in the population we are interested in? How certain are we?

- Visualization is important for expressing our uncertainty about parameters...
]

---

# Useful for Statistical Inferences...

Not optimal ([NJEM 2020, Comparing Covid and Influenza Lungs](https://www.nejm.org/doi/full/10.1056/nejmoa2015432)):

]

A little [better](https://www.estimationstats.com/#/background) (not the same data, just a sim. example):
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/unnamed-chunk-7-1.png" width="80%" style="display: block; margin: auto;" />

]

---

# Visualizations are also useful for Causal Inference...

- We can represent our theoretical knowledge about (the absence) of causal 
relationships between variables with graphical models:

---
  
# Visualizations are also useful for Causal Inference...

Or for sensitivity analysis:

---

# But beware...

- If you conduct exploratory data analysis and plot relationships, e.g., in a scatter plot:

- What you see in the data does not tell you anything about causation. You need theory and assumptions. 
  
- `$E[Y|X]$` `$\neq$` `$E[Y|do(X=x')]$` (correlation is not causation!)

---

# But beware...

- Conditioning on the collider „Star“ (filtering our data and looking at subsamples of the „star“ variable)
would lead us to wrongly infer a negative relationship between talent and beauty, even if there is, in fact,
no relationship.

---

# .font70[Still, theory + data + viz can put you on the right tracks...]

John Snow and Cholera in London:

- In the 1840s and 50s, most scientists thought Cholera was transmitted via air.

- John Snow did not and forensically identified contaminated water as the cause of infection.

- A water company moved their production up the Thames during the pandemic, resulting in a [natural 
experiment (diff-in-diff)](https://www.semanticscholar.org/paper/Causality-in-the-Time-of-Cholera%3A-John-Snow-As-a-Coleman/eec8864302732a862e614789767bb0edc068a2fe).

- He also visualized that Cholera deaths were concentrated around a contaminated pump (from another company).

---

# .font70[Still, theory + data + viz can put you on the right tracks...]

Some people still did not believe him. E.g. [Max Pettenkofer](https://de.wikipedia.org/wiki/Max_von_Pettenkofer) in Munich...

---

# Some Principles of Data Viz

- Raw data first. Erase everything unnecessary.

The principle of proportional ink (PPI):

> The representation of numbers, as physically measured on the surface of the 
graphic itself, should be directly proportional to the numerical quantities represented. ([Tufte, 1983](https://www.edwardtufte.com/tufte/books_vdqi))

- Use color (incl. brightness, hue, saturation, etc.), but use it wisely.

- Most tables should be figures (except for the case when there is little data)

---

# Some Principles of Data Viz

---

# Gestalt Principles

- [Gestalt psychology](https://en.wikipedia.org/wiki/Gestalt_psychology) is a somewhat antiquated school of perceptual theory

- However, some of the "laws"/cognitive principles are pretty uncontroversial for [figure](https://www.researchgate.net/profile/Susan-Vanderplas/publication/338386060_Testing_Statistical_Charts_What_Makes_a_Good_Graph/links/600597e645851553a0522eef/Testing-Statistical-Charts-What-Makes-a-Good-Graph.pdf) 
or web design:

- Proximity: Objects or shapes that are spatially close appear to be related.
    - Similarity: Humans group things that look alike (e.g., use of color in figures). 
    - Change Blindness: We have a hard time comparing multiple visually similar plots or 
    facets of plots.
    - Common region: Things in a common region are related, easy to grasp (e.g., confidence intervals)

---

# Some No-Noes

Pie charts. Almost always bad (except for maybe dist. of seats in a parliament with <= 4 parties).

Source: [https://www.dph.illinois.gov/covid19/location-exposure?regionID=0](https://www.dph.illinois.gov/covid19/location-exposure?regionID=0).

---
  
# Some No-Noes

3D when there is no third dimension. Also, avoid 3D charts in general and break them down into multiple 2D charts.

---
  
# Some No-Noes

Source: [https://xkcd.com/1138/](https://xkcd.com/1138/).

---

# From Data to the Eye

[Grammar of Graphics](https://www.springer.com/de/book/9780387245447):

- Data can come in many types: num. continuous, num. discrete, categorical (ordered/unordered),
time/date, text, spatial.

- To make a Figure, we need to map the data onto components of 
graphical elements: onto .hl2[aesthetics] that form geometric objects.

> Some aesthetics are: coordinate positions, shapes, colors, line widths, etc.

We need to program which cells of our data correspond to which aesthetics values. I.e.,
specify a .hl[scale] that connects those two "layers".

---

# Which Plot to Choose?

- This depends, ofc., on your data (make sure you know it well) and the question you want to answer.

- We will not go through types of charts systematically.

- Take a look at [this](https://clauswilke.com/dataviz/index.html) book and this nice [decision tree](https://www.data-to-viz.com/)
with code examples!

.font70[.hl[Some Tipps:] [Cleveland dot plots](https://edav.info/cleveland.html) are an often useful alternative to bar charts. 
Line graphs need an ordered variable on the x-axis. Scatter plots need cont. y 
and x and can be hard to read with big data. For a single numeric or numeric x categorical variables,
I like [raincloud plots](https://github.com/RainCloudPlots/RainCloudPlots).]

Workflow:

> Theory/understanding of the data &rightarrow; think about appropriate 
aesthetics, and types of geometric objects (i.e., plot type) &rightarrow; sketch it by hand! &rightarrow; compute it &rightarrow; details last!!

---

# Data Viz with ggplot

---
  
# ggplot's Grammar

- `ggplot` [builds on](https://vita.had.co.nz/papers/layered-grammar.pdf) the grammar of graphics.

- `ggplot(data = df, mapping = aes(x = x, y = y, color = group))` is the core function where
we specify the aesthetic mapping.

- We then add layers with `+`.

]

]

---
  
# A Plot from Scratch

We will use the UN voting data again, but this time the ideal point estimates of 
[Bailey et al. 2017](https://d1wqtxts1xzle7.cloudfront.net/53137356/jcrpiece.pdf?1494871088=&response-content-disposition=inline%3B+filename%3DEstimating_Dynamic_State_Preferences_fro.pdf&Expires=1625062433&Signature=LCQ0Ad4kSlgDCF6HRlj0oElIcpXyEBtQRg9v7Fsm6wAPHbMTITurc35WYkVvVHunXLCd3uIafxYqPmt8RmsYumGg-zVrAwuMqDsaiO5YaVksmiu0NOabR7seJv92NFA2qTcVEtoy7X4zUmOOvJhTe-DjfgQ8PFNz-MEaVacIMPNAAMl0iRVXVJJR0B4Ks-r0zr75UlYXYhQUvvEnku3xXkhjUtcvWUmZBUEWiRnfgidZaKXXytHmnbNx4MBK-HRKw-Rj7hsis8j4ahIIxSF-YcFzH6CPrdtvIlS5c-Pt6HP3zu-9lM0dC7KqV5A6T6OHxHNWr3voCnoch8pYdOxdZQ__&Key-Pair-Id=APKAJLOHF5GGSLRBV4ZA).

- Based on an IRT, they compute session-year ideal points that can also be used to compute
  dyadic similarity measures.

- We will replicate and extend one of their time series figures to the post 2010 period.

Preprocessing:
.code60[

```r
unvotes <- import("data/UNVotes2.parquet")
sessions <- unvotes %>%
  select(session, date) %>%
  group_by(session) %>%
  summarise(year = year(min(date)))
ideal <- import("data/ideal.csv")
ideal <- left_join(ideal, sessions, by = "session")
ideal <- filter(ideal, iso3c %in% c("USA", "GBR", "FRA", "CHN", "RUS", "DEU"))
```
]

---
  
# A Plot from Scratch

```r
*ggplot(data = ideal,
*      mapping = aes(x = year,
*                    y = IdealPointAll,
*                    color = iso3c))
```
Other aesthetics are, e.g.: size, shape, fill, alpha.

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer1-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
* geom_point()
```

We prefer to show every single data point. Since the data 
is yearly this works ok with points. But we want lines too...

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer2-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
  geom_point() +
* geom_line()
```
]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer3-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
* geom_point(size = 2) +
* geom_line(size = 1)
```

If we dont want the size to map to our data, we can omit `aes()`and set it to some value.

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer4-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
  geom_point(size = 2) + 
  geom_line(size = 1) +
* scale_colour_brewer(palette = "Dark2")
```

We can add a decent color-blind friendly
scale from [ColorBrewer](https://colorbrewer2.org/#type=sequential&scheme=BuGn&n=3). 
We need a discrete color scale.

.font70[.hl[Tipp:] MORE COLORS!?! Take a look [here](https://github.com/EmilHvitfeldt/r-color-palettes).]
]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer5-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
  geom_point(size = 2) + 
  geom_line(size = 1) +
  scale_colour_brewer(palette = "Dark2") +
* scale_x_continuous(breaks = seq(1950, 2020, 10))
```
As our date variable is simply of type integer, we use `scale_*_continuous` instead
of `scale_*_date`.

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer6-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

But we don't want this in our case.

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer7-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer8-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

.font70[.hl[Tipp:] MORE THEMES!?! Take a look [here](https://github.com/hrbrmstr/hrbrthemes), 
[here](https://rstudio.github.io/thematic/index.html), [here](https://github.com/vankesteren/firatheme),
[here](https://jrnold.github.io/ggthemes/index.html) or [here](https://github.com/bbc/bbplot/).]

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer9-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer10-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) +
  geom_point(size = 2) +
  geom_line(size = 1) +
  scale_colour_brewer(palette = "Dark2") +
  scale_x_continuous(breaks = seq(1950, 2020, 10)) +
  labs(x = "\nYear", 
       y = "Ideal Point",
       color = "Country", 
       title = "State foreign policy ideal points from 1946 to 2020",
       subtitle = "Estimates based on votes in the UN General Assembly (Bailey et al. 2017)",
       caption = "Higher values indicate more 'Western' ideal points.") +
  hrbrthemes::theme_ipsum() + 
  theme(text = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.title = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.subtitle = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.background = element_rect(fill = "#f6f3f2", color = "#f6f3f2"), 
        panel.border = element_blank(), 
        axis.text = element_text(colour = "#415564"), 
        axis.title = element_text(colour = "#415564")) +
* annotate(geom = "curve", xend = 1990, yend = 0.7, x = 1983, y = 0.5,
*          curvature = -.3, arrow = arrow(length = unit(2, "mm")), color = "#415564") +
* annotate(geom = "text", x = 1971, y = 0.37, label = "End of Cold War", hjust = "left", color = "#415564") +
* annotate(geom = "curve", xend = 1994, yend = -1.7, x = 2003, y = -2.1,
*          curvature = -.4, arrow = arrow(length = unit(2, "mm")), color = "#415564") +
* annotate(geom = "text", x = 2004, y = -2.1, label = "Post-Tianmen \nSquare", hjust = "left", color = "#415564")
```
]

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer11-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
*ideal <- ideal %>%
* mutate(iso3c = fct_relevel(iso3c, c("USA", "GBR", "FRA", "DEU", "RUS", "CHN")))

ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c,
*                    shape = iso3c)) +
  geom_point(size = 2) +
  geom_line(size = 1) +
  scale_colour_brewer(palette = "Dark2") +
  scale_x_continuous(breaks = seq(1950, 2020, 10)) +
  labs(x = "\nYear", 
       y = "Ideal Point",
       color = "Country",
*      shape = "Country",
       title = "State foreign policy ideal points from 1946 to 2020",
       subtitle = "Estimates based on votes in the UN General Assembly (Bailey et al. 2017)",
       caption = "Higher values indicate more 'Western' ideal points.") +
  hrbrthemes::theme_ipsum() + 
  theme(text = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.title = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.subtitle = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.background = element_rect(fill = "#f6f3f2", color = "#f6f3f2"), 
        panel.border = element_blank(), 
        axis.text = element_text(colour = "#415564"), 
        axis.title = element_text(colour = "#415564")) +
  annotate(geom = "curve", xend = 1990, yend = 0.7, x = 1983, y = 0.5, 
           curvature = -.3, arrow = arrow(length = unit(2, "mm")), color = "#415564") + 
  annotate(geom = "text", x = 1971, y = 0.37, label = "End of Cold War", hjust = "left", color = "#415564") + 
  annotate(geom = "curve", xend = 1994, yend = -1.7, x = 2003, y = -2.1, 
           curvature = -.4, arrow = arrow(length = unit(2, "mm")), color = "#415564") + 
  annotate(geom = "text", x = 2004, y = -2.1, label = "Post-Tianmen \nSquare", hjust = "left", color = "#415564")
```
]

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer12-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# A Plot from Scratch

```r
*ideal_fin <- filter(ideal, year == 2020)

ggplot(data = ideal,
       mapping = aes(x = year,
                     y = IdealPointAll,
                     color = iso3c)) + 
  geom_line(size = 1) +
  scale_colour_brewer(palette = "Dark2") +
  scale_x_continuous(breaks = seq(1950, 2020, 10)) +
  labs(x = "\nYear", 
       y = "Ideal Point",
       color = "Country",
       title = "State foreign policy ideal points from 1946 to 2020",
       subtitle = "Estimates based on votes in the UN General Assembly (Bailey et al. 2017)",
       caption = "Higher values indicate more 'Western' ideal points.") +
  hrbrthemes::theme_ipsum() + 
  theme(text = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.title = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.subtitle = element_text(colour = "#415564", family = "IBM Plex Sans"), 
        plot.background = element_rect(fill = "#f6f3f2", color = "#f6f3f2"), 
        panel.border = element_blank(), 
        axis.text = element_text(colour = "#415564"), 
        axis.title = element_text(colour = "#415564"),
*       legend.position = "none",
*       axis.text.y.right = element_text(margin = margin(0, 0, 0, -20))) +
  annotate(geom = "curve", xend = 1990, yend = 0.7, x = 1983, y = 0.5, 
           curvature = -.3, arrow = arrow(length = unit(2, "mm")), color = "#415564") + 
  annotate(geom = "text", x = 1971, y = 0.37, label = "End of Cold War", hjust = "left", color = "#415564") + 
  annotate(geom = "curve", xend = 1994, yend = -1.7, x = 2003, y = -2.1, 
           curvature = -.4, arrow = arrow(length = unit(2, "mm")), color = "#415564") + 
  annotate(geom = "text", x = 2004, y = -2.1, label = "Post-Tianmen \nSquare", hjust = "left", color = "#415564") +
* scale_y_continuous(sec.axis = dup_axis(breaks = ideal_fin$IdealPointAll, labels = c("USA", "GBR", "FRA", "DEU", "RUS", "CHN"), name = NULL))
```
]

]

.pull-right[
<img src="data:image/png;base64,#05_Data_Viz_files/figure-html/layer13-out-1.png" width="504" style="display: block; margin: auto;" />
]

---

# Setting the theme globally

Adding a theme to every plot can be cumbersome.

Luckily, we can use `theme_set()` to set it globally.

Changing default fonts is also a bit fiddly (esp. on windows).

- You can use the `extrafont` package and see, e.g., [here](https://cran.r-project.org/web/packages/extrafont/README.html).

- Alternatively, you can use the package `showtext` to load fonts directly from Google fonts!

```r
theme_set(plex)
```

---

# Saving your work

The last step is now to save your plot.

We can do this with `ggsave`. Exports the last graphic you plotted or the plot object u specify.

```r
ggsave("ideal_points.png", width = 9, height = 7)
```

If we work with `.rmd` documents, we can just control the output using 
chunk options (but more on this later).

---

# Other Geoms

You can find all geoms here: [https://ggplot2.tidyverse.org/reference/#section-geoms](https://ggplot2.tidyverse.org/reference/#section-geoms).

---