accessibility

# accessibility
### 2021-11-12

---

# Welcome

---

## Announcements

- Nov 17: Guest lecture/live coding session with Ijeamaka Anyene
  - Topic: Computational art
  - To prepare: Watch the video [aRt + ggplot: exploring radial visualizations](https://www.youtube.com/watch?v=h0i6KAahLY8)
  - Add a question to sli.do by 11:30am on Nov 17: [sli.do](https://slido.com/) / #921159

--
- Dec 1: AMA with Thomas Lin Pedersen
  - Topic: Anything ggplot2 or data visualization
  - To prepare: Just keep doing what you're doing!
  - Add a question to sli.do by 11:30am on Dec 1: [sli.do](https://slido.com/) / #632366 (will be open Nov 25)

--
- Project proposals due 5pm today
  - Make sure to review [grading guidelines](https://www.vizdata.org/project-2.html#proposal-10-points-1)
  - Each person in team must commit at least once

--
- Office hours for the week of Thanksgiving -- move to earlier in the week or to following week?

---

## Setup

```r
# load packages
library(tidyverse)
library(scales)
library(colorblindr)
library(coloratio)

# set default theme for ggplot2
ggplot2::theme_set(ggplot2::theme_minimal(base_size = 16))

# set default figure parameters for knitr
knitr::opts_chunk$set(
  fig.width = 8, fig.asp = 0.618, fig.retina = 3, dpi = 300, out.width = "60%"
)

# dplyr print min and max
options(dplyr.print_max = 6, dplyr.print_min = 6)
```
]

---

do you remember?

---

## Flatten the curve

.small[
Source: The New York Times. [Flattening the Coronavirus Curve](https://www.nytimes.com/article/flatten-curve-coronavirus.html)
]

---

## Exponential spread

.small[
Source: The Washington Post. [Why outbreaks like coronavirus
spread exponentially, and how to "flatten the curve"](https://www.washingtonpost.com/graphics/2020/world/corona-simulator/)
]

---

## JHU  COVID-19 Dashboard

.small[
Source: [COVID-19 Dashboard
by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU)](https://coronavirus.jhu.edu/map.html)
]

---

what do they all have in common?

---

## Accessible COVID-19 statistics tracker

---

# Accessibility and screen readers

---

## Alternative text

> It is read by screen readers in place of images allowing the content and function of the image to be accessible to those with visual or certain cognitive disabilities.
>
> It is displayed in place of the image in browsers if the image file is not loaded or when the user has chosen not to view images.
>
>It provides a semantic meaning and description to images which can be read by search engines or be used to later determine the content of the image from page context alone.

---

## Alt and surrounding text

```
"CHART TYPE of TYPE OF DATA where REASON FOR INCLUDING CHART`

+ Link to data source somewhere in the text
```

--
- `CHART TYPE`: It's helpful for people with partial sight to know what chart type it is and gives context for understanding the rest of the visual.

--
- `TYPE OF DATA`: What data is included in the chart? The x and y axis labels may help you figure this out.

--
- `REASON FOR INCLUDING CHART`: Think about why you're including this visual. What does it show that’s meaningful. There should be a point to every visual and you should tell people what to look for.

--
- `Link to data source`: Don't include this in your alt text, but it should be included somewhere in the surrounding text.

.footnote[
Source: [Writing Alt Text for Data Visualization](https://medium.com/nightingale/writing-alt-text-for-data-visualization-2a218ef43f81)
]

---

## Data

- Registered nurses by state and year
- Number of nurses, salaries, employment
- Source: [TidyTuesday](https://github.com/rfordatascience/tidytuesday/tree/master/data/2021/2021-10-05)

```r
nurses <- read_csv("data/nurses.csv") %>% janitor::clean_names()

names(nurses)
```

```
##  [1] "state"                                       
##  [2] "year"                                        
##  [3] "total_employed_rn"                           
##  [4] "employed_standard_error_percent"             
##  [5] "hourly_wage_avg"                             
##  [6] "hourly_wage_median"                          
##  [7] "annual_salary_avg"                           
##  [8] "annual_salary_median"                        
##  [9] "wage_salary_standard_error_percent"          
## [10] "hourly_10th_percentile"                      
## [11] "hourly_25th_percentile"                      
## [12] "hourly_75th_percentile"                      
## [13] "hourly_90th_percentile"                      
## [14] "annual_10th_percentile"                      
## [15] "annual_25th_percentile"                      
## [16] "annual_75th_percentile"                      
## [17] "annual_90th_percentile"                      
## [18] "location_quotient"                           
## [19] "total_employed_national_aggregate"           
## [20] "total_employed_healthcare_national_aggregate"
## [21] "total_employed_healthcare_state_aggregate"   
## [22] "yearly_total_employed_state_aggregate"
```
]

---

## Bar chart

.pull-left[
<img src="21-accessibility_files/figure-html/unnamed-chunk-5-1.png" width="100%" />
]
.pull-right[
- Provide the title and axis labels
- Briefly describe the chart and give a summary of any trends it displays
- Convert bar charts to accessible tables or lists
- Avoid describing visual attributes of the bars (e.g., dark blue, gray, yellow) unless there's an explicit need to do so 
]

---

## Developing the alt text

- Total employed registered nurses in three states over time.

--
- Total employed registered nurses in California, New York, and North Carolina, in 2000, 2010, and 2020.

--
- A bar chart of total employed registered nurses in California, New York, and North Carolina, in 2000, 2010, and 2020, showing increasing numbers of nurses over time.

--
- The figure is a bar chart titled "Total employed Registered Nurses" that displays the numbers of registered nurses in three states (California, New York, and North Carolina) over a 20 year period, with data recorded in three time points (2000, 2010, and 2020). In each state, the numbers of registered nurses increase over time. The following numbers are all approximate. California started off with 200K registered nurses in 2000, 240K in 2010, and 300K in 2020.  New York had 150K in 2000, 160K in 2010, and 170K in 2020. Finally North Carolina had 60K in 2000, 90K in 2010, and 100K in 2020.

---

## Line graph

.pull-left[
<img src="21-accessibility_files/figure-html/unnamed-chunk-6-1.png" width="100%" />
]
.pull-right[
- Provide the title and axis labels
- Briefly describe the graph and give a summary of any trends it displays
- Convert data represented in lines to accessible tables or lists where feasible
- Avoid describing visual attributes of the bars (e.g., purple, pink) unless there's an explicit need to do so 
]

---

## Scatterplot

.pull-left[
<img src="21-accessibility_files/figure-html/unnamed-chunk-7-1.png" width="100%" />
]
.pull-right[
Scatter plots are among the more difficult graphs to describe, especially if there is a need to make specific data point accessible.

- Identify the image as a scatterplot
- Provide the title and axis labels
- Focus on the overall trend
- If it's necessary to be more specific, convert the data into an accessible table
]

---

## Recommended reading

[Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content](http://vis.csail.mit.edu/pubs/vis-text-model/)

Alan Lundgard, MIT CSAIL  
Arvind Satyanarayan, MIT CSAIL

IEEE Transactions on Visualization & Computer Graphics (Proceedings of IEEE VIS), 2021

>To demonstrate how our model can be applied to evaluate the effectiveness of visualization descriptions, we conduct a mixed-methods evaluation with 30 blind and 90 sighted readers, and find that these reader groups differ significantly on which semantic content they rank as most useful. Together, our model and findings suggest that access to meaningful information is strongly reader-specific, and that research in automatic visualization captioning should orient toward descriptions that more richly communicate overall trends and statistics, sensitive to reader preferences.

---

# Accessibility and colors

---

## Color scales

Use colorblind friendly color scales (e.g., Okabe Ito, viridis)

.panelset.sideways[
.panel[.panel-name[Code]

```r
nurses_subset %>%
  ggplot(aes(x = year, y = hourly_wage_median, color = state)) +
  geom_point(size = 2) +
* colorblindr::scale_color_OkabeIto() +
  scale_y_continuous(labels = label_dollar()) +
  labs(
    x = "Year", y = "Median hourly wage", color = "State",
    title = "Median hourly wage of Registered Nurses"
  ) +
  theme(
    legend.position = c(0.15, 0.75),
    legend.background = element_rect(fill = "white", color = "white")
    )
```

]

]
]

---

## The default ggplot2 color scale + deuteranopia

Deuteranopia: A type of red-green confusion

<img src="21-accessibility_files/figure-html/default-ggplot2-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with deuteranopia**

<img src="images/deuteranopia.png" width="100%" />
]

---

## The default ggplot2 color scale + tritanopia

Tritanopia: A type of yellow-blue confusion

<img src="21-accessibility_files/figure-html/unnamed-chunk-10-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with tritanopia**

<img src="images/tritanopia.png" width="100%" />
]

---

## Testing for colorblind friendliness

- Best way to test is with users (or collaborators) who have these color deficiencies

- Simulation software also helps, e.g. Sim Daltonism for [Mac](https://michelf.ca/projects/sim-daltonism/) and [PC](https://pcmacstore.com/en/app/693112260/sim-daltonism)

---

## Color contrast

- Background and foreground text should have sufficient contrast to be distinguishable by users with different vision

- Web app for checking color contrast checking: [Colour Contrast Analyser](https://www.tpgi.com/color-contrast-checker/)

- An WIP R package for checking for color contrast: [**coloratio**](https://matt-dray.github.io/coloratio)

```r
cr_get_ratio("black", "white")
```

```
## [1] 21
```

```r
cr_get_ratio("#FFFFFF", "#000000")
```

```
## [1] 21
```

```r
cr_get_ratio("black", "gray10")
```

```
## Warning in cr_get_ratio("black", "gray10"): Aim for a value of 4.5 or higher.
```

```
## [1] 1.206596
```

---

## Double encoding

Use shape *and* color where possible

<img src="21-accessibility_files/figure-html/unnamed-chunk-13-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with deuteranopia**

<img src="images/deuteranopia-shape.png" width="100%" />
]

---

## Use direct labeling

- Prefer direct labeling where color is used to display information over a legend

- Quicker to read

- Ensures graph can be understood without reliance on color

---

## Without direct labeling

<img src="21-accessibility_files/figure-html/unnamed-chunk-15-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with deuteranopia**

<img src="images/deuteranopia-no-direct-labeling.png" width="100%" />
]

---

## With direct labeling

<img src="21-accessibility_files/figure-html/unnamed-chunk-17-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with deuteranopia**

<img src="images/deuteranopia-direct-labeling.png" width="100%" />
]

---

## Use whitespace or pattern to separate elements

- Separate elements with whitespace or pattern

- Allows for distinguishing between data without entirely relying on contrast between colors

---

## Without whitespace

<img src="21-accessibility_files/figure-html/unnamed-chunk-19-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with tritanopia**

<img src="images/tritanopia-no-separate.png" width="100%" />
]

---

## With whitespace

<img src="21-accessibility_files/figure-html/unnamed-chunk-21-1.png" width="100%" />
]
.pull-right[
**Default ggplot2 scale with tritanopia**

<img src="images/tritanopia-with-separate.png" width="100%" />
]

---

.task[
Find a colorblind friendly color palette other than viridis and those in the colorblindr package. This could be an R package that offers a colorblind friendly color palette or it could be just a palette you find online. Apply it to the scatterplot you previously wrote alt text for.
]

---

# Accessibility and fonts

---

## Accessibility and fonts

- Use a font that has been tested for accessibility (e.g., [Atkinson Hyperlegible](https://brailleinstitute.org/freefont))

--
- Keep plot labels and annotations similarly sized as the rest of your text (e.g., `ggplot2::theme_set(ggplot2::theme_minimal(base_size = 16))`)

---

.footnote[
Source: [A Comprehensive Guide to Accessible Data Visualization](https://www.betterment.com/resources/accessible-data-visualization/)
]

---

## Acknowledgements

- COVID visualization examples:
  - The New York Times. [Flattening the Coronavirus Curve](https://www.nytimes.com/article/flatten-curve-coronavirus.html)
  - The Washington Post. [Why outbreaks like coronavirus
spread exponentially, and how to "flatten the curve"](https://www.washingtonpost.com/graphics/2020/world/corona-simulator/)
  - [COVID-19 Dashboard
by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU)](https://coronavirus.jhu.edu/map.html)
  - T. Littlefield (2020) [COVID-19 Statistics Tracker](https://cvstats.net)

- Lundgard, Alan, and Arvind Satyanarayan. ["Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content."](https://ieeexplore.ieee.org/abstract/document/9555469) IEEE transactions on visualization and computer graphics (2021).

- [A Comprehensive Guide to Accessible Data Visualization](https://www.betterment.com/resources/accessible-data-visualization/)

- Silvia Canelón and Liz Hare. [Revealing Room for Improvement in Accessibility within a Social Media Data Visualization Learning Community](https://spcanelon.github.io/csvConf2021/slides/#1)

---

.task[
Revisit your Project 2 proposals with accessibility guidelines in place and use the remaining time for checking for and implementing as many of the accessibility guidelines as relevant to your project.
]