Amelia McNamara | Working with categorical data in R without losing your mind | RStudio (2019)
Categorical data, called “factor” data in R, presents unique challenges in data wrangling. R users often look down at tools like Excel for automatically coercing variables to incorrect datatypes, but factor data in R can produce very similar issues. The stringsAsFactors=HELLNO movement and standard tidyverse defaults have moved us away from the use of factors, but they are sometimes still necessary for analysis. This talk will outline common problems arising from categorical variable transformations in R, and show strategies to avoid them, using both base R and the tidyverse (particularly, dplyr and forcats functions). VIEW MATERIALS http://www.amelia.mn/WranglingCats.pdf (related paper from the DSS collection) http://bitly.com/WranglingCats https://peerj.com/collections/50-prac... About the Author Amelia McNamara My work is focused on creating better tools for novices to use for data analysis. I have a theory about what the future of statistical programming should look like, and am working on next steps toward those tools. For more on that, see my dissertation. My research interests include statistics education, statistical computing, data visualization, and spatial statistics. At the moment, I am very interested in the effects of parameter choices on data analysis, particularly data visualizations. My collaborator Aran Lunzer and I have produced an interactive essay on histograms, and an initial foray into the effects of spatial aggregation. I talked more about spatial aggregation in my 2017 OpenVisConf talk, How Spatial Polygons Shape Our World.

Barret Schloerke | Reactlog 2.0 Debugging the state of Shiny | RStudio (2019)

Working with factors and categorical variables. Use forcats in R programming to change factor levels

Tom Mock | A Gentle Introduction to Tidy Statistics in R | RStudio (2019)

Hadley Wickham: Managing many models with R

How to explore Categorical Data in R ✔️ Exploring categorical data in the R programming language

Variables and Types of Variables | Statistics Tutorial | MarinStatsLectures

Learn R in 39 minutes
![[DSC 4.0] What Makes a Good Function - Hadley Wickham keynote](https://i.ytimg.com/vi/Qne86lxjgtg/hqdefault.jpg?sqp=-oaymwEnCNACELwBSFryq4qpAxkIARUAAAAAGAElAADIQj0AgKJDeAG4AvMY&rs=AOn4CLB6p8qY-MICiOOEgVcLE7J4bF69uA&usqp=CCY)
[DSC 4.0] What Makes a Good Function - Hadley Wickham keynote

Dealing with MISSING Data! Data Imputation in R (Mean, Median, MICE!)

Introduction to ggplot in R

Eric Nantz | Effective use of Shiny modules in application development | RStudio (2019)

R/Database: Using R at Scale on Database Data

R Ladies Baltimore | Make your R code purr with purrr

Trump Gets Booed & Falls Asleep During NBA Finals, Claims War is Almost Over & Goodbye Spencer Pratt

Factors and the forcats package | Further Data Analysis with R (Lesson 2)

Edgar Ruiz | Databases using R The latest | RStudio (2019)

EMBL Keynote Lecture - Data visualization and data science, Hadley Wickham

Factor Variables in R

Emily Robinson - The Lesser Known Stars of the Tidyverse

