Campus Pride Index Analysis

I’ve decided to finally take the leap to learn R by participating in Tidy Tuesday! My professor, Jo Hardin, has helped me learn the basics of R. So now, I want to put these skills to the test with some straightforward data-viz.

Here is some background information on the data for this project, if you wish to look into the data:

Campus Pride. 2007. Campus Pride Index. Campus Pride. Retrieved from https://www.campusprideindex.org

“The Campus Pride Index is owned and operated by Campus Pride, a leading national nonprofit dedicated to fostering safer and more LGBTQ-friendly learning environments at colleges and universities. Supported by the Campus Pride Q Research Institute for Higher Education, the index benefits from strategic partnerships in higher education and LGBTQ-related organizations. The index aims to advance LGBTQ-inclusive policies, programs, and practices in higher education.”

I am intersted in the correlation between size and school rating - I grew up in a small town, and can guarantee that size can jnfluence how you express your identity!

Below, you will find a box plot highlighting the Campus Pride Index’s scores varying across campus community sizes. There is a correlation between community size and ratings, with smaller communities generally scoring lower among queer students (with a few notable exceptions).

library(tidyverse)
library(dplyr)
library(httr2)

tuesdata <- tidytuesdayR::tt_load(2024, week = 24)

pride_index <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2024/2024-06-11/pride_index.csv')
pride_index_tags <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/master/data/2024/2024-06-11/pride_index_tags.csv')

# pulling column names to check data type
colnames(pride_index)
# pulling variables of inerest
community_type <- pride_index$community_type
rating <- pride_index$rating

# plotting!
ggplot(data = pride_index, aes(x = community_type, y = rating)) +
  geom_boxplot() +   
  labs(
    x = "Community Type",
    y = "Rating",
    title = "Campus Pride Index, Sorted by Community Size"
  ) + theme_minimal()



And there we go! It’s clear that this correlation is prevalent in this data set, though there are some notable outliers! This data set is a really cool tool, and I;ll definitely use it in the future if I ever plan on visiting/returning to academics in the future.