Bechdel#

A simple tool for detecting gender bias in Hollywood movies is the Bechdel test, which originated in Alison Bechdel’s comic strip “Dykes To Watch Out For”. A movie passes the test if it has: at least two women in it who talk to each other about something besides a man This data set contains information about movies, whether they pass or fail the Bechdel test, and financial information for those movies.

From the authors: “Audiences and creators know that on one level or another, there’s an inherent gender bias in the movie business — whether it’s the disproportionately low number of films with female leads, the process of pigeonholing actresses into predefined roles (action chick, romantic interest, middle-aged mother, etc.), or the lack of serious character development for women on screen compared to their male counterparts. What’s challenging is quantifying this dysfunction, putting numbers to a trend that is — at least anecdotally — a pretty clear reality.”

Initialization#

library(fosdata)
data <- fosdata::bechdel

Accessing fields#

data <- fosdata::bechdel
domgross <- data$domgross # Just a random field in the dataset

Interactive R Sample#

You can use the R editor below to interactively explore the dataset and generate plots. This contains a fully self-contained R environment with fosdata, ggplot2, and dplyr loaded.

webR + fosdata Test

Console
Plot

    
No plot generated yet.
scatterplot

LLM instructions#

If using an LLM, you can copy-paste the following instructions to accompany your prompt to inform the model of the fields and their types in the dataset.

LLM Instructions
The fosdata::bechdel dataset containing the following fields:

fields[15]{name,type,values}:
  year,integer,n/a
  imdb,character,n/a
  title,character,n/a
  test,factor,n/a
  clean_test,factor,[notalk,ok,men,nowomen,dubious]
  binary,factor,[FAIL,PASS]
  budget,numeric,n/a
  domgross,numeric,n/a
  intgross,numeric,n/a
  code,factor,n/a
  budget_2013,numeric,n/a
  domgross_2013,numeric,n/a
  intgross_2013,numeric,n/a
  period_code,integer,n/a
  decade_code,integer,n/a

Fields#

Name Description Type Min Max Values
year Movie release year. integer 1970 2013 -
imdb Movie ID number at imdb.com. character - - -
title Movie title. character - - -
test The full Bechdel test status of the film, as rated by BechdelTest.com. factor - - -
clean_test A simplified Bechdel test status. Contains levels dubious, men, notalk, nowmen, and ok. factor - - notalk, ok, men, nowomen, dubious
binary The Bechdel test stautus simplified to PASS/FAIL. factor - - FAIL, PASS
budget The film’s budget, in US dollars, when made. numeric 7000 425000000 -
domgross Domestic earnings (US, Canada, PR, Guam). numeric - 760507625 -
intgross International earnings. numeric 828 2783918982 -
code Combination of year and binary variables. factor - - -
budget_2013 The film’s budget, in 2013 US dollars. numeric 8632 461435929 -
domgross_2013 Domestic earnings in 2013 US dollars. numeric 899 1771682790 -
intgross_2013 International earnings in 2013 US dollars. numeric 899 3171930973 -
period_code Movie release year segmented into five periods, 1=2010-2014, 2=2005-2009, etc. integer 1 5 -
decade_code Movie release year segmented into three decades, 1=2010s, 2=2000s, 3=1990s. integer 1 3 -

Source#

“The Dollar-And-Cents Case Against Hollywood’s Exclusion of Women”, Walt Hickey, FiveThirtyEight, April 1, 2014. https://github.com/fivethirtyeight/data/tree/master/bechdel