The Bechdel Test: Assessing 7 Decades of Representation in Movies
Members
- Samika Kikkeri, 457601, s.kikkeri@wustl.edu
- Vishesh Patel, 455214, vppatel@wustl.edu
Project Repository: https://github.com/washuvis/bvis
Overview and Motivation
The Bechdel test checks to see if a woman in a movie's character is entirely about the men in the movie.
It specifically asks three questions of movies: 1. Are there two or more female characters in the story?
2. Do they talk to each other, and
3. Do they at any point have a conversation unrelated to a man?
This is an important baseline test to diagnose representation in movies though the criteria for
passing or failing the test might seem simple to most,
the amount of popular movies and works of fiction that fail this test are numerous.
Hollywood movies are extremely influential to US culture, and in many cases,
emphasize stereotypes and cultural norms. We wanted to design a webpage that
makes use of the Bechdel test to highlight these disparities in movies through our data visualizations
and try to map trends across time.
Related Work or Inspirations
We were inspired to do this project as 50% of our two person group identifies as female, and 100% of people who
watch movies are affected by the influence that movies have on society and cultures worldwide. As of milestone
1, we don't have any specific studies that we saw that inspired us besides viewing this as an interesting topic
and one that would be suitable to study with data visualization. However, if you've seen Avengers: Endgame, there's a
certain scene in the third act of the movie where all the female Avengers band together which raises some questions
as to what fair representation for women in movies looks like, especially in that movie.
Research Questions
Our preliminary research questions are as follows:
- How do different decades compare and contrast in terms of proportion of movies that pass or fail the Bechdel test?
- How do the revenues of movies that pass/fail the test within a given year compare? Do movies that fail the test make more or less money than those that pass?
- Do movies that pass the Bechdel test tend to have higher ratings than those that fail?
- Do movies of a particular genre tend to have more movies that pass or fail the test?
Data sources and manipulation
As of milestone one, we've used the following datasets:
- The Bechdel Dataset, found here.
- The Movies Dataset by Rounak Banik, found here.
Here are some sources we used to help write the code.
- For Ratings Bubble Chart - //Website used to reference Circular Packing: https://www.d3-graph-gallery.com/graph/circularpacking_group.html
Cite : Author: Yan Holtz, Date: 2019, Date Accessed: December 6th, 2020, Title: Grouped circular packing in d3.js
Note: This circular packing graph is inspired by this code
- Course textbook (general guidance)
- Previous assignments (general guidance and code referral)
Libraries used:
- d3 js
- d3 tip
- bootstrap
- jquery
The Bechdel Dataset is a website that has data on around 8000 movies.
Anyone is able to submit movies along with data on their Bechdel rating (between 0-3),
and each entry also includes information on a movie's imdb value and rating.
The Movies Dataset contains data on over 45,000 movies and has lots of metadata for each movie.
We matched entries in the Bechdel set with entries in the Movies set by imdb id and kept in information
about revenue, budget, genre, and imdb user votes.
Exploratory Data Analysis
In our initial design we thought to use a bar chart to determine if there were differences
between average Bechdel rating within decades/years to compare to one another. It's difficult
to glean anything of value at that high level given the nature of the Bechdel dataset being
operated by a community, and some years having significantly more movies available than others.
We then decided to switch up our approach to have our visualizations to
allow users to get deeper into the data and put each decade/year more on an even field with the others.
Design Evolution
We started with the idea of having a bar chart be the main controller for the other visualizations.
The idea was to start at a high level, and then go deeper into the data and time periods, down to the movie,
as the user interacts with the charts. Here are two images of that concept:
We realized that we could be more creative with how we hook a user onto our webpage,
and that there are better ways to do that. We decided instead to create a visualization based on
nodes and thematically tie it into movies. Each node would represent a year, sized by proportion of
movies that pass/fail the test and colored by decade. Clicking on one of these takes the user further
down the page to see more information about that particular year and decade through the other visualizations
Here are some sketches that indicate our ideas for the orientation
As of milestone one, here is our current primary visualization:
it utilizes the concept of the spotlight sketch but flipped over.
This visualization only includes the nodes from the 90s for Milestone 1, but will be reshaped
into a different movie-related object or fancier design once we reintroduce the other 60 nodes.
We also intend to adjust the positioning of the svg and images to be responsive and centered.
For milestone 2, we will also include sketches and implementation of the search and compare visualization.
It will include a form that allows users to try and search for movies within our database (from any year available),
and then display all the different attributes we have such as pass/fail, budget/revenue, genres, rating, etc.
As of Milestone 2, here is our updated initial visualization, designed to emulate reserving seats from a theater
online.
We also added in a search function that we stated our intentions for in the milestone 1 push. It displays
four categories of data in four different quadrants (general, financial, Bechdel, and popularity/rating).
Here is a sketch of the initial thought process, as well as a screenshot of the execution.
Implementation
We've covered the changes and the implementation strategy behind the primary visualization.
We also created a line chart that draws two lines over the span of a decade: the points on the lines
represent the cumulative revenue of movies that pass or failed the Bechdel test in a given year.
This revenue line chart correlates revenue to the movies that passed and movies that failed the test in a
given year, and allows users to see trends in whether or not movies that pass or fail the test make more money
or not. We calculated the sum of movies that pass/failed the test in a year and put them into a csv.
Here you'll find our initial sketches for the revenue chart as well as the current prototype
In our update for milestone 2, we intend to
restyle the axes on the chart, make the points
more distinct, and add an interactive
tooltip component that will show a user the exact value of
a given point on hover.
We implemented a bubble chart that groups the movies that pass or fail the Bechdel test for a clickable
year. In Milestone 1, we have the data for each year in the movies from the 90s. Here you'll find our initial
sketches as well as the current prototype.
For milestone 2, for interactivity, we will
implement a tooltip that shows the user (on hovering over a node) what movie they are hovering on. We will also
fix the chart to have an encompassing green and red circle over the other nodes and adjust opacity, as well as
increase the overall size, positioning, and responsiveness (to adjusting screen dimensions) of the chart.
For milestone 2, we added an interactivity element and converted to a pack chart to prevent nodes from overlapping.
We wanted to also determine if certain genres had more movies that passed or failed the Bechdel test.
In our proposal we envisioned this as being a pie chart where we had genres occupy a proportion of the
overall circle. Here you'll find our preliminary sketches for the idea:
We realized that it would be more effective to convey this information in a bar chart, with genres
on the x-axis and count of movies that pass/fail on the y-axis. Each genre would have two bars, one
for passing and one for failing movies. Here is a sketch of what the revised version would look like
Our final version will implement two bars along with a tooltip that shows how many movies pass/fail
a category on hover.
The milestone 1 and 2 versions can be seen here, with the milestone 2 version having both bars per genre.
Evaluation
We believe our visualizations can provide answers to all of the questions of interest we outlined when starting our project.
The visualizations all provide different kinds of insights into the data surrounding movies, and a user can easily access our website
and interact with the visualizations, movie data for different years/decades, and even specific movies as well. Our tool provides a lot of
opportunities for user engagement whether they're curious about multiple years' worth of data or individual movies themselves, and our nav bar
which we added after user feedback gives them a quick way to move from visualization to visualization.
From our visualizations, we were able to see different trends in years/decades. If you click on one of the earlier decades and compare them to
a later year (such is starting with a year in the 60s vs a year in the 2010s), you can see the visual difference in how many movies
passed or failed the Bechdel test through our bubble chart. The bar chart is also a place to draw interesting conclusions from as some genres
like action have a high proportion of movies that fail the Bechdel test across multiple years. Of course, our tool would be better with more accurate
and complete data though from a conceptual standpoint it does what it sets out to do.
In user testing we received feedback on clarifying what the Bechdel test was, and also feedback surrounding styling, clarity, and usability.
We implemented the navbar, updated some styles, and added more information wherever we could such as legends and tooltips to help make the
site more interactive and clear. If we were to further improve, we would overhaul the styling and make the graphs and colors look a bit cleaner
as well as make some small adjustments to how some numbers are displayed.