Project Processbook

Basic Info:

Title: Visualizing St. Louis Crime Data

By:

Names Email ID
Alina Weng alina.w@wustl.edu 518839
Jasmine Diaz Jarquin diaz-jarquin@wustl.edu 509686
Soleyana Tekalgn s.k.tekalgn@wustl.edu 520813

Background and Motivation

St.Louis is notorious for its drastic differences in its neighborhoods. Considering this, we have decided to visualize just how drastically different these neighborhoods are by measuring safety. Concentration of crime and safety is a major issue, because these resources need to be spread out. By using the Dataset that shows crimes in St. Louis by neighborhood, our solution is to keep a count and specify categories of crimes that occur in specific neighborhoods. This matters because it can reveal more questions as to why these crime patterns occur and reveal the proximity of dangerous neighborhoods to safer neighborhoods.

Project Objectives:

Primary Question:

Learn and Accomplish:
From answering our primary question, we hope to learn if there is any correlation between the different neighborhoods and crime rates. From mapping out all the crime happening in St. Louis, we want to be able to show people which neighborhoods in St. Louis are considered safe and what is not.

Benefits:

Visualization Design:

Sketches

Sketches

Design #1

Design 1

Design #2

Design 2

Design #3

Design 3

Final Design:

Final Design

We ended up choosing this design as our final design because the filter box with the different drop-down options is the most user-friendly. The user can easily use the interface and apply the filter they want. The color scale used on the map, from green to red, is a more straightforward connection the user can make for determining which neighborhood is safe/unsafe. In addition to the line graph and bar graph that are on the final design, more variations of the graphs can be done. For example, if we want to focus on a certain neighborhood, Clayton, we can also do a line graph to see how the amount of different types of crime changes over time.

Must Have Features:

Optional Features:

Data:

We got our data from the St. Louis Metropolitan website, where it has the National Incident Based Reporting System (NIBRS) statistics of crime happening in St. Louis.

https://slmpd.org/stats/

Data Processing:

We’ll be using Jupyter notebooks + Python to clean and process the data and extract more metrics that we could visualize. There is data from 2008 - 2025. The format changes starting Jan. 2021, so we will have to analyze the differences between the formats and determine if we want to visualize both and how to handle them. There may be some typos and NA values in the data, so we will have to clean it. Some crimes may be reported years after they happened - we will have to handle that and ensure every crime is counted. We have to take into account administrative adjustments (may change the type of crime, ex., assault -> homicide). Take into account the x and y coordinate format. Do further investigation into what some of the values mean.

Project Schedule:

Date Deadline Notes What to Accomplish
10/27 - 10/2 10/27 Project Proposal Project Proposal. Start working on data wrangling.
11/03 - 11/9 Alina: 2 exams 11/6
Jasmine: exam 11/6
Soleyana: exam 11/5
Finish data wrangling. Get a working prototype (map and plot crimes on it).
11/10 - 11/16 11/10 Milestone 1 Jasmine: exam 11/12
Soleyana: exam 11/12
Add all the different features we want.
11/17 - 11/23 Making sure everything is interactive.
11/24 - 11/30 11/24 Milestone 2 Alina: won’t be in class 11/25
12/1 - 12/8 12/8 Project Due Jasmine: in STL
Alina: exam 12/4
Do final touches, make sure edge cases are handled.

Milestone 1:

Milestone 1 Design:

Milestone 1

What was done:

Milestone 2:

Milestone 2 Design:

Milestone 2 Milestone 2

What was done:

User Study Plan

Session 1: Think-Aloud

Participants freely explore the visualization while verbalizing their thoughts. We observe how they interpret markers, clusters, colors, and filters, and note any confusion during initial use.

Session 2: Task-Based Evaluation

Participants complete short, specific tasks to test usability and accuracy. Example tasks include these:

As they are doing the tasks, we will observe for any confusion.

Session 3: Feedback / Critique

Participants give open feedback about clarity, color choices, filter usability, and overall experience. We will ask them what felt intuitive, what was confusing, and what features they would improve or add.

Session 4: Debrief

We briefly explain the goal of the visualization, answer any remaining questions, and gather any other comments they have.

User Study Feedback

Final Submission

What's been changed since Milestone 2:

We decided to keep the same color palette because of we wanted it to be colorblind friendly according to the Tol palette thus accessible, even though the colors won't pop as much.

Related Works

Questions that we were trying to answer

The questions that we tried to answer throughout the project were:

We believe that the questions we tried to answer did not change much over time as we worked on the project. Yet we did consider other questions as we look deeper into the data we have. When we were doing data processing, we noticed that it has columns for when the crime happened and when it was reported. We noticed that many crimes happened in the 2000s but weren't reported until recent years, and of those crimes, many of them were assault crimes. At that time, we were thinking of doing a question like Do certain types of crimes (such as assaults) have longer reporting delays?. But due to the scope and focus of our project, we decided not to pursue this question further, at least not in this project.

Exploratory Data Analysis:

We explored some of the exisiting maps for crime in St. Louis and looked at what inspiration we would take to improve our own visualization. We liked how some maps had jurisdiction boundaries, but we thought we could improve them by adding a choropleth layers and supporting graphs to give more insight than seeing a lot of points of the graph. We also looked at some features that were common throughout them, such as legends, filters, and different icons for crime categories.

Design Evolution

At the beginning of the project, we considered different kinds of visualization to show crime intensity. We were considering between a heat map or a choropleth map. After consideration, we went with a choropleth map because it made it easier to see which neighborhoods have the most crime. A choropleth map takes in consideration real geographic boundaries, use a clear sequential color scale, and supports direct comparison between regions. Heat maps looked visually appealing, but they created smooth gradients that were harder to interpret and could suggest density in areas without actual crime data. This change helped align our visualization with perceptual design principles and better answer our research questions.

At the beginning of the project, we had planned to potentially just have the map and the line graph to visualize our data, but we realized that we needed a bar chart to visualize the crime frequency in different categories other than the selected filters.

Implementation

Map

Evaluation