Skip to main content

Coronavirus: Some Thoughts on the Quarantine

It's been some 45 days since the White House declared COVID-19 a National Emergency and the bulk of us became largely confined to our homes. As people begin to agitate for the end of the quarantine and a number of states begin to open up, I thought I'd offer up a loose collection of data-driven thoughts on the quarantine we've all been experiencing. These insights are based on mobility data measured as Vehicle Miles Traveled (VMT). While this is an imperfect metric, we can use the reduction in VMT since before the pandemic as a rough proxy of how effectively we have been adhering to the quarantine.

The Quarantine Was Effective, but Effectiveness Varied by State


The first observation is that the quarantine was effective, which is probably obvious from a quick glance outside. Average mobility by county fell by approximately 65% when compared to January (the troughs you see are weekends).

Some states adhered to the quarantine more seriously than others

Here, we can observe that the Southern states reduced their movement less than the rest of the country. A similar phenomenon can be observed to a lesser extent among the Western states. What could explain this disparity? Let's discuss the obvious: the Northeastern states like New York, New Jersey, and Massachusetts have been hit hardest by COVID-19. Southern and Western states, less so. Obviously, you would expect that states hit hardest by the virus would observe the quarantine the most. Interestingly enough, if you control for a few key variables like population density (more on that later) and education, there is very little relationship between COVID-19 infection rate and mobility. For instance, doubling Mississippi's COVID-19 infection rate would decrease mobility by less than half a percentage point when education and population density are taken into account.

Let's try a different tact. Recall that mobility has been defined as Vehicle Miles Traveled (VMT). The West and South have less population density than the rest of the county and their inhabitants are far more spread out on average, so mobility reduction measured as reduction in VMT may have been capped (i.e. basic amenities like the grocery store are much further away). So we can develop a model that allows us to control for population density and imagine a world where every county in the United States has the same population density. Then we can calculate mobility in this hypothetical world. The results are as follows:
As you can see, the Western states fare considerably better when adjusted for population density -- on par with the Northeast in fact. On the other hand, the Southern states continue to lag behind despite this adjustment, despite having more confirmed cases and deaths than their Western brethren. It could very well be that states in the deep South took the quarantine less seriously than the rest of the country.

Infection and Mobility 

Earlier I commented that there appears to be a negligible relationship between infection rate and mobility, especially once certain variables are taken into account. This actually presents a great opportunity to talk about panel data -- which is data that has a temporal component. The data I am using has mobility and infection rate over time by county. If you plot all of the data available, you get the following: 
From the looks of it, there appears to be a pretty strong negative relationship between infection rate and mobility -- as infection rate increases, counties observe the quarantine more strictly. But recall from our first graph there was a temporal trend to mobility -- as time went on, mobility fell (probably independently of infection rate) as the country adjusted to the quarantine. That trend is also captured in the graph above. To get around that, we can consider looking at a "snapshot" of each day for which we have data and see across counties, do counties with higher infection rates have higher mobility reduction? Below is the result: 


Here, we see a much less clear relationship between infection rate and mobility reduction. What little relationship remains disappears in a model that controls for other variables of interest. It does not readily appear that counties with more COVID-19 cases observed the quarantine more than those that did not -- other factors were more important in determining the extent to which a county observed the quarantine.  

Political Leanings and the Quarantine

Unfortunately, from the get-go, COVID-19 was highly politicized. President Trump initially compared the virus to the common flu, called concerns about it a democratic "hoax", and declared that the death rate was lower than projected by public health officials based on nothing more than a "hunch." There is evidence that this initial downplaying of the virus by the administration had a downstream effect -- polls show a partisan divide on the dangers of COVID-19. This effect also bears out in the mobility data.

As we can see, counties that voted blue were more likely to observe the quarantine, and to a greater extent. This relationship persists when variables like education, population density, and income are controlled for. 



Comments

Popular posts from this blog

Why isn't Robinhood letting me trade? (hint: there's probably not a conspiracy against you)

Today's been a big day in the stock market . Lots of people have lost a lot of money, and a lot of people are understandably really upset . Here's a quick breakdown of what's happened so far A subreddit called /r/wallstreetbets  (visit at your own peril), which has exploded in popularity recently and has over 5 million subscribers (and counting) got really excited about three stocks: GME (Gamestop), AMC (the movie theater place), and BB (Blackberry). Gamestop was the main stock.  Yes, I know all three companies are doing terribly in the real world. I won't go into why they got excited about the stocks here.  They convinced a lot of other people to buy the stocks and they did well. Really well. Take a look at their Yahoo Finance pages and look at their 1 month price charts (then ignore the past two days). GME , BB , AMC Everyone got in on it, and I mean it. When a lot of people buy a single stock, the price rises. It turns out, this was hurting a lot of Hedge Funds and I

Determining NFL Quarterback Archetypes (with stats!)

We're obsessed with grouping things together. We self-select each other into groups based on which political candidate we support, which sports team we root for, and which arbitrary country we're born in. People also spend hours on the internet arguing over "tiers", or groupings, of their favorite athletes and sports teams. For example, which NBA players are "elite" vs. "great" vs. just "good"? Did Carmelo Anthony belong  on the Banana Boat ? When engaging in these arguments, we typically use statistics like points or rebounds per game to back up our points, but at the end of the day, the groups are more or less kind of arbitrary.  But what if there was a way to algorithmically sort observations into groups based on shared characteristics using machine learning methods? Enter clustering , which is the methodology of grouping similar observations into groups, or "clusters", using a mathematical distance metric derived from a set

Analyzing Hip Hop - Who's Most Lyrical, What Determines Popularity, and More

Have you ever thought about bringing cold, hard statistics to one of life's greatest artistic joys? Well fear not, because in our increasingly data-driven world, our analyst friends are hard at work attempting to statistisize (numerize?) everything you can think of, so we can analyze and therefore optimize it. One of the art realms that is increasingly falling under the purview of data science is music. We all benefit from it in the form of curated daily Spotify playlists and Pandora stations that allow us to find new artists and songs.  I was recently able to get my hands on a Spotify dataset  that contains data on over 160k tracks dating from 1921 through December 2020. Aside from containing some basic features like track name, duration, and release date, it also contains some advanced metrics as calculated by Spotify like "track positivity" (is it a sad, depressed song, or a happy, positive song?), "danceability", "energy", "speechiness" (