Project Report: Contains a detailed report of our project including motivation, analyses, and conclusions.
Interactive Map: Contains an interactive map showing the geographical distribution of death due to drug overdose. It will also show the corresponding age and drug type distribution.
Github repo & Data source: Click the icons at the right top corner to see the data source and our github repo.
For a brief overview of our website and project, please view our screencast:
Motivations
According to a study, illicit drug use in the United States has been increasing and the number of related death is also growing. Therefore, studying the relationship between drug use and related death is important to discourage drug use and prevent death. In this project, we are going to analyze the data in drug use and related death in Connecticut from 2012 to 2018 and intend to discover some critical factors in drug use that will arise high chance to be dead.
Data
The dataset contains a listing of accidental death associated with drug overdose in Connecticut from 2012 to 2018. It was obtained from an investigation by the Office of the Chief Medical Examiner which includes the toxicity report, death certificate, as well as a scene investigation.
What was the distribution in race, gender, and age among death related to drug use?
Has the patterns of drug changed over recent years?
Was the number of death due to drug use rising, declining, or steady?
Was there any pattern of geographical distribution among death due to drug use?
Main findings and conclusions
People in 30 to 40 years old have most drugs related deaths.
More than 80% death caused by overdose of drugs involves Fentanyl.
Women older than 40 years old have higher risk of death when using drugs.
POMs substitute for popular natural drugs causes doubled death in Connecticut in 6 years.
Discussion
Actions should be taken to tackle the drug issue in Connecticut, especially for heroine, opioid and fentanyl, as they are now the most prevalent drugs in most of the areas. Police intervention should be considered in the areas or locations where drugs are abused.
There are still some limitations of this dataset. It only includes the death cases but no controls (those who involved in drugs but still alive). Moreover, the dataset is more suitable for exploratory data analysis since there is no any meaningful outcome variable for modelling. The causes of death (‘cod’) might add useful findings but they were probably recorded by police officers, which are hard for categorization. Additional information regarding the diseases and lifestyle should be considered in future studies. Note that there are only 208 geographical coordinates for 5087 samples.