Tag Archives: public health data

“Bad Blood” Demonstrates how a Lack of Product Description Leads to Data Science Misconduct: Part 2 of 5

In order to operationalize your data variables, you need to have clear product descriptions

This blog post explains how the lack of a clear product description led to data-related misconduct at Theranos: the company could never nail down exactly what it was trying to do.

Wondering if You Will Like Healthcare Research? Try This: Data Collection

Learn data science skills online in order to develop data collection materials

If you are not sure whether you will like doing research in healthcare, start with data collection instead of big data, and get to know the data as it comes into the dataset.

Healthcare Data Science Newbie Do-it-Yourself Starter Kit

The tools for healthcare data science include both descriptive and inferential statistics

Monika posts her “data science newbie do-it-yourself starter kit”, with links to cheap or free learning resources for the data science newbie who wants to get started in healthcare analytics.

Quality Improvement in Healthcare: What is the PDSA Model, and How Well Does it Work for QA/QI? Part 1 of 5

Continuous quality improvement through conducting research projects to get evidence to inform change

Wondering what the Plan-Do-Study-Act (PDSA) Model is, and whether you should adopt it for quality improvement in healthcare? Read my series of blog posts on the subject for my personal experience and recommendations.

Applying Rothman’s Causal Pie Model to the Death of George Floyd

Weighing relative causes visually is easier with Rothman's causal pie model

In the murder trial of Officer Derek Chauvin, the prosecution must demonstrate that the police officer’s knee on George Floyd’s neck constituted a “substantial” cause of Mr. Floyd’s death “beyond a reasonable doubt”. This presents a challenge in weighing relative causes of death, and this leads us essentially to causal inference. My blog post demonstrates […]

Why COVID-19 is Overrunning the US in Late 2020: Overlapping Epicurves

Data in simulated epicurves show frequencies and explain outbreak timing

While other countries have found a way to control their community spread of COVID-19 while waiting for the vaccine program to be implemented, the United States has totally failed at this. An epicurve is a diagram of the timing of an outbreak, and in other countries, this curve has been flattened. But in the United […]

Confused when Downloading BRFSS Data? Here is a Guide

I use datasets from the Behavioral Risk Factor Surveillance System (BRFSS) for demonstrations in a lot of my data science tutorials. The BRFSS datasets are free and available to the public, but they are kind of buried on the web site. This blog post serves as a “map” to help you find them!

I Installed the Free “SAS University Edition” and Here’s What I Recommend: Part 3

Install Free SAS University Edition: Set up your computer to use SAS for free

In this blog post, I walk you through SAS Download and Install steps 1 and 2. In those steps, you make a free account with SAS, and you download and install Oracle’s VirtualBox. The screenshots in this post will get you ready for what you will see when you do these steps.