Tag Archives: data stewardshiip

Adding Error Bars to ggplot2 Plots Can be Made Easy Through Dataframe Structure

Error bars on plots can provide the audience an estimate of the amount of certainty you have with your estimates.

Adding error bars to ggplot2 in R plots is easiest if you include the width of the error bar as a variable in your plot data. Read my blog post to see an example.

US Public Health Alphabet Soup Explained: What is the ONC?

Before the ONC office was established in 2009, there was no federal oversight of medical record systems.

“What is the ONC?” is what I used to ask before I realized it involves health technology. Although ONC just means “Office of the National Coordinator”, this agency is now known as HealthIT.gov, as I explain in my blog post.

Querying the GHDx Database: Demonstration and Review of Application

Many data scientists interested in health are looking to query the Global Burden of Disease database, also known as the GHDx

Querying the GHDx database is challenging because of its difficult user interface, but mastering it will allow you to access country-level health data for comparisons! See my demonstration!

“Bad Blood” is a Lesson in How Bad Leadership Leads to Bad Data: Part 4 of 5

Data science leaders do not always realize they are responsible for writing policies about governance and data stewardship

As a data science leader, what should you put in place so your organization doesn’t end up a data mess like startup Theranos? This blog posts provides guidance.