Tag Archives: cross-sectional study

Dataset Source Documentation: Necessary for Data Science Projects with Multiple Data Sources

If you work on a big data project with multiple source datasets, you run the risk of forgetting exactly how you blended them together.

Dataset source documentation is worth keeping whenever your analysis draws on multiple datasets. Read my blog post to learn how easy it is to throw together some quick dataset source documentation in PowerPoint so that you don’t forget what you did.

REDCap Mess: How it Got There, and How to Clean it Up

REDCap mess on your hands? The REDCap designers made the application so loosey-goosey that you can really program yourself into a messy corner if you don't plan well.

REDCap mess happens often in research shops, and it’s an analysis showstopper! Read my blog post to learn my secret tricks for breaking through the barriers and getting on with data analytics!

US Public Health Alphabet Soup Explained: What is the ONC?

Before the ONC was created by executive order in 2004 and written into law by the HITECH Act in 2009, there was no federal oversight of medical record systems.

“What is the ONC?” is what I used to ask before I realized it involves health technology. Although ONC just means “Office of the National Coordinator”, this agency is now known as HealthIT.gov, as I explain in my blog post.

Time Series Plots in R Using ggplot2 Are Ultimately Customizable

Time series plots can be customized if you use the ggplot2 package in R. You can place labels and configure axes.

Time series plots in R are totally customizable using the ggplot2 package, and can come out with a look that is clean and sharp. However, you usually end up fighting with the formatting of the x-axis and other options, as I explain in my blog post.
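To give a flavor of the x-axis formatting involved, here is a minimal sketch, not the code from the blog post. The data frame `ts_data`, its columns `month` and `visits`, and all of the values are made up for illustration. It assumes the ggplot2 package is installed.

```r
library(ggplot2)

# Hypothetical monthly data, invented purely for illustration
ts_data <- data.frame(
  month  = seq(as.Date("2020-01-01"), by = "month", length.out = 12),
  visits = c(12, 15, 14, 18, 21, 19, 23, 26, 24, 28, 30, 27)
)

p <- ggplot(ts_data, aes(x = month, y = visits)) +
  geom_line() +
  # Put a tick every 3 months and label it like "Jan 2020"
  scale_x_date(date_breaks = "3 months", date_labels = "%b %Y") +
  # Place the title and axis labels
  labs(title = "Monthly visits (made-up data)", x = "Month", y = "Visits") +
  # Tilt the x-axis labels so they do not overlap
  theme(axis.text.x = element_text(angle = 45, hjust = 1))
```

Calling `print(p)` draws the plot. Most of the usual x-axis fighting comes down to the `date_breaks` and `date_labels` arguments of `scale_x_date()`, which only apply when the x variable is stored as a Date.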

Counting Rows in SAS and R Uses Totally Different Strategies

If you are a data scientist working with large datasets, you need to learn the commands to count both columns and rows in the dataset, whether you are using SAS or R.

Counting rows in SAS and R is approached differently, because the two programs process data in different ways. Read my blog post where I describe both ways.
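On the R side, a minimal sketch of the idea uses base R only. The data frame `df` and its contents are hypothetical, and this is not necessarily the exact approach the blog post walks through.

```r
# Hypothetical data frame, invented purely for illustration
df <- data.frame(
  id    = 1:5,
  score = c(10, 20, 30, 40, 50)
)

n_rows <- nrow(df)  # number of rows (observations)
n_cols <- ncol(df)  # number of columns (variables)
both   <- dim(df)   # rows and columns together, as a length-2 vector
```

Because R holds the whole data frame in memory, the counts come straight from the object without stepping through the data.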

Confused when Downloading BRFSS Data? Here is a Guide

You can download public data from health surveillance surveys. However, you have to know how to locate it on the web site.

I use the datasets from the Behavioral Risk Factor Surveillance System (BRFSS) for demonstrations in a lot of my data science tutorials. The BRFSS datasets are free and available to the public – but they are kind of buried on the web site. This blog post serves as a “map” to help you find them!
