Sort Order, Formats, and Operators: A Tour of The SAS Documentation Page

SAS software sorting a to z or using arithmetic operators

Get to know three of my favorite SAS documentation pages: the one with sort order, the one that lists all the SAS formats, and the one that explains all the SAS operators and expressions!

Data Scientists Interested in Encryption Should Take this Online Cryptography Course

Cartoon of person programming with code in the background

Even if you do not deal directly with cryptography, the need to maintain data privacy often leads data scientists to need to study cryptography. This basic online course is part of an ethical hacking certification and gives a basic overview of issues with data transfer and cryptography.

Understand US Payroll Data from this Online Course about Payroll

Check with payroll royalty data processing

If you receive payroll in the US, you can see that the data on the payroll stub is pretty complicated. This course in payroll is helpful for data scientists who find themselves analyzing US payroll data, because it explains the business rules and regulations behind the data.

If You Want to Increase Conversions, Try my A/B Testing Course on LinkedIn Learning

Diagram explaining how A/B testing is done

A/B testing seems straightforward, but there are a lot of picky details. What A and B conditions do you actually test? How long do you run the test? How do you calculate the statistics for the test? Answer your questions by taking this LinkedIn Learning course.

Announcing the Publication of my New SAS Book on Data Warehousing

Textbook explaining how to use SAS to program and design a data warehouse

SAS is known for big data and data warehousing, but how do you actually design and build a SAS data warehouse or data lake? What datasets do you include? How do you transform them? How do you serve warehouse users? How do you manage your developers? This book has your answers!

Announcing my New(-ish) Data Curation Course on LinkedIn Learning!

This flow chart is an example of the time of files you make if you take this online data curation course

Curation files are especially helpful for communicating about data on teams. Learn more about what you’ll learn when you take my online LinkedIn Learning data curation course!

Confused when Downloading BRFSS Data? Here is a Guide

Many images of colorful database shapes on a rainbow background

I use the datasets from the Behavioral Risk Factor Surveillance Survey (BRFSS) to demonstrate in a lot of my data science tutorials. The BRFSS are free and available to the public – but they are kind of buried on the web site. This blog post serves as a “map” to help you find them!

Doing Surveys? Try my R Likert Plot Data Hack!

The Likert package in R can visualize categorical data.

I love the Likert package in R, and use it often to visualize data. The problem is that sometimes, I have sparse data, and this can cause problems with the package. This blog post shows you a workaround, and also, a way to format the final plot that I think looks really great!

Fall 2020 Update: Education Mission in the End of Days!

Images of fall leaves on trees on a road in New England

After being hit with the pandemic, I’m racing to catch up with my blog! Here are a few announcements of new courses, new scientific papers, and new ideas on data science!

I Used the R Package EpiCurve to Make an Epidemiologic Curve. Here’s How It Turned Out.

Epidemiologic Curve of 2015 Middle East Respiratory Virus Outbreak Using R EpiCurve Package

With all this talk about “flattening the curve” of the coronavirus, I thought I would get into the weeds about what curve we are talking about when we say that. We are talking about what’s called an epidemiologic curve, or epicurve for short. And to demonstrate what an epicurve is and what it means, I […]