Why COVID-19 is Overrunning the US in Late 2020: Overlapping Epicurves

Data in simulated epicurves show frequencies and explain outbreak timing

Other countries have found ways to control community spread of COVID-19 while they wait for vaccine programs to be implemented, but the United States has failed to do so. An epicurve is a diagram of the timing of an outbreak, and in other countries, this curve has been flattened. But in the United […]

This Course in Explainable AI will Get you Ready for the Future!

What do the data say when a machine learning algorithm is applied, and which features are important?

We experience artificial intelligence all the time on the internet in the form of friend suggestions on social media, internet ads that reflect what we have been searching for, and “smart” recommendations from online stores. But the reality is that even the people who build the underlying formulas usually cannot explain why you were shown a certain […]

Read Our New Peer-reviewed Paper on the Ketogenic Hypothesis for Lipedema!

Lipedema is a chronic condition that is often misdiagnosed as obesity

Lipedema, a severe metabolic disorder, is more common than originally thought. A non-trivial proportion of women who struggle with obesity actually have undiagnosed lipedema. I am on a research team that just published a peer-reviewed article that presents the ketogenic hypothesis for lipedema, and here, I present a summary.

Two Takeaways from Danny Ma’s Machine Learning Panel: Understanding the Problem, and Understanding your Data

Roller coaster like an ETL pipeline that does automation

This lively panel discussed many topics around designing and implementing machine learning pipelines. Two main issues were identified. The first is that you really have to take some time to do exploratory research and define the problem. The second is that you also need to understand the business rules and context behind the data.

Donate to Central Boston Elder Services in GIVE65’s Giving Tuesday Event!

elders charity donation minority COVID-19

On #GivingTuesday, donate to Central Boston Elder Services’ Little Necessities program! Give early on December 1, 2020, and your donation may be matched through a program arranged by the #GIVE65 senior services crowdfunding platform.

Sort Order, Formats, and Operators: A Tour of The SAS Documentation Page

SAS software sorting a to z or using arithmetic operators

Get to know three of my favorite SAS documentation pages: the one with sort order, the one that lists all the SAS formats, and the one that explains all the SAS operators and expressions!

Data Scientists Interested in Encryption Should Take this Online Cryptography Course

Cartoon of person programming with code in the background

Even if you do not deal directly with cryptography, the need to maintain data privacy often leads data scientists to study it. This basic online course is part of an ethical hacking certification and gives an overview of issues with data transfer and cryptography.

Understand US Payroll Data from this Online Course about Payroll

Check with payroll royalty data processing

If you are paid through payroll in the US, you know that the data on a pay stub are pretty complicated. This course in payroll is helpful for data scientists who find themselves analyzing US payroll data, because it explains the business rules and regulations behind the data.

If You Want to Increase Conversions, Try my A/B Testing Course on LinkedIn Learning

Diagram explaining how A/B testing is done

A/B testing seems straightforward, but there are a lot of picky details. What A and B conditions do you actually test? How long do you run the test? How do you calculate the statistics for the test? Answer your questions by taking this LinkedIn Learning course.

Announcing the Publication of my New SAS Book on Data Warehousing

Textbook explaining how to use SAS to program and design a data warehouse

SAS is known for big data and data warehousing, but how do you actually design and build a SAS data warehouse or data lake? What datasets do you include? How do you transform them? How do you serve warehouse users? How do you manage your developers? This book has your answers!