Tag Archives: online data science

SAS-R Integration Example: Transform in R, Analyze in SAS!

You can use SAS and R together in one project. I show you how to develop an analytic dataset in R and put it in SAS ODA for analysis.

Looking for a SAS-R integration example that uses the best of both worlds? I show you a use case where I was in a hurry, so I did the transformation in R and the analysis in SAS!

Dumbbell Plot for Comparison of Rated Items: Which is Rated More Highly – Harvard or the U of MN?

This is an example of a dumbbell plot from the ggalt package in R that you can also use in RStudio

Want to compare multiple rankings on two competing items – like hotels, restaurants, or colleges? I show you an example of using a dumbbell plot for comparison in R with the ggalt package for this exact use case!

Data for Meta-analysis Need to be Prepared a Certain Way – Here’s How

This is the forest plot resulting from analysis with the open source statistical software R using the rmeta package.

Getting data for meta-analysis together can be challenging, so I walk you through the simple steps I take, starting with the scientific literature, and ending with a gorgeous and evidence-based forest plot!

Alternative to the PDSA Model for QA/QI in Healthcare? Old-fashioned Epidemiology and Biostatistics! Part 4 of 5

The Plan Do Study Act model does not take into account all functions of a healthcare quality improvement and assurance department

Want an alternative to the Plan-Do-Study-Act (PDSA) model for quality assurance/quality improvement (QA/QI) in healthcare? I recommend approaching QA/QI a different way, by thinking about the various functions of the QA/QI department.

“Bad Blood” Shows how Theranos was an Abject Failure in Data Stewardship: Part 3 of 5

You need governance in data science whether you are doing clinical research in a healthcare setting or in a laboratory.

The book “Bad Blood” describes the fall of startup unicorn Theranos, but also provides insight into the company’s abject failure at data stewardship, which I talk about in this blog post.

“Bad Blood” Demonstrates how a Lack of Product Description Leads to Data Science Misconduct: Part 2 of 5

You need to write a product description for your computer and business applications. Then, when scientists and marketers do research, they know what endpoints to study.

This blog post talks about how lack of product description led to data-related misconduct at Theranos, because they could never nail down exactly what they were trying to do.

Why COVID-19 is Overrunning the US in Late 2020: Overlapping Epicurves

Data in simulated epicurves show frequencies and explain outbreak timing

While other countries have found a way to control their community spread of COVID-19 while waiting for the vaccine program to be implemented, the United States has totally failed at this. An epicurve is a diagram of the timing of an outbreak, and in other countries, this curve has been flattened. But in the United […]

This Course in Explainable AI will Get you Ready for the Future!

What do the data say when a machine learning algorithm is applied, and which features are important?

We experience artificial intelligence all the time on the internet in terms of friend suggestions on social media, internet ads that reflect what we have been searching for, and “smart” recommendations from online stores. But the reality is that even the people who build those formulas cannot usually explain why you were shown a certain […]

Two Takeaways from Danny Ma’s Machine Learning Panel: Understanding the Problem, and Understanding your Data

A roller coaster, like an automated ETL pipeline

This lively panel discussed many topics around designing and implementing machine learning pipelines. Two main issues were identified. The first is that you really have to take some time to do exploratory research and define the problem. The second is that you also need to understand the business rules and context behind the data.

Data Scientists Interested in Encryption Should Take this Online Cryptography Course

Cartoon of person programming with code in the background

Even if you do not deal directly with cryptography, the need to maintain data privacy often requires data scientists to study it. This introductory online course is part of an ethical hacking certification and gives an overview of issues with data transfer and cryptography.
