Tag Archives: data visualization

Curated Datasets: Great for Data Science Portfolio Projects!

If you need data to do a project, read this blog post for information.

Curated datasets are useful to know about if you want to do a data science portfolio project on your own. I made this blog post for our group mentoring program. Check out the ones I am promoting on my blog!

Querying the GHDx Database: Demonstration and Review of Application

Many data scientists interested in health are looking to query the Global Burden of Disease database, also known as the GHDx

Querying the GHDx database is challenging because of its difficult user interface, but mastering it will allow you to access country-level health data for comparisons! See my demonstration!

Interview Preparation for Data Science Positions: Tips and Tricks

You can actually prepare for interviewing for data science positions by doing certain activities, like looking up common questions, and practicing answers.

Interview preparation for data science jobs can involve taking several simple, actionable steps to make yourself feel confident and ready to answer questions with ease. Read my blog post for my tips and tricks!

Dumbbell Plot for Comparison of Rated Items: Which is Rated More Highly – Harvard or the U of MN?

This is an example of a dumbbell plot from the ggalt package in R that you can also use in RStudio

Want to compare multiple rankings on two competing items – like hotels, restaurants, or colleges? I show you an example of using a dumbbell plot for comparison in R with the ggalt package for this exact use-case!

Data for Meta-analysis Need to be Prepared a Certain Way – Here’s How

This is the forrest plot resulting from analysis with open source statistical software R using package rmeta.

Getting data for meta-analysis together can be challenging, so I walk you through the simple steps I take, starting with the scientific literature, and ending with a gorgeous and evidence-based Forrest plot!

US Public Health Alphabet Soup Explained: What is the APHA?

The American Public Health Association is the professional society for the occupation of public health rather than healthcare.

Curious about the American Public Health Association (APHA) – what it does, and where it fits into the bigger picture of public health organizations? I delve into these topics, and explain how you can get involved.

Alternative to the PDSA Model for QA/QI in Healthcare? Old-fashioned Epidemiology and Biostatistics! Part 4 of 5

The Plan Do Study Act model does not take into account all functions of a healthcare quality improvement and assurance department

Want an alternative to the Plan-Do-Study-Act (PDSA) model for quality assurance/quality improvement (QA/QI) in healthcare? I recommend approaching QA/QI a different way, by thinking about the various functions of the QA/QI department.

“Bad Blood” Shows how Theranos was an Abject Failure in Data Stewardship: Part 3 of 5

You need governance in data science whether you are doing clinical research in a healthcare setting or in a laboratory.

The book “Bad Blood” describes the fall of startup unicorn Theranos, but also provides insight into the company’s abject failure at data stewardship, which I talk about in this blog post.

Two Takeaways from Danny Ma’s Machine Learning Panel: Understanding the Problem, and Understanding your Data

Roller coaster like an ETL pipeline that does automation

This lively panel discussed many topics around designing and implementing machine learning pipelines. Two main issues were identified. The first is that you really have to take some time to do exploratory research and define the problem. The second is that you need to also understand the business rules and context behind the data.

Announcing the Publication of my New SAS Book on Data Warehousing

Learn how to do data warehousing in SAS. You can purchase this book and use the code in it to help you.

SAS is known for big data and data warehousing, but how do you actually design and build a SAS data warehouse or data lake? What datasets do you include? How do you transform them? How do you serve warehouse users? How do you manage your developers? This book has your answers!

Verified by MonsterInsights