Category Archives: Data Science

Posts about data science topics.

NHANES Data: Pitfalls, Pranks, Possibilities, and Practical Advice

If you are interested in population-level surveillance data, you might have thought about using NHANES data in portfolio projects.

NHANES data piqued your interest? It’s not all sunshine and roses. Read my blog post to see the pitfalls of NHANES data, and get practical advice about using them in a project.

Color in Visualizations: Using it to its Full Communicative Advantage

When using big data, you will want to make visualizations. How do you use color to the greatest communicative advantage?

Color in visualizations of data curation and other data science documentation can be used to enhance communication – I show you how!

Defaults in PowerPoint: Setting Them Up for Data Visualizations

The defaults in PowerPoint are really set up for making presentations, not data visualizations.

Defaults in PowerPoint are set up for slides – not data visualizations. Read my blog post for tips on reconfiguring PowerPoint to make it easy for dataviz!

Text and Arrows in Dataviz Can Greatly Improve Understanding

Adding text and arrows to diagrams can help your audience navigate the image, and understand what you are trying to communicate.

Text and arrows in dataviz, if used wisely, can help your audience understand something very abstract, like a data pipeline. Read my blog post for tips in choosing images for your data visualizations!

Shapes and Images in Dataviz: Making Choices for Optimal Communication

If you use good judgment in choosing chapes and images to add to your data visualizations, your audience will be enlightened.

Shapes and images in dataviz, if chosen wisely, can greatly enhance the communicative value of the visualization. Read my blog post for tips in selecting shapes for data visualizations!

Table Editing in R is Easy! Here Are a Few Tricks…

When you use a data analysis program like R or SAS, you often have to do some data editing. It can be difficult because the software was intended for calculations, not transformation.

Table editing in R is easier than in SAS, because you can refer to columns, rows, and individual cells in the same way you do in MS Excel. Read my blog post for example R table editing code.

R for Logistic Regression: Example from Epidemiology and Biostatistics

Logistic regression calculate the log odds of the probability of the outcome. Many people are used to using SAS for logistic regression, but you can also use R.

R for logistic regression in health data analytics is a reasonable choice, if you know what packages to use. You don’t have to use SAS! My blog post provides you example R code and a tutorial!

Connecting SAS to Other Applications: Different Strategies

Did you know it is possible to integrate SAS with other data environments, like Microsoft SQL or Excel?

Connecting SAS to other applications is often necessary, and there are many ways to do it. Read this blog post for a couple of use-cases of SAS data integration using various SAS components.

Portfolio Project Examples for Independent Data Science Projects

Are you a data scientist who is interested in doing independent portfolio projects to sharpen your skills? Then I strongly suggest you get a coach or a mentor.

Portfolio project examples are sometimes needed for newbies in data science who are looking to complete independent projects. This blog post provides some great examples of independent projects you can do with datasets available online!

Project Management Terminology for Public Health Data Scientists

If you are a health data analyst or a biostatistician, we might find computer programmers and application developers use different terminology for the same ideas and concepts.

Project management terminology is often used around epidemiologists, biostatisticians, and health data scientists, and it’s often hard for us to admit we aren’t familiar with some of the terms. Watch my videos and take my Applications Basics course to get up to speed with vocabulary from the health application development domain.

Verified by MonsterInsights