Making upset plots with R package UpSetR is an easy way to visualize patterns of attributes in your data. My blog post demonstrates making patterns of co-morbidities in health survey respondents from the BRFSS, and walks you through setting text and color options in the code.
Category Archives: Data Science
Posts about data science topics.
Making box plots in R affords you many different approaches and features. My blog post will show you easy ways to use both base R and ggplot2 to make box plots as you are proceeding with your data science projects.
Convert CSV to RDS is what you want to do if you are working with big data files in R GUI and want to improve efficiency. Read my blog post for an explanation and video demonstrations of this process!
GPower case example shows a use-case where we needed to select an outcome measure for our study, then do a power calculation for sample size required under different outcome effect size scenarios. My blog post shows what I did, and how I documented/curated the results.
Querying the GHDx database is challenging because of its difficult user interface, but mastering it will allow you to access country-level health data for comparisons! See my demonstration!
Variable names in SAS and R are subject to different “rules and regulations”, and these can be leveraged to your advantage, as I describe in this blog post.
Referring to variables in processing is different conceptually when thinking about SAS compared to R. I explain the differences in my blog post.
Counting rows in SAS and R is approached differently, because the two programs process data in different ways. Read my blog post where I describe both ways.
Native formats in SAS and R of data objects have different qualities – and there are reasons behind these differences. Learn about them in this blog post!
Looking for a SAS-R integration example that uses the best of both worlds? I show you a use-case where I was in a hurry, and did transformation in R with the analysis in SAS!