Tag Archives: data warehousing in SAS

Connecting SAS to Other Applications: Different Strategies

Did you know it is possible to integrate SAS with other data environments, like Microsoft SQL or Excel?

Connecting SAS to other applications is often necessary, and there are many ways to do it. Read this blog post for a couple of use-cases of SAS data integration using various SAS components.

Project Management Terminology for Public Health Data Scientists

If you are a health data analyst or a biostatistician, we might find computer programmers and application developers use different terminology for the same ideas and concepts.

Project management terminology is often used around epidemiologists, biostatisticians, and health data scientists, and it’s often hard for us to admit we aren’t familiar with some of the terms. Watch my videos and take my Applications Basics course to get up to speed with vocabulary from the health application development domain.

Understanding Legacy Data in a Relational World

Data systems started being in use in the 1960s and 1970s, but these were flat systems, usually using IBM mainframes.

Understanding legacy data is necessary if you want to analyze datasets that are extracted from old systems. This knowledge is still relevant, as we still use these old systems today, as I discuss in my blog post.

Reducing Query Cost (and Making Better Use of Your Time)

Slow queries can happen in SAS, R, Python, SQL or any database language. These slow queries have a cost.

Reducing query cost is especially important in SAS – but do you know how to do it, or what it even means? Read my blog post to learn why this is important in health data analytics.

Adding Error Bars to ggplot2 Plots Can be Made Easy Through Dataframe Structure

Error bars on plots can provide the audience an estimate of the amount of certainty you have with your estimates.

Adding error bars to ggplot2 in R plots is easiest if you include the width of the error bar as a variable in your plot data. Read my blog post to see an example.

Querying the GHDx Database: Demonstration and Review of Application

Many data scientists interested in health are looking to query the Global Burden of Disease database, also known as the GHDx

Querying the GHDx database is challenging because of its difficult user interface, but mastering it will allow you to access country-level health data for comparisons! See my demonstration!

Variable Names in SAS and R Have Different Restrictions and Rules

You need to come up with names of variables in SAS and in R, but they need to be compatible with both languages if you are running a data warehouse.

Variable names in SAS and R are subject to different “rules and regulations”, and these can be leveraged to your advantage, as I describe in this blog post.

Referring to Variables in Processing Data is Different in SAS Compared to R

When doing data processing, especially extract-transform-load (ETL) into a data warehouse, you might need to refer to the variables in your code, and it's done differently in SAS vs. R.

Referring to variables in processing is different conceptually when thinking about SAS compared to R. I explain the differences in my blog post.

Interview Preparation for Data Science Positions: Tips and Tricks

You can actually prepare for interviewing for data science positions by doing certain activities, like looking up common questions, and practicing answers.

Interview preparation for data science jobs can involve taking several simple, actionable steps to make yourself feel confident and ready to answer questions with ease. Read my blog post for my tips and tricks!

Verified by MonsterInsights