Dataset source documentation is good to keep when you are doing an analysis with data from multiple datasets. Read my blog to learn how easy it is to throw together some quick dataset source documentation in PowerPoint so that you don’t forget what you did.
Tag Archives: career discussion
Joins in base R must be executed properly or you will lose data. Read my tutorial on how to correctly execute left joins in base R.
Table editing in R is easier than in SAS, because you can refer to columns, rows, and individual cells in the same way you do in MS Excel. Read my blog post for example R table editing code.
R for logistic regression in health data analytics is a reasonable choice, if you know what packages to use. You don’t have to use SAS! My blog post provides you example R code and a tutorial!
Management tips for data scientists can be used by anyone – at work and in your personal life! Get the details in my blog post.
ETL pipeline documentation is great for team communication as well as data stewardship! Read my blog post to learn my tips and tricks.
End-to-end AI pipelines are being created routinely in industry, and one complaint is that academics can only contribute to one component of the pipeline. Really? Read my blog post for an alternative viewpoint!
Researching data science companies who might be your future employers, but you don’t know where to start? Read my blog post to learn my simple approach.