Tag Archives: career advice

Dataset Source Documentation: Necessary for Data Science Projects with Multiple Data Sources

If you work on a big data project with multiple source datasets, you run the risk of forgetting exactly how you blended them together.

Dataset source documentation is worth keeping whenever your analysis draws on multiple datasets. Read my blog post to learn how easy it is to throw together some quick dataset source documentation in PowerPoint so that you don’t forget what you did.

Joins in Base R: Alternative to SQL-like dplyr

In base R, you can execute SQL-like joins, as long as you use the correct code syntax.

Joins in base R must be executed properly or you will lose data. Read my tutorial on how to correctly execute left joins in base R.
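As a minimal sketch of the data-loss risk mentioned above, assuming two small made-up data frames (`patients` and `labs` are hypothetical and not from the tutorial): base R's `merge()` does an inner join by default, so rows without a match are silently dropped unless you ask for a left join with `all.x = TRUE`.

```r
# Hypothetical example data: three patients, but lab results for only two
patients <- data.frame(id = c(1, 2, 3), age = c(34, 51, 47))
labs     <- data.frame(id = c(1, 3), glucose = c(90, 110))

# Default merge() is an inner join: patient 2 has no lab row and is dropped
inner <- merge(patients, labs, by = "id")

# all.x = TRUE keeps every row of the left table; missing values become NA
left <- merge(patients, labs, by = "id", all.x = TRUE)

nrow(inner)  # fewer rows than patients
nrow(left)   # same number of rows as patients
```

Comparing `nrow()` before and after the join is a quick way to check whether any rows were lost.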

R for Logistic Regression: Example from Epidemiology and Biostatistics

Logistic regression models the log odds of the outcome. Many people are used to using SAS for logistic regression, but you can also use R.

R is a reasonable choice for logistic regression in health data analytics, if you know what packages to use. You don’t have to use SAS! My blog post provides example R code and a tutorial!
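For a rough idea of what this can look like, here is a minimal sketch using `glm()` from base R's stats package and the built-in `mtcars` dataset. The dataset and model are assumptions chosen only for illustration; they are not the epidemiology example from the blog post.

```r
# Fit a logistic regression: binary outcome (am: 0 = automatic, 1 = manual)
# predicted by car weight (wt). family = binomial uses the logit link by default.
fit <- glm(am ~ wt, data = mtcars, family = binomial)

# Coefficients are on the log-odds scale
log_odds <- coef(fit)

# Exponentiate to get odds ratios, which are usually what gets reported
odds_ratios <- exp(log_odds)

summary(fit)
odds_ratios
```

An odds ratio below 1 for `wt` means heavier cars have lower odds of having a manual transmission in this dataset.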

Portfolio Project Examples for Independent Data Science Projects

Are you a data scientist who is interested in doing independent portfolio projects to sharpen your skills? Then I strongly suggest you get a coach or a mentor.

Portfolio project examples are sometimes needed for newbies in data science who are looking to complete independent projects. This blog post provides some great examples of independent projects you can do with datasets available online!

Internship Strategy for Data Science: Download our Guide!

In data science, you can learn applied skills by being part of an internship at a noted organization.

Internship strategy for data science is not obvious, and even if you are in a college program, the program often expects you to find your own internship. Download our internship strategy guide and get the experience you want!

Management Tips for Data Scientists

When working in data science, there are some tips and tricks to managing your communication and relationship with superiors that can help you advance in your career.

Management tips for data scientists can be used by anyone – at work and in your personal life! Get the details in my blog post.

ETL Pipeline Documentation: Here are my Tips and Tricks!

This blog post shows you how to properly document your extract, transform, and load code.

ETL pipeline documentation is great for team communication as well as data stewardship! Read my blog post to learn my tips and tricks.

End-to-End AI Pipelines: Can Academics Be Taught How to Do Them?

What is an end-to-end AI pipeline? And why are academics so bad at making one? These are the questions we will examine in this blog post.

End-to-end AI pipelines are being created routinely in industry, and one complaint is that academics can only contribute to one component of the pipeline. Really? Read my blog post for an alternative viewpoint!

Coloring Plots in R using Hexadecimal Codes Makes Them Fabulous!

You do not need to use the default R colors on your plot. You don't even need to limit yourself to named colors on cheat sheets.

Recoloring plots in R? Want to learn how to use an image to inspire R color palettes you can use in ggplot2 plots? Read my blog post to learn how.
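A minimal sketch of the idea, assuming a three-color palette whose hex codes were picked only for illustration (they are not taken from the blog post): anywhere R accepts a color name, it also accepts a `"#RRGGBB"` string.

```r
# Hypothetical palette: hex codes chosen for illustration only
my_palette <- c("#1B9E77", "#D95F02", "#7570B3")

# col2rgb() converts each hex code to its red, green, and blue components
rgb_values <- col2rgb(my_palette)

# Use the palette in a base R plot in place of the default colors
barplot(c(3, 5, 2), names.arg = c("A", "B", "C"), col = my_palette)
```

The same character vector can be passed to ggplot2, for example through `scale_fill_manual(values = my_palette)`.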

Rebrand as a Data Scientist: Online Mentoring Program

Want to apply your domain knowledge to the field of data science? Join our mentoring program!

Learn about our online data science group mentoring program, and find out if it is right for you!
