Tag Archives: data pipeline

ETL Pipeline Documentation: Here are my Tips and Tricks!

This blog post shows you how to properly document your extract, transform, and load code.

ETL pipeline documentation is great for team communication as well as data stewardship! Read my blog post to learn my tips and tricks.

Referring to Columns in R by Name Rather than Number has Pros and Cons

There are different ways to refer to variables in R dataframes. You can use a field names, and you can also use field numbers.

Referring to columns in R can be done using both number and field name syntax. Although field name syntax is easier to use in programming, my blog demonstrates how you can use column numbers to make automation easier.

AI on the Edge: What it is, and Data Storage Challenges it Poses

AI on the edge refers to doing the AI processing and equations at the site of the object collecting the data.

“AI on the edge” was a new term for me that I learned from Marc Staimer, founder of Dragon Slayer Consulting, who was interviewed in a podcast. Marc explained how AI on the edge poses a data storage problem, and my blog post proposes a solution!

Verified by MonsterInsights