Tag Archives: academic publication

Classification Crosswalks: Strategies in Data Transformation

What if you have too many categories in a categorical variable? Your cardinality is too high for a chi-square analysis.

Classification crosswalks are easy to make, and can help you reduce cardinality in categorical variables, making for insightful data science portfolio projects with only descriptive statistics. Read my blog post for guidance!

NHANES Data: Pitfalls, Pranks, Possibilities, and Practical Advice

If you are interested in population-level surveillance data, you might have thought about using NHANES data in portfolio projects.

NHANES data piqued your interest? It’s not all sunshine and roses. Read my blog post to see the pitfalls of NHANES data, and get practical advice about using them in a project.

Color in Visualizations: Using it to its Full Communicative Advantage

When using big data, you will want to make visualizations. How do you use color to the greatest communicative advantage?

Color in visualizations of data curation and other data science documentation can be used to enhance communication – I show you how!

Shapes and Images in Dataviz: Making Choices for Optimal Communication

If you use good judgment in choosing chapes and images to add to your data visualizations, your audience will be enlightened.

Shapes and images in dataviz, if chosen wisely, can greatly enhance the communicative value of the visualization. Read my blog post for tips in selecting shapes for data visualizations!

Rapid Application Development Public Health Style

If you work on front-ends or back-ends of health applications, you are probably already familiar with the concepts of Agile and rapid application development.

“Rapid application development” (RAD) refers to an approach to designing and developing computer applications. In public health and healthcare, we are not taught about application development – but it’s good for us to learn about it, since we have to deal with data from health applications. My blog post talks about the RAD approach I […]

CitePeeps: Want to Increase Citations to Your Research? Join our Online Community!

CitePeeps is an online community of scientific authors who are interested in increasing the number of citations to their written works.

CitePeeps is a new online community of scientific authors focused on increasing the number of citations to their published works. Join us!

Benchmarking Runtime is Different in SAS Compared to Other Programs

How do you measure how long it takes for code to run in different programs? And why would you want to measure something like that? Mainly, the reason to benchmark runtime is so that you can figure out how to optimize your code.

Benchmarking runtime is different in SAS compared to other programs, where you have to request the system time before and after the code you want to time and use variables to do subtraction, as I demonstrate in this blog post.

End-to-End AI Pipelines: Can Academics Be Taught How to Do Them?

What is an end-to-end AI pipeline? And why are academics so bad at making one? These are different ideas we will examine in this blog post.

End-to-end AI pipelines are being created routinely in industry, and one complaint is that academics can only contribute to one component of the pipeline. Really? Read my blog post for an alternative viewpoint!

Coloring Plots in R using Hexadecimal Codes Makes Them Fabulous!

You do not need to use the default R colors on your plot. You don't even need to limit yourself to named colors on cheat sheets.

Recoloring plots in R? Want to learn how to use an image to inspire R color palettes you can use in ggplot2 plots? Read my blog post to learn how.

US Public Health Alphabet Soup Explained: What is the ONC?

Before the ONC office was established in 2009, there was no federal oversight of medical record systems.

“What is the ONC?” is what I used to ask before I realized it involves health technology. Although ONC just means “Office of the National Coordinator”, this agency is now known as HealthIT.gov, as I explain in my blog post.

Verified by MonsterInsights