Tag Archives: data science

Color in Visualizations: Using it to its Full Communicative Advantage

When using big data, you will want to make visualizations. How do you use color to the greatest communicative advantage?

Color in visualizations of data curation and other data science documentation can be used to enhance communication – I show you how!

SAS Macros for Beginners: Learn the Basics with my Tutorial Videos!

If you are new to SAS, you will want to learn about macros. I make video tutorials and have written a book about data warehousing to help learners grasp automation in SAS.

Want to get started learning about SAS macros? This blog post provides SAS macros for beginners with video tutorials to walk beginners and code newbies through the basic steps!

US Public Health Alphabet Soup Explained: What is the BPHC?

The Bureau of Primary Healthcare is a United States federal agency that ensures safety net services to poor individuals

The United States (US) Bureau of Primary Healthcare (BPHC) is the federal agency that funds our safety net infrastructure serving patients who can’t get on Medicare or Medicaid. I explain how all that works, and the relationship of BPHC to the rest of the public health infrastructure.

US Public Health Alphabet Soup Explained: What is the APHA?

The American Public Health Association is the professional society for the occupation of public health rather than healthcare.

Curious about the American Public Health Association (APHA) – what it does, and where it fits into the bigger picture of public health organizations? I delve into these topics, and explain how you can get involved.

“Bad Blood” Highlights the Issues with No Administrative Barrier between Research and Clinical Data: Part 5 of 5

Clinical data and research data are governed by different regulations. Therefore, you cannot mix them together, but you can transfer them around from project to project.

Read my last post in a series on data-related misconduct at startup Theranos outlined in the book, “Bad Blood”, where I discuss their lack of administrative barrier between research and clinical data.

Alternative to the PDSA Model for QA/QI in Healthcare? Old-fashioned Epidemiology and Biostatistics! Part 4 of 5

The Plan Do Study Act model does not take into account all functions of a healthcare quality improvement and assurance department

Want an alternative to the Plan-Do-Study-Act (PDSA) model for quality assurance/quality improvement (QA/QI) in healthcare? I recommend approaching QA/QI a different way, by thinking about the various functions of the QA/QI department.

“Bad Blood” Shows how Theranos was an Abject Failure in Data Stewardship: Part 3 of 5

You need governance in data science whether you are doing clinical research in a healthcare setting or in a laboratory.

The book “Bad Blood” describes the fall of startup unicorn Theranos, but also provides insight into the company’s abject failure at data stewardship, which I talk about in this blog post.

“Bad Blood” Demonstrates how a Lack of Product Description Leads to Data Science Misconduct: Part 2 of 5

You need to write a product description for your computer and business applications. Then, when scientists and marketers do research, they know what endpoints to study.

This blog post talks about how lack of product description led to data-related misconduct at Theranos, because they could never nail down exactly what they were trying to do.

“Bad Blood” Reveals Theranos was Guilty of Bad Business and Bad Data Science: Part 1 of 5

Businesses that are chaotic and poorly run do not steward their data properly, and it is inaccurate.

This is my first blog post in a series of five where I talk about data-related misconduct outlined in the book “Bad Blood”, and provide guidance on how to prevent it.

Two Takeaways from Danny Ma’s Machine Learning Panel: Understanding the Problem, and Understanding your Data

Roller coaster like an ETL pipeline that does automation

This lively panel discussed many topics around designing and implementing machine learning pipelines. Two main issues were identified. The first is that you really have to take some time to do exploratory research and define the problem. The second is that you need to also understand the business rules and context behind the data.

Verified by MonsterInsights