Read my last post in a series on data-related misconduct at startup Theranos outlined in the book, “Bad Blood”, where I discuss their lack of administrative barrier between research and clinical data.
Tag Archives: big data analytics
Want an alternative to the Plan-Do-Study-Act (PDSA) model for quality assurance/quality improvement (QA/QI) in healthcare? I recommend approaching QA/QI a different way, by thinking about the various functions of the QA/QI department.
If you are not sure if you will like doing research in healthcare, instead of starting with big data, start with data collection and get to know the data as it comes into the dataset.
Monika posts her “data science newbie do-it-yourself starter kit”, with links to cheap or free learning resources for the data science newbie who wants to get started in healthcare analytics.
We experience artificial intelligence all the time on the internet in terms of friend suggestions on social media, internet ads that reflect what we have been searching for, and “smart” recommendations from online stores. But the reality is that even the people who build those formulas cannot usually explain why you were shown a certain […]
Get to know three of my favorite SAS documentation pages: the one with sort order, the one that lists all the SAS formats, and the one that explains all the SAS operators and expressions!
SAS is known for big data and data warehousing, but how do you actually design and build a SAS data warehouse or data lake? What datasets do you include? How do you transform them? How do you serve warehouse users? How do you manage your developers? This book has your answers!
I use the datasets from the Behavioral Risk Factor Surveillance Survey (BRFSS) to demonstrate in a lot of my data science tutorials. The BRFSS are free and available to the public – but they are kind of buried on the web site. This blog post serves as a “map” to help you find them!
With all this talk about “flattening the curve” of the coronavirus, I thought I would get into the weeds about what curve we are talking about when we say that. We are talking about what’s called an epidemiologic curve, or epicurve for short. And to demonstrate what an epicurve is and what it means, I […]
Snowflake is a new big data cloud back-end solution, and the company has been holding “Data for Breakfast” meetings in many different cities to introduce data scientists to their product. I went to the meeting held on March 4, 2020 at the Marriott Long Wharf in Boston, and here is my review. Not My First […]