A/B testing seems straightforward, but there are a lot of picky details. What A and B conditions do you actually test? How long do you run the test? How do you calculate the statistics for the test? Answer your questions by taking this LinkedIn Learning course.
Author Archives: Monika Wahi
SAS is known for big data and data warehousing, but how do you actually design and build a SAS data warehouse or data lake? What datasets do you include? How do you transform them? How do you serve warehouse users? How do you manage your developers? This book has your answers!
Curation files are especially helpful for communicating about data on teams. Learn more about what you’ll learn when you take my online LinkedIn Learning data curation course!
I use the datasets from the Behavioral Risk Factor Surveillance Survey (BRFSS) to demonstrate in a lot of my data science tutorials. The BRFSS are free and available to the public – but they are kind of buried on the web site. This blog post serves as a “map” to help you find them!
I love the Likert package in R, and use it often to visualize data. The problem is that sometimes, I have sparse data, and this can cause problems with the package. This blog post shows you a workaround, and also, a way to format the final plot that I think looks really great!
With all this talk about “flattening the curve” of the coronavirus, I thought I would get into the weeds about what curve we are talking about when we say that. We are talking about what’s called an epidemiologic curve, or epicurve for short. And to demonstrate what an epicurve is and what it means, I […]
If you are learning epidemiology or refreshing your memory of your formal study of it, you always want to have credible scientific definitions at hand of terms such as “measures of association”. Read my blog post for quick links to Boston University’s online educational module!
Since the coronavirus (COVID-19) registered on our public health radar as a communicable infectious disease amongst humans, countries try to calculate their coronavirus mortality rate (otherwise known as case fatality rate). As a result, many different mortality rates have been reported, causing confusion. This article from Business Insider reports country-wide mortality rates that range from […]
Snowflake is a new big data cloud back-end solution, and the company has been holding “Data for Breakfast” meetings in many different cities to introduce data scientists to their product. I went to the meeting held on March 4, 2020 at the Marriott Long Wharf in Boston, and here is my review. Not My First […]
I encourage those of you into data science and AI to sign up to receive digests from Data Science Central. I happened upon this web site when looking for some references for a book I am writing, and found some amazing articles. Since I’m into data science in and AI in healthcare, I was intrigued […]