Blog
Data science, statistics, and Appalachian culture and history
Quick Hits: Exclude Current Week in SQL
Welcome to the first of my “Quick Hits” entries on the blog! I am still trying to find the proper shape for my blog formats, and I hope this one will persist going forward. Quick Hits entries will be more focused on a problem I found interesting and a solution to that problem. Other content will usually be written to explore a machine learning or...
How to find data for data science or GIS projects
Intro When starting a personal data science project, often the most difficult part is figuring out where to obtain the data you will use. Whether you are just looking for some test data to use when learning a new technique or are looking to investigate the answer to a question you have, you need to have a good quality data source. A lot more free, public...
The Birthday Problem with Real-world data (Birthday Problem Pt. 2)
As promised in the previous post, I have found some real-world datasets to test out our Birthday Problem predictions on! The more advanced visualizations will have to wait until another weekend when I have some time. For this post, I will be looking at four datasets and checking how compare to our statistics. Do the rather remarkable predictions of the...
Exploring the Birthday Problem/Paradox (Pt. 1)
My first sequence of posts was all about COVID-19 in the early days of the outbreak in the US. Now, I want to turn towards something a little more fun--the Birthday Problem! This classic stats problem is also known as the "Birthday Paradox" because its conclusions run counter to so many people's intuition. This post is the first example of another type of...
Predicting COVID-19 Spread Pt. 3: Model and Predictions
If you stuck with me through the first two posts, thanks! If you are just joining me in this one, be sure to check out parts 1 and 2 for more background about the project, the data, and what modification has been done on it. In this section, I will be covering the creation of the prediction model and the results of my initial model. I hope it is at least...
Predicting COVID-19 Spread Pt. 2 Data preparation
This is part 2 in my series about a small analytics project I am throwing together. The goal of the project is to create a one-day prediction model for new cases of the novel coronavirus. Check out part 1 for an overview of the data collection process. In this part of the series, I will be covering the preparation of the data for inputting into my model....
Predicting COVID-19 Spread Pt. 1: Data Collection
Hello and welcome to my blog! I hope I am able to provide you with something of interest or of use to you. I plan to use this as a place to write about my interests and my studies. I will use it to give updates and information about my projects and to write about tools and topics I find useful, interesting, or helpful. Generally, I will be writing about...