Quick Hits: Exclude Current Week in SQL

Problem

When creating weekly metrics charts, including data from the current week can cause graphs to dive sharply at the end, since data collection for the current week is still incomplete. Though the incomplete data is not a problem for an analyst, end users can be concerned if their numbers appear to be declining sharply.

The same issue can present itself when looking at daily, monthly, yearly, etc. data if you include the current timeframe. The problem then is finding a way to exclude the current timeframe’s data.

Solution

Luckily, the solution is quite easy! I discovered a method recently when faced with this exact problem at work. My solution relies on the DATE_TRUNC function, which exists in Redshift, PostgreSQL, and several other varieties of SQL. If your variety does not have the same function, you should still be able to find an equivalent (just use whatever method you were already using to group your data into weeks).

When you use DATE_TRUNC to truncate a date to the week, it returns a timestamp for midnight on the first day of that week (Monday in PostgreSQL and Redshift). Therefore, you can call DATE_TRUNC on CURRENT_DATE to obtain the timestamp of the beginning of your current week.
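As a quick check of that behavior, the queries below show what the truncation returns. This is a minimal sketch assuming PostgreSQL or Redshift; the column aliases and the fixed sample timestamp are only illustrative.

```sql
-- Start of the current week: midnight on the most recent Monday.
select date_trunc('week', current_date) as current_week_start;

-- A fixed example: June 11, 2021 was a Friday, so the week began on Monday, June 7.
select date_trunc('week', timestamp '2021-06-11 15:30:00') as week_start;
-- 2021-06-07 00:00:00
```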

At that point, filtering is easy. You can filter out any row whose timestamp is greater than or equal to the truncated date. Alternatively, you can run the DATE_TRUNC function on both your timestamp and the current date, compare the results, and filter out any rows where they match.
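The two filters can be sketched side by side. The table `events` and its timestamp column `created_at` are hypothetical names used only for illustration, and the syntax assumes PostgreSQL or Redshift.

```sql
-- Option 1: keep only rows before the start of the current week.
select *
from events
where created_at < date_trunc('week', current_date);

-- Option 2: truncate both sides and drop rows that fall in the current week.
select *
from events
where date_trunc('week', created_at) <> date_trunc('week', current_date);
```

The two are not quite identical: the first also drops any rows dated in the future, while the second keeps them. The first form also leaves the column unwrapped, which generally makes it easier for the database to use an index on that column.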

Example

As an example, let’s consider a table recording customer visits (visits). This table has two columns, visit_time and customer_id. If you want a weekly count of visits to use in a line graph or other visualization, you could get the result using this:

select date_trunc('week', visit_time) as week, count(*) as visit_count from visits group by 1

However, that query produces the exact downturn we want to avoid! To remove the current week's data, just add a WHERE clause that keeps only rows from before the start of the current week:

select date_trunc('week', visit_time) as week, count(*) as visit_count from visits where visit_time < date_trunc('week', current_date) group by 1

With this small filtering statement, you can cut out all the current week’s data and display only completed weeks in your weekly graph.
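The same pattern should carry over to other timeframes by changing the unit passed to DATE_TRUNC. As a sketch under the same assumptions (PostgreSQL or Redshift, and the visits table from the example), a monthly version might look like this; the `order by` is an addition to keep the output in chronological order.

```sql
-- Monthly visit counts, excluding the current (incomplete) month.
select
    date_trunc('month', visit_time) as month,
    count(*) as visit_count
from visits
where visit_time < date_trunc('month', current_date)
group by 1
order by 1;
```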

Conclusion

Filtering out the current timeframe's data is ultimately pretty easy once you know one or two ways to do it! Filtering out this data may not give you the most up-to-date information, but it will help ease the fears of product managers and other end users who see a sudden dip in the graphs. Try it out in your workflows, and see if it works well for you!

Written by Chase Thacker

Data Science

June 11, 2021
