DATE and TIME values in PostgreSQL have a whole special set of functions and operators for their proper use. So many queries deal with DATE and TIME information that it’s important to get to know the date tools. Below we’ll cover and practice the main functions you’ll likely need. If you want to get detailed you can checkout the full list of PostgreSQL functions here.
There are 4 main ways to store date values in a PostgreSQL database:
|TIMESTAMP||date and time||
|DATE||date (no time)||
|TIME||time (no day)||
|INTERVAL||interval between two date/times||
||1 day, 2:00:10|
We’ll go over more about each of these.
Date String Formatting
Dates in a database aren’t stored as strings, but we input and fetch data from it as if it were a string with the following format for the information:
where the letters stand for Year, Month, Day, Hour, Minutes and Seconds. Let’s say for example that we want to record that we got a new user on January 23, 2020 at exactly 18:14. To represent that exact date and time we would use the format:
TODO: this format is also supported: January 8 04:05:06 1999 PST
To get some familiarity try creating and SELECTing a few TIMESTAMPS below. I was born on May 1st, 1983 at exactly 4:00am. Can you fetch that timestamp?
We’re just going to jump in here. We need to use a different table as none of the previous ones we’ve been using have had date fields in them. Another table available to us in chinook is employees. Let’s get familiar with what columns are in this table by looking at the first few rows. Note that there are several columns so you may have to scroll right to see all of the data:
Each employee has two TIMESTAMP columns, one for their birth_date and one for their hire_date. You can use all of the ORDERing, GROUPing and other functions we learned for other columns on DATE columns as well. Try getting a list of the 4 youngest employees in the company.
Formatting Dates to Strings
Often you don’t want to show the full raw TIMESTAMP, but rather a nicely formatted, potentially truncated version. For example, let’s say we want to get a list of the employees names and the year that they were hired. To do so we’ll need to parse the hired_date to just pull out the year. We can do so with the TO_CHAR function which works as follows
TO_CHAR([date type], [pattern])
where [date type] is a column or value of any of the above listed date/time data types, and [pattern] is a string indicating how to format the output date. The main symbols you’ll want to use to create your format patterns are here
|am||displays whether time is am or pm||
|YY||last 2 digits of the Year||
|YYYY||4 digits of the Year||
|MM||Month # of the year.||
|Month||written Month of the year capitalized||
|Mon||abbreviated of Month of year||
|DD||Day # of the month||
|Day||written Day of the week||
|Dy||abbreviated Day of the week||
|WW||Week # of the year||
|Q||Quarter of the year||
The above patterns can be string together to get the format you eventually want. Some common outputs are:
You don’t have to memorize these (it’s hard to!). It’s just good to get familiar with how it works and then reference back to it when you need it in the future.
There are a couple of extra tools you can use on patterns that output numbers.
|FM||Fill Mode will remove any 0’s
at the front of a 2 digit number.
|th||adds the ordinal suffixes
like st, nd or th to the end of a number
And of course you can combine the two to get
For string outputs, most of the patterns above support different casing output based on the case you use for the pattern. Some examples using different casings of “Day”:
And you can see the following common date format in UPPERCASE, Capitalized and lowercase formats:
Note that the case for numeric values doesn’t change. Still use DD for the day # of the month and YYYY for year.
We’re going to move on in the tutorial but if you’d like more details checkout the full list of PostgreSQL date formatting functions .
Current DATE and TIME Functions
PostgreSQL supports a number of special values, or functions to help bet the current DATE, TIMESTAMP or TIME. The most used ones are
CURRENT_DATE CURRENT_TIME CURRENT_TIMESTAMP
and they are used by just putting them in the query
GROUPing BY DATE
In analytic queries, it’s very common to group things by dates. For example you may want to see new users by year, month, week or day. To do so, you’ll want to use the TO_CHAR function to convert the dates into a truncated string before you GROUP BY it. You don’t want to simply GROUP BY the raw date as those are accurate down to the millisecond so grouping by the unaltered date would be like making GROUPs for each millisecond.
The following examples are using the hire_date field from the employees table and show a lot of common formats you can use for these groups. These are what we use at Chartio for our date group formatting standards.
|Group Period||Example SQL||Example Output|
|Hour of Day||
|Day of Week||
|Day of Month||
|Day of Year||
|Month of Year||
Feel free to try out any of the above formats on the query below:
There are only 8 employees in our database so we’re not dealing with too many groups there. You can get a little more granular with the invoices table and it’s invoice_date column with 250 rows.
The above query returns the number of invoices created per year. Can you modify it to get a SUM of the total amount invoiced by month?