GROUP BY clause

Use the GROUP BY clause to group data by one or more specified tags or into specified time intervals. GROUP BY requires an aggregate or selector function in the SELECT statement.

Syntax
GROUP BY clause behaviors
GROUP BY tags
- GROUP BY tags examples
GROUP BY time
- GROUP by time and fill gaps
- GROUP BY time examples - GROUP BY time with offset - GROUP BY time and fill gaps
Result set
- Default time range
Notable behaviors of the GROUP BY clause

Syntax

SELECT_clause FROM_clause [WHERE_clause] GROUP BY group_expression[, ..., group_expression_n]

group_expression: Expression to identify tags or time intervals to group by. Can be a tag key, constant, regular expression, wildcard (*), or function expression.

GROUP BY clause behaviors

GROUP BY tag_key - Groups data by a specific tag
GROUP BY tag_key1, tag_key2 - Groups data by more than one tag
GROUP BY * - Groups data by all tags
GROUP BY /regex/ - Groups data by tag keys that match the regular expression
GROUP BY time() - Groups data into time intervals (windows)

If a query includes WHERE and GROUP BY, the GROUP BY clause must appear after the WHERE clause.

GROUP BY tags

Groups data by one or more tag columns.

GROUP BY tags examples

The following examples use the Bitcoin price sample data.

Group data by a single tag

time	mean_price
1970-01-01T00:00:00Z	27328.848667840004

time	mean_price
1970-01-01T00:00:00Z	23441.832453919982

time	mean_price
1970-01-01T00:00:00Z	28054.160950480004

Group data by more than one tag

time	mean_price
1970-01-01T00:00:00Z	27328.848667840004

time	mean_price
1970-01-01T00:00:00Z	23441.832453919982

time	mean_price
1970-01-01T00:00:00Z	28054.160950480004

Group data by all tags

time	mean_price
1970-01-01T00:00:00Z	27328.848667840004

time	mean_price
1970-01-01T00:00:00Z	23441.832453919982

time	mean_price
1970-01-01T00:00:00Z	28054.160950480004

Group data by tag keys that match a regular expression

time	mean_price
1970-01-01T00:00:00Z	27328.848667840004

time	mean_price
1970-01-01T00:00:00Z	23441.832453919982

time	mean_price
1970-01-01T00:00:00Z	28054.160950480004

GROUP BY time

GROUP BY time() groups data by into specified time intervals, also known as “windows”, and applies the aggregate and selector functions in the SELECT clause to each interval. Use the time() function to specify the time interval to group by.

SELECT_clause FROM_clause WHERE <time_range> GROUP BY time(time_interval[, offset])[, group_expression (...)] [fill(behavior)]

GROUP BY time() intervals use preset round-number time boundaries that are independent of time conditions in the WHERE clause. Output data uses window start boundaries as the aggregate timestamps. Use the offset argument of the time() function to shift time boundaries forward or backward in time.

GROUP by time and fill gaps

When grouping by time, if a window in the queried time range does not contain data, results return a row for the empty window containing the timestamp of the empty window and null values for each queried field. Use the fill() function at the end of the GROUP BY clause to replace null field values. If no FILL clause is included, the default behavior is fill(null).

fill() provides the following behaviors for filling values:

numeric literal: Replaces null values with the specified numeric literal.
linear: Uses linear interpolation between existing values to replace null values.
none: Removes rows with null field values.
null: Keeps null values and associated timestamps.
previous: Replaces null values with the most recent non-null value.

See the fill() documentation for detailed examples.

GROUP BY time examples

The following examples use the Bitcoin price sample data.

Group and aggregate query results into 1 hour windows

time	mean
2023-05-01T00:00:00Z	24494.27265
2023-05-01T01:00:00Z	24452.1698
2023-05-01T02:00:00Z	23902.666124999996
2023-05-01T03:00:00Z	23875.211349999998
2023-05-01T04:00:00Z	23855.6441
…	…

Group and aggregate query results into 1 week intervals by tag

time	mean
2023-04-27T00:00:00Z	27681.21808576779
2023-05-04T00:00:00Z	27829.413580354256
2023-05-11T00:00:00Z	26210.24799033149

time	mean
2023-04-27T00:00:00Z	23744.083925842704
2023-05-04T00:00:00Z	23871.201395652173
2023-05-11T00:00:00Z	22482.33174723755

time	mean
2023-04-27T00:00:00Z	28415.88231123595
2023-05-04T00:00:00Z	28568.010941384844
2023-05-11T00:00:00Z	26905.87242099449

GROUP BY time with offset

Group and aggregate query results into 1 hour intervals and offset time boundaries by +15 minutes

time	mean
2023-04-30T23:15:00Z
2023-05-01T00:15:00Z	29313.6754
2023-05-01T01:15:00Z	28932.0882
2023-05-01T02:15:00Z	28596.375225000003
2023-05-01T03:15:00Z	28578.915075
…	…

Group and aggregate query results into 1 hour intervals and offset time boundaries by -15 minutes

time	mean
2023-04-30T23:45:00Z	29319.9092
2023-05-01T00:45:00Z	29285.3651
2023-05-01T01:45:00Z	28607.202666666668
2023-05-01T02:45:00Z	28576.056175
2023-05-01T03:45:00Z	28566.96315
…	…

GROUP BY time and fill gaps

Group and aggregate query results into 30 minute intervals and fill gaps with 0

time	mean
2023-05-01T00:00:00Z	29319.9092
2023-05-01T00:30:00Z	29307.4416
2023-05-01T01:00:00Z	0
2023-05-01T01:30:00Z	29263.2886

Group and aggregate query results into 30 minute intervals and fill gaps using linear interpolation

time	mean
2023-05-01T00:00:00Z	29319.9092
2023-05-01T00:30:00Z	29307.4416
2023-05-01T01:00:00Z	29285.3651
2023-05-01T01:30:00Z	29263.2886

Group and aggregate query results into 30 minute intervals and fill gaps with previous values

time	mean
2023-05-01T00:00:00Z	29319.9092
2023-05-01T00:30:00Z	29307.4416
2023-05-01T01:00:00Z	29307.4416
2023-05-01T01:30:00Z	29263.2886

Result set

If at least one row satisfies the query, InfluxDB Cloud Dedicated returns row data in the query result set. If a query uses a GROUP BY clause, the result set includes the following:

Columns listed in the query’s SELECT clause
A time column that contains the timestamp for the record or the group
An iox::measurement column that contains the record’s measurement (table) name
Columns listed in the query’s GROUP BY clause; each row in the result set contains the values used for grouping

Default time range

If a query doesn’t specify a time range in the WHERE clause, InfluxDB uses the default time range for filtering and grouping by time. If a query includes the GROUP BY clause and doesn’t specify a time range in the WHERE clause, the default time group is the default time range, and the time column in the result set contains the start of the range–for example:

SELECT mean(temp) FROM home GROUP BY room

name: home
tags: room=Kitchen

time	mean
1970-01-01T00:00:00Z	22.623076923076926

name: home
tags: room=Living Room

time	mean
1970-01-01T00:00:00Z	22.16923076923077

Notable behaviors of the GROUP BY clause

Cannot group by fields

InfluxQL does not support grouping data by fields.

Tag order does not matter

The order that tags are listed in the GROUP BY clause does not affect how data is grouped.

Grouping by tag and no time range returns unexpected timestamps

The time column contains the start of the default time range.

Data grouped by time may return unexpected timestamps

Because GROUP BY time() intervals use preset round-number time boundaries that are independent of time conditions in the WHERE clause, results may include timestamps outside of the queried time range. Results represent only data with timestamps in the specified time range, but output timestamps are determined by by the preset time boundaries.

The following example groups data by 1-hour intervals, but the time range defined in the WHERE clause covers only part of a window:

SELECT MEAN(field)
FROM example 
WHERE
  time >= '2022-01-01T00:30:00Z'
  AND time <= '2022-01-01T01:30:00Z'
GROUP BY time(1h)

Note: The timestamp in the first row of query results data occurs before the start of the queried time range. See why.

Example data

time	field
2022-01-01T00:00:00Z	8
2022-01-01T00:15:00Z	4
2022-01-01T00:30:00Z	0
2022-01-01T00:45:00Z	8
2022-01-01T01:00:00Z	5
2022-01-01T01:15:00Z	0
2022-01-01T01:30:00Z	8
2022-01-01T01:45:00Z	8
2022-01-01T02:00:00Z	9
2022-01-01T02:15:00Z	6
2022-01-01T02:30:00Z	3
2022-01-01T02:45:00Z	0

Query results

time	field
2022-01-01T00:00:00Z	4
2022-01-01T01:00:00Z	5.25
2022-01-01T02:00:00Z	6

Why do these results include timestamps outside of the queried time range?

Fill with no data in the queried time range

Queries ignore fill() if no data exists in the queried time range. This is the expected behavior.

Fill with previous if no previous value exists

fill(previous) doesn’t fill null values if there is no previous value in the queried time range.

Fill with linear interpolation if there are not two values to interpolate between

fill(linear) doesn’t fill null values if there are no values before or after the null value in the queried time range.

Was this page helpful?

Thank you for your feedback!

Support and feedback

Thank you for being part of our community! We welcome and encourage your feedback and bug reports for InfluxDB and this documentation. To find support, use the following resources:

Customers with an annual or support contract can contact InfluxData Support.

Edit this page Submit docs issue Submit InfluxDB issue