Optimize Flux queries

This page documents an earlier version of InfluxDB OSS. InfluxDB 3 Core is the latest stable version.

Optimize your Flux queries to reduce their memory and compute (CPU) requirements.

Start queries with pushdowns
Avoid processing filters inline
Avoid short window durations
Use “heavy” functions sparingly
Use set() instead of map() when possible
Balance time range and data precision
Measure query performance with Flux profilers

Start queries with pushdowns

Pushdowns are functions or function combinations that push data operations to the underlying data source rather than operating on data in memory. Start queries with pushdowns to improve query performance. Once a non-pushdown function runs, Flux pulls data into memory and runs all subsequent operations there.

Pushdown functions and function combinations

The following pushdowns are supported in InfluxDB OSS 1.11+.

Functions
count()
drop()
duplicate()
filter() *
fill()
first()
group()
keep()
last()
max()
mean()
min()
range()
rename()
sum()
window()

Function combinations
group() |> max()
group() |> min()
window() |> count()
window() |> first()
window() |> last()
window() |> max()
window() |> min()
window() |> sum()

* filter() only pushes down when all parameter values are static. See Avoid processing filters inline.

Use pushdown functions and function combinations at the beginning of your query.

Pushdown functions in use

from(bucket: "db/rp")
    // Pushed to the data source
    |> range(start: -1h)
    |> filter(fn: (r) => r.sensor == "abc123")
    |> group(columns: ["_field", "host"])
    |> aggregateWindow(every: 5m, fn: max)
    |> filter(fn: (r) => r._value >= 90.0)
    // Run in memory
    |> top(n: 10)
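As a rough illustration of why ordering matters, the sketch below contrasts two versions of the same query. It assumes the `sensor` tag from the example above and float `_value` data; the `yield()` names are hypothetical. Because map() is not in the pushdown list, placing it before filter() should force the filter to run in memory.

```flux
// Slower ordering: map() is not a pushdown, so everything after it,
// including filter(), runs in memory on all rows returned by range().
from(bucket: "db/rp")
    |> range(start: -1h)
    |> map(fn: (r) => ({ r with _value: r._value * 2.0 })) // assumes float values
    |> filter(fn: (r) => r.sensor == "abc123")
    |> yield(name: "map_first")

// Preferred ordering: range() and filter() come first,
// so map() only processes the rows that survive the filter.
from(bucket: "db/rp")
    |> range(start: -1h)
    |> filter(fn: (r) => r.sensor == "abc123")
    |> map(fn: (r) => ({ r with _value: r._value * 2.0 })) // assumes float values
    |> yield(name: "filter_first")
```

Both queries return the same rows; only the amount of data loaded into memory before filtering differs.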

Avoid processing filters inline

Avoid using mathematical operations or string manipulation inline to define data filters. Processing filter values inline prevents filter() from pushing its operation down to the underlying data source, so data returned by the previous function loads into memory. This often results in a significant performance hit.

For example, the following query uses Chronograf dashboard template variables and string concatenation to define a region to filter by. Because filter() uses string concatenation inline, it can’t push its operation to the underlying data source and loads all data returned from range() into memory.

from(bucket: "db/rp")
    |> range(start: -1h)
    |> filter(fn: (r) => r.region == v.provider + v.region)

To dynamically set filters and maintain the pushdown ability of the filter() function, use variables to define filter values outside of filter():

region = v.provider + v.region

from(bucket: "db/rp")
    |> range(start: -1h)
    |> filter(fn: (r) => r.region == region)
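The same approach should apply to mathematical operations. The following is a minimal sketch, assuming float `_value` data; `baseline`, `margin`, and `threshold` are hypothetical names used only for illustration.

```flux
// Hypothetical values, computed once outside of filter()
baseline = 75.0
margin = 15.0
threshold = baseline + margin

from(bucket: "db/rp")
    |> range(start: -1h)
    // The predicate compares against a precomputed value
    // instead of evaluating `baseline + margin` inline.
    |> filter(fn: (r) => r._value >= threshold)
```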

Avoid short window durations

Windowing (grouping data based on time intervals) is commonly used to aggregate and downsample data. Increase performance by avoiding short window durations. More windows require more compute power to evaluate which window each row should be assigned to. Reasonable window durations depend on the total time range queried.
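To make the trade-off concrete, here is a sketch that scales the window duration to the queried range. The 30-day range, the `sensor` tag value, and the one-hour window are assumptions chosen for illustration, not recommended settings.

```flux
// Over a 30-day range:
//   every: 1h produces 30 * 24 = 720 windows per series
//   every: 1s would produce 30 * 86,400 = 2,592,000 windows per series
from(bucket: "db/rp")
    |> range(start: -30d)
    |> filter(fn: (r) => r.sensor == "abc123")
    |> aggregateWindow(every: 1h, fn: max)
```

For a shorter range, such as the last hour, a window of a few minutes (as in the earlier `every: 5m` example) keeps the window count similarly small.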

Use “heavy” functions sparingly

Some functions use more memory or CPU than others. Consider whether they are necessary in your data processing before using them.

Use set() instead of map() when possible

set(), experimental.set(), and map() can each set column values in data. However, the set functions have performance advantages over map().

Use the following guidelines to determine which to use:

If setting a column value to a predefined, static value, use set() or experimental.set().
If dynamically setting a column value using existing row data, use map().

Set a column value to a static value

The following queries are functionally the same, but using set() is more performant than using map().

// Not recommended
data
    |> map(fn: (r) => ({ r with foo: "bar" }))

// Recommended
data
    |> set(key: "foo", value: "bar")
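When several columns need static values, experimental.set() accepts a record of column names and values. The sketch below assumes the `db/rp` bucket from earlier examples; the column names and values are hypothetical.

```flux
import "experimental"

// Stand-in for any stream of tables
data = from(bucket: "db/rp")
    |> range(start: -1h)

// Set two columns to static values in a single call
data
    |> experimental.set(o: {foo: "bar", region: "us-west"})
```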

Dynamically set a column value using existing row data

data
    |> map(fn: (r) => ({ r with foo: r.bar }))

Balance time range and data precision

To ensure queries are performant, balance the time range and the precision of your data. For example, if you query data stored every second and request six months' worth of data, results would include ≈15.5 million points per series. Depending on the number of series returned after filter() (cardinality), this can quickly become many billions of points. Flux must store these points in memory to generate a response. Use pushdowns to optimize how many points are stored in memory.
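One way to strike this balance is to downsample long ranges before the data reaches memory. The following is a sketch only, reusing the `sensor` tag from earlier examples; the one-hour window is an assumption, and whether the aggregate is pushed down depends on your InfluxDB version.

```flux
// Six months of per-second data is roughly 15.5 million points per series.
// Aggregating into one-hour windows reduces that to roughly
// 180 days * 24 hours = 4,320 points per series.
from(bucket: "db/rp")
    |> range(start: -6mo)
    |> filter(fn: (r) => r.sensor == "abc123")
    |> aggregateWindow(every: 1h, fn: max)
```

If per-second precision is required, the alternative is to shorten the time range instead of widening the windows.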

Measure query performance with Flux profilers

Use the Flux Profiler package to measure query performance and append performance metrics to your query output. The following Flux profilers are available:

query: provides statistics about the execution of an entire Flux script.
operator: provides statistics about each operation in a query.

Import the profiler package and enable profilers with the profiler.enabledProfilers option.

import "profiler"

option profiler.enabledProfilers = ["query", "operator"]

// Query to profile

For more information about Flux profilers, see the Flux Profiler package.
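As an illustration, a complete profiled query might look like the following sketch. The bucket, tag, and window values are borrowed from the earlier examples and are placeholders for your own query.

```flux
import "profiler"

// Enable both profilers for this script
option profiler.enabledProfilers = ["query", "operator"]

// Query to profile
from(bucket: "db/rp")
    |> range(start: -1h)
    |> filter(fn: (r) => r.sensor == "abc123")
    |> aggregateWindow(every: 5m, fn: max)
```

The profiler results are appended to the query output as additional tables, so they can be compared before and after reordering functions or changing window durations.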
