Documentation

statsmodels.linearRegression() function

statsmodels.linearRegression() is a user-contributed function maintained by the package author.

statsmodels.linearRegression() performs a linear regression.

It calculates and returns ŷ (y_hat), and residual sum of errors (rse). Output data includes the following columns:

  • N: Number of points in the calculation.
  • slope: Slope of the calculated regression.
  • sx: Sum of x.
  • sxx: Sum of x squared.
  • sxy: Sum of x*y.
  • sy: Sum of y.
  • errors: Residual sum of squares. Defined by (r.y - r.y_hat) ^ 2 in this context
  • x: An index [1,2,3,4…n], with the assumption that the timestamps are regularly spaced.
  • y: Field value
  • y_hat: Linear regression values
Function type signature
(
    <-tables: stream[A],
) => stream[{
    B with
    y_hat: float,
    y: float,
    x: float,
    sy: H,
    sxy: G,
    sxx: F,
    sx: E,
    slope: D,
    errors: float,
    N: C,
}] where A: Record, D: Divisible + Subtractable

For more information, see Function type signatures.

Parameters

tables

Input data. Default is piped-forward data (<-).

Examples

Perform a linear regression on a dataset

import "contrib/anaisdg/statsmodels"
import "sampledata"

sampledata.float()
    |> statsmodels.linearRegression()

Was this page helpful?

Thank you for your feedback!


The future of Flux

Flux is going into maintenance mode. You can continue using it as you currently are without any changes to your code.

Flux is going into maintenance mode and will not be supported in InfluxDB 3.0. This was a decision based on the broad demand for SQL and the continued growth and adoption of InfluxQL. We are continuing to support Flux for users in 1.x and 2.x so you can continue using it with no changes to your code. If you are interested in transitioning to InfluxDB 3.0 and want to future-proof your code, we suggest using InfluxQL.

For information about the future of Flux, see the following: