Time series

`polars_ta.wq.time_series`

Functions:

Name	Description
`ts_arg_max`	最大值相对位置
`ts_arg_min`	最小值相对位置
`ts_co_kurtosis`	计算两个序列在滚动窗口内联合分布的协峰度
`ts_co_skewness`	计算两个序列在滚动窗口内联合分布的协偏度
`ts_corr`	时序滚动相关系数
`ts_count`	时序滚动计数
`ts_count_eq`	D天内最近连续出现N次
`ts_count_ge`	D天内最近连续出现至少N次
`ts_count_nans`	时序滚动统计nan出现次数
`ts_count_nulls`	时序滚动统计null出现次数
`ts_covariance`	时序滚动协方差
`ts_cum_count`	时序累计计数
`ts_cum_max`	时序累计最大值
`ts_cum_min`	时序累计最小值
`ts_cum_prod`	时序累乘
`ts_cum_prod_by`	带设置的累乘
`ts_cum_sum`	时序累加
`ts_cum_sum_by`	带设置的累加
`ts_cum_sum_reset`	时序累加。遇到0、nan、相反符号时重置
`ts_decay_exp_window`	指数衰减移动平均
`ts_decay_linear`	线性衰减移动平均
`ts_delay`	时序数据移动
`ts_delta`	时序差分
`ts_fill_null`	用上一个非空值填充空值
`ts_ir`	时序滚动信息系数
`ts_kurtosis`	时序滚动峰度
`ts_l2_norm`	欧几里得范数
`ts_log_diff`	求对数，然后时序滚动差分
`ts_max`	时序滚动最大值
`ts_max_diff`	窗口内最大值与当前值的差异‌
`ts_mean`	简单移动平均
`ts_median`	时序滚动中位数
`ts_min`	时序滚动最小值
`ts_min_diff`	窗口内最小值与当前值的差异‌
`ts_min_max_cps`	计算时间窗口内最小值与最大值的总和减去当前值的加权结果
`ts_min_max_diff`	计算当前值 x 与基于时间窗口内最小值、最大值的加权组合的差值
`ts_moment`	滚动k阶中心距
`ts_partial_corr`	滚动偏相关
`ts_percentage`	滚动百分位数
`ts_pred`	多元时序滚动回归预测
`ts_product`	时序滚动乘
`ts_rank`	时序滚动排名
`ts_realized_volatility`	已实现波动率
`ts_regression_intercept`	时序滚动回归取截距
`ts_regression_pred`	时序滚动回归取y的预测值
`ts_regression_resid`	时序滚动回归取残差
`ts_regression_slope`	时序滚动回归取斜率
`ts_resid`	多元时序滚动回归取残差
`ts_returns`	简单收益率
`ts_scale`	时序滚动缩放。相当于ts_minmax
`ts_shifts_v1`	时序上按顺序进行平移，然后逻辑与
`ts_shifts_v2`	时序上按顺序进行平移，然后逻辑与
`ts_shifts_v3`	时序上按顺序进行平移，然后逻辑或
`ts_signals_to_size`	多空信号转持仓。参考于`vectorbt`
`ts_skewness`	时序滚动偏度
`ts_std_dev`	时序滚动标准差
`ts_sum`	时序滚动求和
`ts_sum_split_by`	切割论求和。在d窗口范围内以by为依据进行从小到大排序。取最大的N个和最小的N个对应位置的x的和
`ts_triple_corr`	时序滚动三重相关系数
`ts_weighted_decay`	时序滚动加权衰减求和
`ts_weighted_mean`	时序滚动加权平均
`ts_weighted_sum`	时序滚动加权求和
`ts_zscore`	时序滚动zscore

`ts_arg_max(x: Expr, d: int = 5, reverse: bool = True, min_samples: Optional[int] = None) -> Expr`

最大值相对位置

最近的一天记为第 0 天，最远的一天为第 d-1 天

Returns the relative index of the max value in the time series for the past d days. If the current day has the max value for the past d days, it returns 0. If previous day has the max value for the past d days, it returns 1.

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`5`
`reverse`	`bool`	反向	`True`
`min_samples`	`Optional[int]`		`None`

`ts_arg_min(x: Expr, d: int = 5, reverse: bool = True, min_samples: Optional[int] = None) -> Expr`

最小值相对位置

最近的一天记为第 0 天，最远的一天为第 d-1 天

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`5`
`reverse`	`bool`	反向	`True`
`min_samples`	`Optional[int]`		`None`

`ts_co_kurtosis(x: Expr, y: Expr, d: int = 5, ddof: int = 0, min_samples: Optional[int] = None) -> Expr`

计算两个序列在滚动窗口内联合分布的协峰度

`ts_co_skewness(x: Expr, y: Expr, d: int = 5, ddof: int = 0, min_samples: Optional[int] = None) -> Expr`

计算两个序列在滚动窗口内联合分布的协偏度

`ts_corr(x: Expr, y: Expr, d: int = 5, ddof: int = 1, min_samples: Optional[int] = None) -> Expr`

时序滚动相关系数

rolling correlation between two columns

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`y`	`Expr`		required
`d`	`int`		`5`
`ddof`	`int`	自由度	`1`
`min_samples`	`Optional[int]`		`None`

Notes

x、y不区分先后

`ts_count(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

时序滚动计数

Parameters:

Name	Type	Default
`x`	`Expr`	required
`d`	`int`	`30`
`min_samples`	`Optional[int]`	`None`

Examples:

df = pl.DataFrame({
    'a': [None, -1, 0, 1, 2, 3],
    'b': [None, True, True, True, False, False],
}).with_columns(
    out1=ts_count(pl.col('a'), 3),
    out2=ts_count(pl.col('b'), 3),
)

shape: (6, 4)
┌──────┬───────┬──────┬──────┐
│ a    ┆ b     ┆ out1 ┆ out2 │
│ ---  ┆ ---   ┆ ---  ┆ ---  │
│ i64  ┆ bool  ┆ u32  ┆ u32  │
╞══════╪═══════╪══════╪══════╡
│ null ┆ null  ┆ null ┆ null │
│ -1   ┆ true  ┆ null ┆ null │
│ 0    ┆ true  ┆ null ┆ null │
│ 1    ┆ true  ┆ 2    ┆ 3    │
│ 2    ┆ false ┆ 2    ┆ 2    │
│ 3    ┆ false ┆ 3    ┆ 1    │
└──────┴───────┴──────┴──────┘

`ts_count_eq(x: Expr, d: int = 30, n: int = 10, min_samples: Optional[int] = None) -> Expr`

D天内最近连续出现N次

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`	窗口大小	`30`
`n`	`int`	连续出现次数	`10`

`ts_count_ge(x: Expr, d: int = 30, n: int = 10, min_samples: Optional[int] = None) -> Expr`

D天内最近连续出现至少N次

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`	窗口大小	`30`
`n`	`int`	至少连续出现次数	`10`

`ts_count_nans(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动统计nan出现次数

Parameters:

Name	Type	Default
`x`	`Expr`	required
`d`	`int`	`5`
`min_samples`	`Optional[int]`	`None`

Examples:

df = pl.DataFrame({
    'a': [None, float('nan'), -1, 0, 1, 2, 3],
}).with_columns(
    out1=ts_count_nans(pl.col('a'), 3),
)

shape: (7, 2)
┌──────┬──────┐
│ a    ┆ out1 │
│ ---  ┆ ---  │
│ f64  ┆ u32  │
╞══════╪══════╡
│ null ┆ null │
│ NaN  ┆ null │
│ -1.0 ┆ null │
│ 0.0  ┆ 1    │
│ 1.0  ┆ 0    │
│ 2.0  ┆ 0    │
│ 3.0  ┆ 0    │
└──────┴──────┘

`ts_count_nulls(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动统计null出现次数

Parameters:

Name	Type	Default
`x`	`Expr`	required
`d`	`int`	`5`
`min_samples`	`Optional[int]`	`None`

Examples:

df = pl.DataFrame({
    'a': [None, -1, 0, 1, 2, 3],
    'b': [None, True, True, True, False, False],
}).with_columns(
    out1=ts_count_nulls(pl.col('a'), 3),
    out2=ts_count_nulls(pl.col('b'), 3),
)
shape: (6, 4)
┌──────┬───────┬──────┬──────┐
│ a    ┆ b     ┆ out1 ┆ out2 │
│ ---  ┆ ---   ┆ ---  ┆ ---  │
│ i64  ┆ bool  ┆ u32  ┆ u32  │
╞══════╪═══════╪══════╪══════╡
│ null ┆ null  ┆ null ┆ null │
│ -1   ┆ true  ┆ null ┆ null │
│ 0    ┆ true  ┆ 1    ┆ 1    │
│ 1    ┆ true  ┆ 0    ┆ 0    │
│ 2    ┆ false ┆ 0    ┆ 0    │
│ 3    ┆ false ┆ 0    ┆ 0    │
└──────┴───────┴──────┴──────┘

`ts_covariance(x: Expr, y: Expr, d: int = 5, ddof: int = 1, min_samples: Optional[int] = None) -> Expr`

时序滚动协方差

rolling covariance between two columns

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`y`	`Expr`		required
`d`	`int`		`5`
`ddof`	`int`	自由度	`1`
`min_samples`	`Optional[int]`		`None`

Notes

x、y不区分先后

`ts_cum_count(x: Expr) -> Expr`

时序累计计数

Examples:

df = pl.DataFrame({
    'a': [None, None, -1, 0, 1, 2],
    'b': [None, None, True, True, True, False],
}).with_columns(
    out1=ts_cum_count(pl.col('a')),
    out2=ts_cum_count(pl.col('b')),
)
shape: (6, 4)
┌──────┬───────┬──────┬──────┐
│ a    ┆ b     ┆ out1 ┆ out2 │
│ ---  ┆ ---   ┆ ---  ┆ ---  │
│ i64  ┆ bool  ┆ u32  ┆ u32  │
╞══════╪═══════╪══════╪══════╡
│ null ┆ null  ┆ 0    ┆ 0    │
│ null ┆ null  ┆ 0    ┆ 0    │
│ -1   ┆ true  ┆ 1    ┆ 1    │
│ 0    ┆ true  ┆ 2    ┆ 2    │
│ 1    ┆ true  ┆ 3    ┆ 3    │
│ 2    ┆ false ┆ 4    ┆ 4    │
└──────┴───────┴──────┴──────┘

`ts_cum_max(x: Expr) -> Expr`

时序累计最大值

Examples:

df = pl.DataFrame({
    'a': [None, None, -1, 0, 2, 1],
    'b': [None, None, True, False, False, True],
}).with_columns(
    out1=ts_cum_max(pl.col('a')),
    out2=ts_cum_max(pl.col('b')),
)
shape: (6, 4)
┌──────┬───────┬──────┬──────┐
│ a    ┆ b     ┆ out1 ┆ out2 │
│ ---  ┆ ---   ┆ ---  ┆ ---  │
│ i64  ┆ bool  ┆ i64  ┆ bool │
╞══════╪═══════╪══════╪══════╡
│ null ┆ null  ┆ null ┆ null │
│ null ┆ null  ┆ null ┆ null │
│ -1   ┆ true  ┆ -1   ┆ true │
│ 0    ┆ false ┆ 0    ┆ true │
│ 2    ┆ false ┆ 2    ┆ true │
│ 1    ┆ true  ┆ 2    ┆ true │
└──────┴───────┴──────┴──────┘

`ts_cum_min(x: Expr) -> Expr`

时序累计最小值

Examples:

df = pl.DataFrame({
    'a': [None, None, -1, 0, -2, 1],
    'b': [None, None, True, False, False, True],
}).with_columns(
    out1=ts_cum_min(pl.col('a')),
    out2=ts_cum_min(pl.col('b')),
)
shape: (6, 4)
┌──────┬───────┬──────┬───────┐
│ a    ┆ b     ┆ out1 ┆ out2  │
│ ---  ┆ ---   ┆ ---  ┆ ---   │
│ i64  ┆ bool  ┆ i64  ┆ bool  │
╞══════╪═══════╪══════╪═══════╡
│ null ┆ null  ┆ null ┆ null  │
│ null ┆ null  ┆ null ┆ null  │
│ -1   ┆ true  ┆ -1   ┆ true  │
│ 0    ┆ false ┆ -1   ┆ false │
│ -2   ┆ false ┆ -2   ┆ false │
│ 1    ┆ true  ┆ -2   ┆ false │
└──────┴───────┴──────┴───────┘

`ts_cum_prod(x: Expr) -> Expr`

时序累乘

`ts_cum_prod_by(r: Expr, v: Expr) -> Expr`

带设置的累乘

可用于市值累乘日收益率得到新市值的需求

Parameters:

Name	Type	Description	Default
`r`	`Expr`	收益率。不能出现`null`, `null`需要提前用`1`代替 `CLOSE/ts_delay(CLOSE, 1)`	required
`v`	`Expr`	市值。非空时分配指定市值资产如果非`null`，直接返回`v` 如果`null`，返回`V[-1]*r`	required

Returns:

Type	Description
`Expr`	V。累乘后成新的市值

Examples:

df = pl.DataFrame({
    'r': [1, 2, 3, 4, 5, 6],
    'v': [None, None, 6, None, None, 12],

}).with_columns(
    V=ts_cum_prod_by(pl.col('r'), pl.col('v'))
)

shape: (6, 3)
┌─────┬──────┬───────┐
│ r   ┆ v    ┆ V     │
│ --- ┆ ---  ┆ ---   │
│ i64 ┆ i64  ┆ f64   │
╞═════╪══════╪═══════╡
│ 1   ┆ null ┆ null  │
│ 2   ┆ null ┆ null  │
│ 3   ┆ 6    ┆ 6.0   │
│ 4   ┆ null ┆ 24.0  │
│ 5   ┆ null ┆ 120.0 │
│ 6   ┆ 12   ┆ 12.0  │
└─────┴──────┴───────┘

`ts_cum_sum(x: Expr) -> Expr`

时序累加

`ts_cum_sum_by(r: Expr, v: Expr) -> Expr`

带设置的累加

可用于市值累加日收益得到新市值的需求

Parameters:

Name	Type	Description	Default
`r`	`Expr`	收益。不能出现`null`, `null`需要提前用`0`代替 `CLOSE-ts_delay(CLOSE, 1)`	required
`v`	`Expr`	市值。非空时分配指定市值资产如果非`null`，直接返回`v` 如果`null`，返回`V[-1]+r`	required

Examples:

df = pl.DataFrame({
    'r': [1, 2, 3, 4, 5, 6],
    'v': [None, None, 6, None, None, 12],

}).with_columns(
    V=ts_cum_sum_by(pl.col('r'), pl.col('v'))
)

shape: (6, 3)
┌─────┬──────┬──────┐
│ r   ┆ v    ┆ V    │
│ --- ┆ ---  ┆ ---  │
│ i64 ┆ i64  ┆ f64  │
╞═════╪══════╪══════╡
│ 1   ┆ null ┆ null │
│ 2   ┆ null ┆ null │
│ 3   ┆ 6    ┆ 6.0  │
│ 4   ┆ null ┆ 10.0 │
│ 5   ┆ null ┆ 15.0 │
│ 6   ┆ 12   ┆ 12.0 │
└─────┴──────┴──────┘

`ts_cum_sum_reset(x: Expr) -> Expr`

时序累加。遇到0、nan、相反符号时重置

Examples:

df = pl.DataFrame({
    'a': [1, 0, 1, 2, None, 3, -2, -3],
}).with_columns(
    A=ts_cum_sum_reset(pl.col('a'))
)

shape: (8, 2)
┌──────┬──────┐
│ a    ┆ A    │
│ ---  ┆ ---  │
│ i64  ┆ f64  │
╞══════╪══════╡
│ 1    ┆ 1.0  │
│ 0    ┆ 0.0  │
│ 1    ┆ 1.0  │
│ 2    ┆ 3.0  │
│ null ┆ 0.0  │
│ 3    ┆ 3.0  │
│ -2   ┆ -2.0 │
│ -3   ┆ -5.0 │
└──────┴──────┘

`ts_decay_exp_window(x: Expr, d: int = 30, factor: float = 1.0, min_samples: Optional[int] = None) -> Expr`

指数衰减移动平均

Examples:

from polars_ta.wq.time_series import ts_decay_linear, ts_decay_exp_window

df = pl.DataFrame({
    'a': [None, 6, 5, 4, 5, 30],
}).with_columns(
    out1=ts_decay_linear(pl.col('a'), 5),
    out2=ts_decay_exp_window(pl.col('a'), 5, 0.5),
)
shape: (6, 3)
┌──────┬──────┬───────────┐
│ a    ┆ out1 ┆ out2      │
│ ---  ┆ ---  ┆ ---       │
│ i64  ┆ f64  ┆ f64       │
╞══════╪══════╪═══════════╡
│ null ┆ null ┆ null      │
│ 6    ┆ null ┆ null      │
│ 5    ┆ null ┆ null      │
│ 4    ┆ null ┆ null      │
│ 5    ┆ null ┆ null      │
│ 30   ┆ 13.2 ┆ 17.806452 │
└──────┴──────┴───────────┘

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`30`
`factor`	`float`	衰减系数	`1.0`
`min_samples`	`Optional[int]`		`None`

References

https://platform.worldquantbrain.com/learn/operators/detailed-operator-descriptions#ts_decay_exp_windowx-d-factor-10-nan-true

`ts_decay_linear(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

线性衰减移动平均

Examples:

from polars_ta.talib import WMA as ts_WMA
from polars_ta.wq.time_series import ts_decay_linear

df = pl.DataFrame({
    'a': [None, 6, 5, 4, 5, 30],
}).with_columns(
    out1=ts_decay_linear(pl.col('a'), 5),
    out2=ts_WMA(pl.col('a'), 5),
)

shape: (6, 3)
┌──────┬──────┬──────┐
│ a    ┆ out1 ┆ out2 │
│ ---  ┆ ---  ┆ ---  │
│ i64  ┆ f64  ┆ f64  │
╞══════╪══════╪══════╡
│ null ┆ null ┆ null │
│ 6    ┆ null ┆ null │
│ 5    ┆ null ┆ null │
│ 4    ┆ null ┆ null │
│ 5    ┆ null ┆ null │
│ 30   ┆ 13.2 ┆ 13.2 │
└──────┴──────┴──────┘

References

https://platform.worldquantbrain.com/learn/operators/detailed-operator-descriptions#ts_decay_linearx-d-dense-false

`ts_delay(x: Expr, d: int = 1, fill_value: float = None) -> Expr`

时序数据移动

shift x

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`	向前或向后的移动天数	`1`
`fill_value`	`float`	填充。可用None、常量或Expr	`None`

`ts_delta(x: Expr, d: int = 1) -> Expr`

时序差分

`ts_fill_null(x: Expr, limit: int = None) -> Expr`

用上一个非空值填充空值

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`limit`	`int`	最大填充次数	`None`

Examples:

df = pl.DataFrame({
    'a': [None, -1, 1, None, None],
    'b': [None, True, False, None, None],
}).with_columns(
    out1=ts_fill_null(pl.col('a')),
    out2=ts_fill_null(pl.col('b'), 1),
)

shape: (5, 4)
┌──────┬───────┬──────┬───────┐
│ a    ┆ b     ┆ out1 ┆ out2  │
│ ---  ┆ ---   ┆ ---  ┆ ---   │
│ i64  ┆ bool  ┆ i64  ┆ bool  │
╞══════╪═══════╪══════╪═══════╡
│ null ┆ null  ┆ null ┆ null  │
│ -1   ┆ true  ┆ -1   ┆ true  │
│ 1    ┆ false ┆ 1    ┆ false │
│ null ┆ null  ┆ 1    ┆ false │
│ null ┆ null  ┆ 1    ┆ null  │
└──────┴───────┴──────┴───────┘

`ts_ir(x: Expr, d: int = 1, min_samples: Optional[int] = None) -> Expr`

时序滚动信息系数

rolling information ratio

`ts_kurtosis(x: Expr, d: int = 5, bias: bool = False, min_samples: Optional[int] = None) -> Expr`

时序滚动峰度

kurtosis of x for the last d days

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`5`
`bias`	`bool`	有偏	`False`
`min_samples`	`Optional[int]`		`None`

Notes

bias=False时与pandas结果一样

Examples:

df = pl.DataFrame({
    'a': [None, 1, 2, 3, 4, 999],
}).with_columns(
    out1=pl.col('a').map_batches(lambda x: pl.Series(pd.Series(x).rolling(4).kurt())),
    out2=ts_kurtosis(pl.col('a'), 4),
)

shape: (6, 3)
┌──────┬──────────┬──────────┐
│ a    ┆ out1     ┆ out2     │
│ ---  ┆ ---      ┆ ---      │
│ i64  ┆ f64      ┆ f64      │
╞══════╪══════════╪══════════╡
│ null ┆ null     ┆ null     │
│ 1    ┆ null     ┆ null     │
│ 2    ┆ null     ┆ null     │
│ 3    ┆ null     ┆ null     │
│ 4    ┆ -1.2     ┆ -1.2     │
│ 999  ┆ 3.999946 ┆ 3.999946 │
└──────┴──────────┴──────────┘

`ts_l2_norm(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

欧几里得范数

Euclidean norm

`ts_log_diff(x: Expr, d: int = 1) -> Expr`

求对数，然后时序滚动差分

log(current value of input or x[t] ) - log(previous value of input or x[t-1]).

`ts_max(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

时序滚动最大值

`ts_max_diff(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

窗口内最大值与当前值的差异‌

x - ts_max(x, d)

`ts_mean(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

简单移动平均

`ts_median(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动中位数

`ts_min(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

时序滚动最小值

`ts_min_diff(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

窗口内最小值与当前值的差异‌

x - ts_min(x, d)

`ts_min_max_cps(x: Expr, d: int, f: float = 2.0, min_samples: Optional[int] = None) -> Expr`

计算时间窗口内最小值与最大值的总和减去当前值的加权结果

(ts_min(x, d) + ts_max(x, d)) - f * x

`ts_min_max_diff(x: Expr, d: int, f: float = 0.5, min_samples: Optional[int] = None) -> Expr`

计算当前值 x 与基于时间窗口内最小值、最大值的加权组合的差值

x - f * (ts_min(x, d) + ts_max(x, d))

`ts_moment(x: Expr, d: int, k: int = 0, min_samples: Optional[int] = None) -> Expr`

滚动k阶中心距

Returns K-th central moment of x for the past d days.

Parameters:

Name	Type	Default
`x`	`Expr`	required
`d`	`int`	required
`k`	`int`	`0`
`min_samples`	`Optional[int]`	`None`

`ts_partial_corr(x: Expr, y: Expr, z: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

滚动偏相关

Returns partial correlation of x, y, z for the past d days.

`ts_percentage(x: Expr, d: int, percentage: float = 0.5, min_samples: Optional[int] = None) -> Expr`

滚动百分位数

Returns percentile value of x for the past d days.

Parameters:

Name	Type	Default
`x`	`Expr`	required
`d`	`int`	required
`percentage`	`float`	`0.5`
`min_samples`	`Optional[int]`	`None`

`ts_pred(y: Expr, *more_x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

多元时序滚动回归预测

Parameters:

Name	Type	Description	Default
`y`	`Expr`		required
`*more_x`	`Expr`	多个x	`()`
`d`	`int`		`30`
`min_samples`	`Optional[int]`		`None`

`ts_product(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动乘

`ts_rank(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动排名

`ts_realized_volatility(close: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

已实现波动率

`ts_regression_intercept(y: Expr, x: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动回归取截距

`ts_regression_pred(y: Expr, x: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动回归取y的预测值

`ts_regression_resid(y: Expr, x: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动回归取残差

`ts_regression_slope(y: Expr, x: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动回归取斜率

`ts_resid(y: Expr, *more_x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

多元时序滚动回归取残差

Parameters:

Name	Type	Description	Default
`y`	`Expr`		required
`*more_x`	`Expr`	多个x	`()`
`d`	`int`		`30`
`min_samples`	`Optional[int]`		`None`

`ts_returns(x: Expr, d: int = 1) -> Expr`

简单收益率

`ts_scale(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动缩放。相当于ts_minmax

Returns (x – ts_min(x, d)) / (ts_max(x, d) – ts_min(x, d)) + constant

`ts_shifts_v1(*args) -> Expr`

时序上按顺序进行平移，然后逻辑与

比如今天反包，昨天阴跌，前天涨停。只需要写ts_shifts_v1(反包,阴跌,涨停)。使用此函数能一定程度上简化代码

Parameters:

Name	Type	Description	Default
`args`		pl.col按照顺序排列	`()`

Examples:

df = pl.DataFrame({
    'a': [None, False, False, True, True, True],
    'b': [None, False, True, True, True, True],
    'c': [None, True, True, True, True, True],
}).with_columns(
    out1=ts_shifts_v1(pl.col('a'), pl.col('b'), pl.col('c')),
    out2=pl.col('a').shift(0) & pl.col('b').shift(1) & pl.col('c').shift(2),
)

shape: (6, 5)
┌──────┬───────┬───────┬───────┬───────┐
│ c    ┆ b     ┆ a     ┆ out1  ┆ out2  │
│ ---  ┆ ---   ┆ ---   ┆ ---   ┆ ---   │
│ bool ┆ bool  ┆ bool  ┆ bool  ┆ bool  │
╞══════╪═══════╪═══════╪═══════╪═══════╡
│ null ┆ null  ┆ null  ┆ null  ┆ null  │
│ true ┆ false ┆ false ┆ false ┆ false │
│ true ┆ true  ┆ false ┆ false ┆ false │
│ true ┆ true  ┆ true  ┆ true  ┆ true  │
│ true ┆ true  ┆ true  ┆ true  ┆ true  │
│ true ┆ true  ┆ true  ┆ true  ┆ true  │
└──────┴───────┴───────┴───────┴───────┘

`ts_shifts_v2(*args) -> Expr`

时序上按顺序进行平移，然后逻辑与

遇到连续条件时可简化参数。如连续3天涨停后连续2天跌停，可用ts_shifts_v2(跌停,2,涨停,3) 另一种实现方法是：ts_delay((ts_count(涨停,3)==3),2)&(ts_count(跌停,2)==2)

Parameters:

Name	Type	Description	Default
`args`		pl.col, repeat。两个参数循环	`()`

Examples:

df = pl.DataFrame({
    'a': [None, False, False, True, True, True],
    'b': [None, False, True, True, True, True],
    'c': [None, True, True, True, True, True],
}).with_columns(
    out1=ts_shifts_v1(pl.col('a'), pl.col('b'), pl.col('b')),
    out2=ts_shifts_v2(pl.col('a'), 1, pl.col('b'), 2),
    out3=pl.col('a').shift(0) & pl.col('b').shift(1) & pl.col('b').shift(2),
)

shape: (6, 6)
┌───────┬───────┬──────┬───────┬───────┬───────┐
│ a     ┆ b     ┆ c    ┆ out1  ┆ out2  ┆ out3  │
│ ---   ┆ ---   ┆ ---  ┆ ---   ┆ ---   ┆ ---   │
│ bool  ┆ bool  ┆ bool ┆ bool  ┆ bool  ┆ bool  │
╞═══════╪═══════╪══════╪═══════╪═══════╪═══════╡
│ null  ┆ null  ┆ null ┆ null  ┆ null  ┆ null  │
│ false ┆ false ┆ true ┆ false ┆ false ┆ false │
│ false ┆ true  ┆ true ┆ false ┆ false ┆ false │
│ true  ┆ true  ┆ true ┆ false ┆ false ┆ false │
│ true  ┆ true  ┆ true ┆ true  ┆ true  ┆ true  │
│ true  ┆ true  ┆ true ┆ true  ┆ true  ┆ true  │
└───────┴───────┴──────┴───────┴───────┴───────┘

`ts_shifts_v3(*args) -> Expr`

时序上按顺序进行平移，然后逻辑或

如：涨停后连续下跌1~3天。这个案例会导致涨停可能出现在动态的一天。ts_shifts_v3(下跌,1,3,涨停,1,1) 它本质上是ts_shifts_v2(下跌,1,涨停,1)|ts_shifts_v2(下跌,2,涨停,1)|ts_shifts_v2(下跌,3,涨停,1)|

Parameters:

Name	Type	Description	Default
`args`		pl.col, [start, end]。三个参数循环。start/end都是闭区间	`()`

Examples:

df = pl.DataFrame({
    'a': [None, False, False, True, True, True],
    'b': [None, False, True, True, True, True],
    'c': [None, True, True, True, True, True],
}).with_columns(
    out0=ts_shifts_v3(pl.col('a'), 1, 3, pl.col('b'), 2, 2),
    out1=ts_shifts_v2(pl.col('a'), 1, pl.col('b'), 2),
    out2=ts_shifts_v2(pl.col('a'), 2, pl.col('b'), 2),
    out3=ts_shifts_v2(pl.col('a'), 3, pl.col('b'), 2)
).with_columns(
    out4=pl.col('out1') | pl.col('out2') | pl.col('out3')
)

shape: (6, 8)
┌───────┬───────┬──────┬───────┬───────┬───────┬───────┬───────┐
│ a     ┆ b     ┆ c    ┆ out0  ┆ out1  ┆ out2  ┆ out3  ┆ out4  │
│ ---   ┆ ---   ┆ ---  ┆ ---   ┆ ---   ┆ ---   ┆ ---   ┆ ---   │
│ bool  ┆ bool  ┆ bool ┆ bool  ┆ bool  ┆ bool  ┆ bool  ┆ bool  │
╞═══════╪═══════╪══════╪═══════╪═══════╪═══════╪═══════╪═══════╡
│ null  ┆ null  ┆ null ┆ null  ┆ null  ┆ null  ┆ null  ┆ null  │
│ false ┆ false ┆ true ┆ false ┆ false ┆ false ┆ false ┆ false │
│ false ┆ true  ┆ true ┆ false ┆ false ┆ false ┆ false ┆ false │
│ true  ┆ true  ┆ true ┆ false ┆ false ┆ false ┆ false ┆ false │
│ true  ┆ true  ┆ true ┆ true  ┆ true  ┆ false ┆ false ┆ true  │
│ true  ┆ true  ┆ true ┆ true  ┆ true  ┆ true  ┆ false ┆ true  │
└───────┴───────┴──────┴───────┴───────┴───────┴───────┴───────┘

`ts_signals_to_size(long_entry: Expr, long_exit: Expr, short_entry: Expr, short_exit: Expr, accumulate: bool = False, action: bool = False) -> Expr`

多空信号转持仓。参考于vectorbt

Parameters:

Name	Type	Description	Default
`long_entry`	`Expr`	多头入场	required
`long_exit`	`Expr`	多头出场	required
`short_entry`	`Expr`	空头入场	required
`short_exit`	`Expr`	空头出场	required
`accumulate`	`bool`	遇到重复信号时是否累计	`False`
`action`	`bool`	返回持仓状态还是下单操作	`False`

`ts_skewness(x: Expr, d: int = 5, bias: bool = False, min_samples: Optional[int] = None) -> Expr`

时序滚动偏度

Return skewness of x for the past d days

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`5`
`bias`	`bool`	有偏	`False`
`min_samples`	`Optional[int]`		`None`

Notes

bias=False时与pandas结果一样

`ts_std_dev(x: Expr, d: int = 5, ddof: int = 0, min_samples: Optional[int] = None) -> Expr`

时序滚动标准差

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`d`	`int`		`5`
`ddof`	`int`	自由度	`0`
`min_samples`	`Optional[int]`		`None`

`ts_sum(x: Expr, d: int = 30, min_samples: Optional[int] = None) -> Expr`

时序滚动求和

`ts_sum_split_by(x: Expr, by: Expr, d: int = 30, k: int = 10) -> Expr`

切割论求和。在d窗口范围内以by为依据进行从小到大排序。取最大的N个和最小的N个对应位置的x的和

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`by`	`Expr`		required
`d`	`int`	窗口大小	`30`
`k`	`int`	最大最小的k个	`10`

Returns:

Type	Description
`Expr`	top_k bottom_k

Examples:

df = pl.DataFrame({
    'a': [10, 20, 30, 40, 50, 60, 70, 80, 90, 100],
    'b': [10, 9, 8, 7, 6, 5, 4, 3, 2, 1],
}).with_columns(
    A=ts_sum_split_by(pl.col('a'), pl.col('b'), 8, 3)
)

shape: (10, 3)
┌─────┬─────┬───────────────┐
│ a   ┆ b   ┆ A             │
│ --- ┆ --- ┆ ---           │
│ i64 ┆ i64 ┆ struct[2]     │
╞═════╪═════╪═══════════════╡
│ 10  ┆ 10  ┆ {null,null}   │
│ 20  ┆ 9   ┆ {null,null}   │
│ 30  ┆ 8   ┆ {null,null}   │
│ 40  ┆ 7   ┆ {null,null}   │
│ 50  ┆ 6   ┆ {null,null}   │
│ 60  ┆ 5   ┆ {null,null}   │
│ 70  ┆ 4   ┆ {null,null}   │
│ 80  ┆ 3   ┆ {210.0,60.0}  │
│ 90  ┆ 2   ┆ {240.0,90.0}  │
│ 100 ┆ 1   ┆ {270.0,120.0} │
└─────┴─────┴───────────────┘

`ts_triple_corr(x: Expr, y: Expr, z: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动三重相关系数

Returns triple correlation of x, y, z for the past d days.

`ts_weighted_decay(x: Expr, k: float = 0.5, min_samples: Optional[int] = None) -> Expr`

时序滚动加权衰减求和

Instead of replacing today’s value with yesterday’s as in ts_delay(x, 1), it assigns weighted average of today’s and yesterday’s values with weight on today’s value being k and yesterday’s being (1-k).

Parameters:

Name	Type	Description	Default
`x`	`Expr`		required
`k`	`float`	衰减系数	`0.5`
`min_samples`	`Optional[int]`		`None`

`ts_weighted_mean(x: Expr, w: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动加权平均

`ts_weighted_sum(x: Expr, w: Expr, d: int, min_samples: Optional[int] = None) -> Expr`

时序滚动加权求和

`ts_zscore(x: Expr, d: int = 5, min_samples: Optional[int] = None) -> Expr`

时序滚动zscore