【Posted】: 2019-07-11 15:12:37
【Question】:
When I run
spark.sql('''
select client,avg(amount) over (partition by client
order by my_timestamp
range between interval 30 days preceding and current row) as moving_avg_amount
from my_table''')
it works.
But if I also want to exclude the last X days from the window, it fails. I tried:
... range between interval 30 days preceding and 12 days preceding ..
or:
... range between interval 30 days preceding and interval 12 days preceding ..
What is the correct syntax?
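For reference, one commonly used workaround (not necessarily the only valid syntax, and interval handling in window frames may differ between Spark versions) is to order by the timestamp cast to epoch seconds and give both frame bounds as plain numbers. The sketch below only builds the query string. It assumes `my_timestamp` is a timestamp column, treats every day as 86400 seconds, and uses a hypothetical helper name `moving_avg_sql`.

```python
SECONDS_PER_DAY = 86400  # assumption: fixed-length days, DST shifts ignored


def moving_avg_sql(start_days, end_days, table="my_table"):
    """Build a query whose frame covers rows from start_days ago to end_days ago.

    Hypothetical helper: both bounds are expressed in seconds because the
    window is ordered by the timestamp cast to a long (epoch seconds).
    """
    if start_days < end_days:
        raise ValueError("start_days must be >= end_days")
    lower = start_days * SECONDS_PER_DAY
    upper = end_days * SECONDS_PER_DAY
    return (
        "select client, avg(amount) over (partition by client "
        "order by cast(my_timestamp as long) "
        f"range between {lower} preceding and {upper} preceding) "
        f"as moving_avg_amount from {table}"
    )


query = moving_avg_sql(30, 12)
print(query)
# With an active SparkSession named `spark`, this would be run as:
# spark.sql(query)
```

Because the frame ends 12 days before the current row, rows with no data in that range should get a null average rather than an error.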
【Discussion】:
Tags: apache-spark pyspark pyspark-sql