Aggregation Functions
This page covers the aggregation functions available in Upsolver.
Aggregations are functions for grouping multiple events together to form a more significant result.
Unlike databases, Upsolver runs continuous queries rather than ad-hoc queries. With Upsolver aggregation results are updated incrementally with every incoming event.
Aggregation functions require windowing to split a stream into buckets of data that can be aggregated.
APPROXIMATE_COUNT_DISTINCT:
The approximate number of distinct values in the time window.
APPROX_COUNT_DISTINCT_EACH:
The approximate count of distinct values per group in the time window.
AVG:
The average value in the time window.
AVG_EACH:
The average value in the time window grouped by the given key.
AVG_TIME_SERIES:
The average of the values per time interval.
COLLECT_SET:
A set of all values encountered in the time interval.
COLLECT_SET_EACH:
A set of all values encountered in the time interval grouped by the given key.
COUNT:
The number of values in the time window.
COUNT(*):
The number of items in the time window.
COUNT_DISTINCT:
Counts the number of distinct values that appeared in the column per key value.
COUNT_EACH:
The number of items grouped by the given key.
COUNT_IF:
The number of true
values in the time window.
FIRST:
The first value in the time window.
FIRST_ARRAY:
The first array of values in the time window.
FIRST_EACH:
The first value per group.
FIRST_TIME_SERIES:
The first value per time interval.
LAST:
The last value in the time window.
LAST_ARRAY:
The last array of values in the time window.
LAST_EACH:
The last value per group.
LAST_K_EACH:
The last k
values per group.
LAST_TIME_SERIES:
The last value per interval.
MAX:
The maximum value in the time window.
MAX_BY:
The value correlating to the maximum sort in the time window.
MAX_EACH:
The maximum value per group.
MAX_TIME_SERIES:
The maximum value per time interval.
MIN:
The minimum value in the time window.
MIN_BY:
The value correlating to the minimum sort in the time window.
MIN_EACH:
The minimum value per group.
MIN_TIME_SERIES:
The minimum value per time interval.
SESSION_COUNT:
Stores the number of sessions.
STD_DEV: The standard deviation of values in the time window.
STD_DEV_EACH:
The standard deviation of the value per group.
STRING_MAX_EACH:
Stores the maximum string value per group, sorted case-sensitive lexicographically.
STRING_MIN_EACH:
Stores the minimum string value per group, sorted case-sensitive lexicographically.
SUM:
The sum of the values in the time window.
SUM_EACH:
The sum of the values per group.
SUM_TIME_SERIES:
The sum of the values per time interval.
WEIGHTED_AVERAGE:
The weighted average of a value in the time window.
Last updated