Schema

Table Last Update

The timestamp of the last update by jobs writing to the table, indicating when the table was last modified. This only applies to datasets in your data lake:


Overview

The Overview card provides instant visibility into the following:

  • Number of Fields: the total count of all fields in the dataset, including system columns that are automatically added by Upsolver.

  • Number of Keys: the count of fields that appear unique.

    A field will be considered as unique if all values in the sample data of the column are distinct.

  • Number of Arrays: The count of fields which are an array or sub-fields within an array. This number corresponds to the 'Array' column in the Written Data Statistics table.


Written Rows Over Time

This graph is a visual display to show the number of rows ingested over the selected timespan, enabling you to quickly discover spikes or drops in your pipeline and troubleshoot unexpected volumes. By default, the chart shows written rows from the initial ingestion through to now., however you can change this. Click on the Time View button, which displays Lifetime by default, to change the timespan displayed in the dataset report:


Ingested Data Statistics

This table provides powerful metadata about your dataset. You can click on a column name in the table to drill deeper into the Column details, which includes stats for the Upsolver system columns appended to your dataset. To filter the columns in the dataset, type your search string into the Search box. Click the x icon in the Search box to clear the results.

The system columns that Upsolver appends to your datasets are included in the column list:

The Ingested Data Statistics table provides the following information:

MeasurementDescription

Column

The name of the column.

Click on the column name to drill through to the Column page.

Type

The column data type, e.g. string, date, long. This is the type that Upsolver automatically inferred on ingestion.

Density

The percentage of records that have a non-null value.

Top Values

The most common data values within this column.

Is Uniqe

True if all values in the sample data of the column are distinct and false if the field has repeated values.(Note: this value might be true despite the field containing repeating values as the calculation is based on sample data).

Distinct Values

The count of distinct values within the column. Values written before the METADATA_RETENTION period will not be counted.

Array

True if the field is an array or a sub-field of an array.

First Seen

The first date and time that a non-null value was seen in this column.

Last Seen

The last date and time that a non-null value was seen in this column.

Min

The minimum data value in this column.

Max

The maximum data value in this column.

Furthermore, if your table is in the data lake, you can download the dataset statistics to a table inspection report in CSV format for your own consumption. Click the Download icon to the right of the Search box to download the contents of the Ingested Data Statistics table to CSV. You can then perform your own discovery using this data.

Last updated