Kafka data output
This article provides an introduction to Apache Kafka along with a guide on how to create a Kafka data output using Upsolver.
Apache Kafka is an open-source stream-processing software platform developed by the Apache Software Foundation, written in Scala and Java. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
1. Go to the Outputs page and click New.
2. Select Kafka as your output type.
3. Name your output and select whether the output should be Tabular or Hierarchical. After adding your Data Sources, click Next.
4. Click the information icon in the fields tree to view information about a field. The following will be displayed:
How many of the events in this data source include this field, expressed as a percentage (e.g. 20.81%).
The percentage distribution of the field values. These distribution values can be exported by clicking Export.
5. Click the information icon next to a hierarchy element (such as the overall data) to review the following metrics:
The number of fields in the selected hierarchy.
6. Click the plus icon in the fields tree to add a field from the data source to your output. This will be reflected under the Data Source Field in the Schema tab. If required, modify the Output Column Name.
7. Add any required calculated fields and review them in the Calculated Fields tab.
8. Add any required lookups and review them under the Calculated Fields tab.
9. Through the Filters tab, add a filter (similar to a WHERE clause in SQL) to the data source.
10. Click Make Aggregated to turn the output into an aggregated output. Read the warning before clicking OK and then add the required aggregation. This aggregation field will then be added to the Schema tab.
11. In the Aggregation Calculated Fields area under the Calculated Fields tab, add any required calculated fields on aggregations.
12. Click Run and fill out the following fields:
Kafka Hosts
Topic Name
Additional Kafka Properties (Optional)
Intermediate Storage: Where Upsolver will store the intermediate bulk files before loading them into Kafka
13. Click Next and complete the following:
Select the compute cluster to run the calculation on. Alternatively, click the drop-down and .
14. Finally, click Deploy to run the output. It will show as Running in the output panel. The output is now live in production and consumes compute resources.
You have now successfully outputted your data to your Kafka topic.
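The field statistics described in step 4 (the percentage of events that include a field, and the percentage distribution of its values) can be illustrated with a small Python sketch. This is only an approximation for intuition, not Upsolver's actual implementation: the function names and sample events are hypothetical, and it assumes the distribution is computed only over events that contain the field.

```python
from collections import Counter


def field_density(events, field):
    """Percentage of events that include `field` (e.g. 20.81)."""
    if not events:
        return 0.0
    present = sum(1 for event in events if field in event)
    return round(100.0 * present / len(events), 2)


def value_distribution(events, field):
    """Percentage share of each value among events that include `field`.

    Assumption: events missing the field are left out of the denominator.
    Values must be hashable (strings, numbers, and so on).
    """
    values = [event[field] for event in events if field in event]
    counts = Counter(values)
    total = len(values)
    return {value: round(100.0 * n / total, 2) for value, n in counts.items()}


# Hypothetical sample events; the field names are made up for illustration.
events = [
    {"user_id": 1, "country": "US"},
    {"user_id": 2, "country": "DE"},
    {"user_id": 3},
    {"user_id": 4, "country": "US"},
]

print(field_density(events, "country"))       # 75.0
print(value_distribution(events, "country"))  # {'US': 66.67, 'DE': 33.33}
```

A field with low density (present in only a few events) may produce many empty values in a Tabular output, so checking these numbers before adding the field in step 6 can help decide whether it belongs in the output.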