API content formats

This page provides a guide on how to configure different content formats in Upsolver using API calls.

Auto-Detect

Example

{
	"clazz" : "AutoDetectContentType"
}

Avro

Example

{
	"clazz" : "AvroContentType"
}

Parquet

Example

{
	"clazz" : "ParquetContentType"
}

ORC

Example

{
	"clazz" : "OrcContentType"
}

JSON

JSON data. Multiple JSONs can be read from a single file/record by appending them with optional whitespace in between.

Fields

Example

{
	"clazz" : "JsonContentType"
}

CSV

Fields

Example

{
	"clazz" : "CsvContentType",
	"inferTypes" : true,
	"header" : "header1,header2,header2"
}

TSV

Fields

Example

{
	"clazz" : "TsvContentType",
	"inferTypes" : true,
	"header" : "header1,header2,header2"
}

x-www-form-urlencoded

Fields

Example

{
	"clazz" : "WWWFormUrlEncodedType",
	"inferTypes" : true,
}

Protobuf

Fields

Example

{
	"clazz" : "ProtobufContentType",
	"schemaFiles" : "schemaFiles",
	"mainFile" : "mainFile",
	"messageType" : "messageType",
	"bytesParsers" : "bytesParsers"
}

Avro-record

Individual Avro records without the framing or schema.

Fields

Example

{
	"clazz" : "AvroRecordContentType",
	"schema" : "{ \"type\": \"record\", \"name\": 
\"root\", \"fields\": [ {\"name\": \"value\", \"type\": 
\"string\" } ] }"
}

Avro w/ Schema Registry

Individual Avro records with schema provided by Schema Registry.

Fields

Example

{
	"clazz" : "AvroSchemaRegistryContentType",
	"schemaRegistryUrl" : "schemaRegistryUrl"
}

XML

XML data. Multiple XMLs can be read from a single file/record by appending them with optional whitespace in between.

Fields

Example

{
	"clazz" : "XmlContentType",
}

Last updated