BYTES_SUBSTRING
Returns a substring of the input value, using the offsets in bytes of the UTF-8 encoded byte representation. Partial characters and invalid UTF-8 code points are removed from the result.
Syntax
Use the following syntax to include the BYTES_SUBSTRING
function in your query:
Arguments
The BYTES_SUBSTRING
function requires the following arguments:
value
input is the string upon which you want to perform the substring function.startIndex
begins at1
and includes the first character in your string.endIndex
counts from the first character in the string, not from thestartIndex
position.
Name | Type | Description |
---|---|---|
value | string | The input value to perform the substring function on |
startIndex | int | The inclusive start index in bytes |
endIndex | int | The exclusive end index in bytes |
Returns
Returns a string
value.
Examples
The following example shows how the startIndex
and endIndex
arguments affect the Output
value. Here, the endIndex
value demonstrates that regardless of the startIndex
value, the endIndex
works from position 1 in the string to include 10 characters.
value | startIndex | endIndex | Output |
---|---|---|---|
'Hello World' | 1 | 10 |
|
'Hello World' | 2 | 10 |
|
'⻤Hello Wor⻤' | 2 | 10 |
|
'⻤Hello Wor⻤' | 1 | 10 |
|
'Hello' | 1 | 10 |
|
Transformation job example
The script below includes the BYTES_SUBSTRING
function in a SELECT
statement within a transformation job, using the "Hello World" string example.
SQL
Query result
value | startIndex | endIndex | Output |
---|---|---|---|
'Hello World' | 1 | 10 |
|
Last updated