Delta Tables Schema - Binary

Tip: See the Delta Egress Sink page to learn more about sending data to Delta Tables in general.

When this schema is used, a single table is created directly at the Directory Path specified in the Egress Route configuration.

The incoming message's payload can be any sequence of bytes:

  • Each message produces a single row in the Delta Table.
  • The payload can be up to 16 MiB in size. Larger payloads are discarded, but the corresponding Delta Table row is still created, with a null value in the payload column.
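The size rule above can be sketched as a small Python function. This is only a model of the documented behavior, not the platform's implementation; the names `MAX_PAYLOAD_BYTES` and `payload_column_value` are hypothetical and introduced here for illustration.

```python
from typing import Optional

# 16 MiB, the limit stated above (hypothetical constant name)
MAX_PAYLOAD_BYTES = 16 * 1024 * 1024


def payload_column_value(payload: bytes) -> Optional[bytes]:
    """Return the value the payload column would hold for this message.

    Models the documented rule only: an oversized payload is discarded,
    but the row itself is still written with a null payload.
    """
    if len(payload) > MAX_PAYLOAD_BYTES:
        # Payload discarded; the row's other columns are still populated.
        return None
    return payload
```

A consumer reading the table should therefore treat a null payload as "payload was dropped" rather than assuming every row carries bytes.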

Schema

Output tables always contain the following columns:

| Name | Type | Example | Description |
| --- | --- | --- | --- |
| payload | binary | b'{"property1": 42}' | Unmodified bytes of the message payload. |
| payload_content_type | string | application/json | Content type of the payload. It can be any string, but using one of the common MIME types is recommended. |
| kind | string | Message | Identifier that distinguishes between different kinds of events. Currently, this is always set to Message because no other event kinds are supported. |
| stream_group_name | string | group-a | Name of the stream group the message was sent to. |
| stream_name | string | telemetry | Name of the stream the message was sent to. |
| site_id | string | factory-a51 | ID of the site where the device was located when it sent the message. It is not always available, depending on the Stream configuration. |
| device_id | string | robot-125 | ID of the device that sent the message. |
| batch_id | string | 2023-12-19 | Identifier of the batch. It is provided by the device or auto-filled by the platform (if configured). |
| batch_slice_id | string | logs | Identifier of the batch slice (if it was provided by the device). |
| message_id | string | m00767 | Identifier of the message. It is provided by the device or auto-filled by the platform (if configured). |
| workspace_id | string | 69f09b3f-ec0d-4b9e-a5ec-87150b935296 | Identifier of the Workspace that the originating Device and Stream belong to. Formatted as a GUID/UUID: 32 lowercase hexadecimal digits separated by hyphens. |
| ingress_enqueued_date_time | timestamp | 2023-12-19T11:25:56.1408925+01:00 | Time when the Message was ingested by the platform. ISO 8601 format. |
| ingress_enqueued_date | date | 2023-12-05 | The UTC date derived from ingress_enqueued_date_time. |
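As an illustration of how two of these columns relate to the others, the Python sketch below derives a UTC date from an ISO 8601 timestamp (the relationship described for ingress_enqueued_date) and decodes a payload according to payload_content_type. The helper names are hypothetical, and the JSON handling is an assumption about how a consumer might interpret the bytes; the platform itself stores the payload unmodified.

```python
import json
import re
from datetime import date, datetime, timezone
from typing import Any, Optional


def parse_iso_timestamp(text: str) -> datetime:
    """Parse an ISO 8601 timestamp that carries a UTC offset."""
    # Trim fractional seconds to 6 digits, because older Python versions
    # reject longer fractions such as the 7-digit one in the example.
    trimmed = re.sub(r"(\.\d{6})\d+", r"\1", text)
    return datetime.fromisoformat(trimmed)


def utc_date(enqueued: datetime) -> date:
    """Convert an offset-aware timestamp to UTC and keep only the date."""
    return enqueued.astimezone(timezone.utc).date()


def decode_payload(payload: Optional[bytes], content_type: str) -> Any:
    """Interpret payload bytes using the content type (consumer-side assumption)."""
    if payload is None:
        return None  # e.g. the payload was discarded for exceeding the size limit
    if content_type == "application/json":
        return json.loads(payload)
    return payload  # unknown content types are returned as raw bytes
```

Because the date is taken in UTC, a message enqueued at 2023-12-20T00:30:00+01:00 would fall on 2023-12-19, not on the local calendar date.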

Spark SQL - interpretable schema

payload BINARY,
payload_content_type STRING,
kind STRING,
stream_group_name STRING,
stream_name STRING,
site_id STRING,
device_id STRING,
batch_id STRING,
batch_slice_id STRING,
message_id STRING,
workspace_id STRING,
ingress_enqueued_date_time TIMESTAMP,
ingress_enqueued_date DATE

Partition key columns

Tables are partitioned by the ingress_enqueued_date column.
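Filtering on the partition column lets the query engine skip the data of all other dates. The Python sketch below builds such a Spark SQL query as a string. It assumes the table is read through Delta Lake's path-based syntax (delta.`<path>`); the function name `daily_query` and the path in the usage example are hypothetical.

```python
from datetime import date


def daily_query(table_path: str, day: date) -> str:
    """Build a Spark SQL query that reads a single date partition."""
    if "`" in table_path:
        # A backtick would break out of the quoted path identifier.
        raise ValueError("table path must not contain backticks")
    return (
        "SELECT device_id, message_id, payload "
        f"FROM delta.`{table_path}` "
        f"WHERE ingress_enqueued_date = DATE'{day.isoformat()}'"
    )


# Usage with a hypothetical Directory Path:
query = daily_query("/mnt/egress/binary", date(2023, 12, 19))
```

The resulting string can be passed to a Spark session's SQL interface. Note that the filter value is a UTC date, matching how ingress_enqueued_date is derived.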