System requirements for Databricks

Supported versions of Databricks

Tonic Structural supports Spark 2.4.x, Spark 3.0.x, and Spark 3.1.x. Spark 2.4.2 is not supported.

Any version of Databricks that runs one of those Spark versions should be compatible. Structural has been specifically tested against Databricks versions 9.1 and 10.4.
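As a quick illustration of the version rules above, the following is a minimal sketch of a check that a Spark version string falls into one of the listed release lines. The function name and the constants are hypothetical and are not part of Structural. The sketch assumes version strings of the form major.minor.patch.

```python
import re

# Spark release lines listed as supported above, as (major, minor).
SUPPORTED_LINES = {(2, 4), (3, 0), (3, 1)}
# Specific releases called out as not supported.
UNSUPPORTED_RELEASES = {(2, 4, 2)}


def is_supported_spark_version(version: str) -> bool:
    """Return True if `version` is in a supported Spark release line."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", version.strip())
    if match is None:
        return False  # not a major.minor.patch string
    major, minor, patch = (int(part) for part in match.groups())
    if (major, minor, patch) in UNSUPPORTED_RELEASES:
        return False
    return (major, minor) in SUPPORTED_LINES
```

In a Databricks notebook, the string to check could come from `spark.version`, for example `is_supported_spark_version(spark.version)`.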

Supported providers

Structural supports the following data providers:

Source Provider    Output Provider
---------------    ---------------
Parquet            Parquet
CSV                Parquet
Avro               Avro
JSON               JSON
ORC                ORC
Delta              Delta
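The source-to-output pairing listed above can be expressed as a small lookup, which may be handy when planning where output data will land. This is only a sketch: the dictionary and function names are hypothetical, and the lowercase provider keys are an assumption made for the example.

```python
# Source provider -> output provider, as listed above.
# Note that CSV sources are written out as Parquet.
OUTPUT_PROVIDER = {
    "parquet": "parquet",
    "csv": "parquet",
    "avro": "avro",
    "json": "json",
    "orc": "orc",
    "delta": "delta",
}


def output_provider_for(source_provider: str) -> str:
    """Return the output provider paired with a source provider."""
    key = source_provider.strip().lower()
    if key not in OUTPUT_PROVIDER:
        raise ValueError(f"Unsupported source provider: {source_provider!r}")
    return OUTPUT_PROVIDER[key]
```

For example, `output_provider_for("CSV")` returns `"parquet"`, while an unlisted provider raises `ValueError`.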

Supported table types

Databricks supports both MANAGED and EXTERNAL tables.

  • MANAGED tables store all of their data within Databricks.

  • EXTERNAL tables store their data on a separate file system (often S3).

Structural can read from both table types. When it writes output data, Structural only writes to EXTERNAL tables.
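To see which tables in a database are MANAGED and which are EXTERNAL, the table types can be grouped as in the sketch below. The helper name is hypothetical and not part of Structural. It assumes the input is a sequence of (table name, table type) pairs.

```python
from typing import Iterable, List, Tuple


def split_by_table_type(
    tables: Iterable[Tuple[str, str]]
) -> Tuple[List[str], List[str]]:
    """Split (name, table_type) pairs into (managed, external) name lists."""
    managed: List[str] = []
    external: List[str] = []
    for name, table_type in tables:
        kind = table_type.strip().upper()
        if kind == "MANAGED":
            managed.append(name)
        elif kind == "EXTERNAL":
            external.append(name)
        # Any other type (for example a view) is ignored.
    return managed, external
```

In PySpark, the pairs could be built from the catalog, for example `split_by_table_type((t.name, t.tableType) for t in spark.catalog.listTables("my_db"))`, where `my_db` is a placeholder database name.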
