Supported file and content types

A file group can contain files with CSV, XML, JSON, Parquet, or Avro content.

The file connector can read files that are ASCII encoded.
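If a file's encoding is uncertain, it can be checked before the file is added to a file group. The following is a minimal sketch in Python, not part of the product: the function name `is_ascii_file` and the chunk size are illustrative assumptions. It treats a file as ASCII encoded when every byte falls in the 7-bit range.

```python
def is_ascii_file(path, chunk_size=1 << 20):
    """Return True if every byte in the file is 7-bit ASCII.

    Hypothetical helper for pre-checking files; the file connector
    itself may apply different or additional rules.
    """
    with open(path, "rb") as handle:
        # Read in chunks so large files are not loaded into memory at once.
        while chunk := handle.read(chunk_size):
            if not chunk.isascii():
                return False
    return True
```

An empty file passes this check because it contains no non-ASCII bytes.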

Allowed file types

A file group can include the following file types:

  • .csv, .tsv, .xml, .json, .parquet, and .avro files.

  • .txt files that contain CSV, XML, or JSON content.

  • .gzip files that contain compressed CSV, XML, or JSON content. .gzip files are supported only in workspaces that use files from cloud storage, not in workspaces that use local files.
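The extension rules above can be sketched as a small pre-check. This is an illustrative Python example, not product code: the names `extension_allowed` and `uses_cloud_storage` are assumptions. It looks only at the file name, so it does not confirm that a .txt or .gzip file actually contains CSV, XML, or JSON content, and it does not detect compressed Parquet files.

```python
from pathlib import PurePosixPath

# Extensions accepted regardless of where the workspace reads files from.
ALWAYS_ALLOWED = {".csv", ".tsv", ".xml", ".json", ".parquet", ".avro", ".txt"}

def extension_allowed(file_name, uses_cloud_storage):
    """Return True if the file name's extension is one of the allowed types.

    Hypothetical helper: `uses_cloud_storage` should be True when the
    workspace uses files from cloud storage, False for local files.
    """
    suffix = PurePosixPath(file_name).suffix.lower()
    if suffix in ALWAYS_ALLOWED:
        return True
    if suffix == ".gzip":
        # .gzip is accepted only for cloud storage workspaces.
        return uses_cloud_storage
    return False
```

For example, `extension_allowed("orders.gzip", uses_cloud_storage=False)` returns `False`, while the same file name with `uses_cloud_storage=True` returns `True`.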

Parquet file limitations

For Parquet files:

  • The files must use plain encoding.

  • The files must be uncompressed. For example, you cannot select a .snappy.parquet file.

  • You cannot select files with the following Parquet data types:

    • HalfFloat

    • Struct

    • Union

    • Dictionary

    • Map

    • List

    • FixedSizeList

    • Arrays of any type

Avro file limitations

For Avro files, you cannot select files with the following Avro data types:

  • Map

  • Record
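Because an Avro schema is itself JSON, a schema can be scanned for these types before a file is selected. The sketch below is a hedged illustration in Python using only the standard library. It assumes that the file's top-level schema is a record describing each row and that the limitation applies to the types of that record's fields; the source does not spell this out. The function names are hypothetical.

```python
import json

UNSUPPORTED_AVRO_TYPES = {"map", "record"}

def _contains_unsupported(avro_type):
    """Return True if a field's type declaration uses map or record."""
    if isinstance(avro_type, str):
        # A bare type name. References to named record types defined
        # elsewhere in the schema are NOT resolved by this sketch.
        return avro_type in UNSUPPORTED_AVRO_TYPES
    if isinstance(avro_type, list):
        # A union such as ["null", {...}]: check every branch.
        return any(_contains_unsupported(branch) for branch in avro_type)
    if isinstance(avro_type, dict):
        if avro_type.get("type") == "array":
            return _contains_unsupported(avro_type.get("items"))
        return _contains_unsupported(avro_type.get("type"))
    return False

def unsupported_fields(schema_json):
    """List the names of top-level fields whose types use map or record."""
    schema = json.loads(schema_json)
    return [
        field["name"]
        for field in schema.get("fields", [])
        if _contains_unsupported(field["type"])
    ]
```

An empty result means that no map or record field types were found under these assumptions. It does not guarantee that the file can be selected.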
