Spark

Tonic supports flat Parquet and Avro file processing via Spark. Currently, this feature is limited to flat files residing in Amazon S3. If you have flat files residing in other systems, such as Azure Data Lake, Azure Blob Store, HDFS, or DBFS, please reach out to [email protected].

Supported versions of Spark

Tonic supports Spark 2.4.x and Spark 3+. However, Spark 2.4.2 is not supported.
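If you're not sure which version your cluster runs, you can check it from a PySpark session. This is just a quick sanity check, not something Tonic requires:

from pyspark.sql import SparkSession

# Attach to (or start) a Spark session and print the running version.
spark = SparkSession.builder.getOrCreate()
print(spark.version)  # e.g. "3.1.2"; any 2.4.x except 2.4.2, or any 3.x, is supported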

Connecting to S3

Tonic requires both a source and an output S3 path. The source path should contain the flat files you intend to process, and the destination path is where the processed files will be placed. Tonic preserves the source folder structure in the output location.

Source path

Tonic assumes that your source path points to a folder or bucket whose direct children are the tables you wish to process. For example, for a customer with two tables, customers and orders, we would expect a folder structure such as:

  • data/

    • customers/

      • file1.parquet

      • file2.parquet

      • ....

    • orders/

      • file1.parquet

      • file2.parquet

      • ...

In the above example, your source path should be s3://<bucket name>/data

Destination path

The destination path should point to a folder in which Tonic will place the output in a sub-folder. The sub-folder is named with a GUID that identifies the specific job that generated the data.

For example, if your output path is s3://tonic-output/v1, then Tonic will create a sub-folder that contains all of the tables. So, in the above example with the customers and orders tables, your output hierarchy will look like this:

  • tonic-output/

    • v1/

      • some-guid/

        • customers/

          • file1.parquet

          • file2.parquet

          • ....

        • orders/

          • file1.parquet

          • file2.parquet

          • ...
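Because the sub-folder name is a GUID generated per job, you may want to enumerate the destination prefix after a run to locate the newest output. A minimal boto3 sketch, using the example bucket and prefix above (swap in your own values):

import boto3

s3 = boto3.client("s3")

# List the immediate "directories" under the destination path; each one is a job GUID.
resp = s3.list_objects_v2(Bucket="tonic-output", Prefix="v1/", Delimiter="/")
for p in resp.get("CommonPrefixes", []):
    print(p["Prefix"])  # e.g. "v1/<some-guid>/"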

Partitioned datasets

Additionally, Tonic supports partitioned tables, such as the one shown below:

  • data/

    • customers/

      • file1.parquet

      • file2.parquet

      • ....

    • orders/

      • dt=2020-01-01

        • hr=0

          • file1.parquet

          • file2.parquet

          • ...

        • hr=1

          • file1.parquet

          • file2.parquet

          • ...

      • dt=2020-01-02

        • ...

In the above example, Tonic will automatically recognize that the orders table is partitioned on the "dt" and "hr" columns and will preserve that partitioning in your output folder.
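For reference, the sketch below shows how Spark itself maps a Hive-style dt=/hr= layout to partition columns on read and reproduces it on write. It is only an illustration of the convention, not Tonic's implementation; paths follow the example above:

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Reading the table root discovers dt and hr as partition columns.
orders = spark.read.parquet("s3a://<bucket name>/data/orders")
orders.printSchema()  # includes dt and hr alongside the columns stored in the files

# Writing with partitionBy reproduces the same dt=/hr= folder hierarchy.
orders.write.partitionBy("dt", "hr").parquet("s3a://<bucket name>/output/orders")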

S3 Server-Side Encryption

If your buckets have server-side encryption enabled via KMS, then your Spark cluster must have Hadoop 2.8.1+ installed.
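If you manage the cluster's Hadoop configuration yourself, SSE-KMS is typically enabled through the S3A connector properties. A sketch assuming you use the s3a:// filesystem; the key ARN is a placeholder:

from pyspark.sql import SparkSession

# Tell the S3A connector (available in Hadoop 2.8.1+) to write objects with SSE-KMS.
spark = (
    SparkSession.builder
    .config("spark.hadoop.fs.s3a.server-side-encryption-algorithm", "SSE-KMS")
    .config("spark.hadoop.fs.s3a.server-side-encryption.key", "<kms key arn>")
    .getOrCreate()
)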

Connecting to your Spark Cluster

Tonic can connect to your Spark cluster, whether on-prem or in the cloud, in a variety of ways. Currently, we support Amazon's managed Spark service, EMR, as well as connecting directly to your cluster by SSH-ing into its master node.

Amazon EMR

Amazon EMR is Amazon's managed Spark cluster service. Tonic has been tested on EMR v5.28.0+; however, earlier versions should also work, assuming they come packaged with a supported version of Spark. When possible, please use EMR v6+.

Connecting to EMR via SSH

Tonic can connect to any Spark cluster, including EMR, via SSH. The SSH connection information for EMR can be found on the Summary tab of your EMR cluster's console page.

You'll need to provide the DNS name of the cluster, the user name (always hadoop), the port (always 22), and the SSH private key that you selected when you set up the cluster.
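You can verify these details independently before entering them into Tonic. A minimal sketch using paramiko; the host name and key path are placeholders:

import paramiko

# Confirm the master node accepts the same credentials Tonic will use.
client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
client.connect(
    hostname="<master public DNS>",
    port=22,
    username="hadoop",
    key_filename="/path/to/cluster-key.pem",
)
_, stdout, stderr = client.exec_command("spark-submit --version")
print(stderr.read().decode())  # spark-submit prints its version banner to stderr
client.close()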

Connecting to EMR via Steps

Amazon EMR supports launching Spark jobs through the EMR Steps API. With this approach, you only need to provide Tonic with the cluster ID of your EMR cluster. The cluster ID can be found on the EMR Clusters console page and always begins with "j-".
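For context, submitting a Spark job through the Steps API looks roughly like the sketch below. This is only an illustration of the mechanism, not Tonic's exact submission; the cluster ID, region, and script path are placeholders:

import boto3

emr = boto3.client("emr", region_name="us-east-1")

# Submit a Spark job to an existing cluster as an EMR step.
response = emr.add_job_flow_steps(
    JobFlowId="j-XXXXXXXXXXXXX",
    Steps=[
        {
            "Name": "example-spark-step",
            "ActionOnFailure": "CONTINUE",
            "HadoopJarStep": {
                "Jar": "command-runner.jar",
                "Args": ["spark-submit", "s3://<bucket name>/jobs/example_job.py"],
            },
        }
    ],
)
print(response["StepIds"])  # poll with emr.describe_step() to track the step's progress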

Generic SSH

Tonic can connect to Spark clusters when given SSH connection information for the cluster's master node.

Permissions

Permissions vary depending on your specific setup. Currently, Tonic only supports permissions based on IAM users and requires you to provide Tonic with the user's IAM access key ID and secret access key.

Permissions for S3 Access

These specific permissions are required regardless of your setup.

{
  "Sid": "VisualEditor0",
  "Effect": "Allow",
  "Action": "s3:ListBucket",
  "Resource": [
    "arn:aws:s3:::<bucket of source path>",
    "arn:aws:s3:::<bucket of destination path>"
  ]
},
{
  "Sid": "VisualEditor1",
  "Effect": "Allow",
  "Action": [
    "s3:GetObject"
  ],
  "Resource": [
    "arn:aws:s3:::<source path>/*",
    "arn:aws:s3:::<source path>/",
    "arn:aws:s3:::<destination path>/*",
    "arn:aws:s3:::<destination path>/"
  ]
},
{
  "Sid": "VisualEditor2",
  "Effect": "Allow",
  "Action": "s3:PutObject",
  "Resource": [
    "arn:aws:s3:::<destination path>/*",
    "arn:aws:s3:::<destination path>"
  ]
}

Permissions for EMR Steps

These permissions are only required if you are using the EMR Steps API.

{
  "Sid": "VisualEditor3",
  "Effect": "Allow",
  "Action": "elasticmapreduce:ListClusters",
  "Resource": "*"
},
{
  "Sid": "VisualEditor4",
  "Effect": "Allow",
  "Action": [
    "elasticmapreduce:DescribeStep",
    "elasticmapreduce:AddJobFlowSteps",
    "elasticmapreduce:DescribeCluster"
  ],
  "Resource": [
    "arn:aws:elasticmapreduce:<aws region, e.g. us-east-1>:<aws account id>:cluster/<cluster id>"
  ]
}

Permissions for AWS Glue

For Hive+Spark with AWS Glue, you'll need the permissions below to connect to your source Hive database.

{
  "Sid": "VisualEditor1",
  "Effect": "Allow",
  "Action": [
    "glue:GetDatabase",
    "glue:GetTables",
    "glue:GetTable",
    "glue:CreateDatabase"
  ],
  "Resource": [
    "arn:aws:glue:<aws region, e.g. us-east-1>:<aws account id>:catalog",
    "arn:aws:glue:<aws region, e.g. us-east-1>:<aws account id>:table/*/*",
    "arn:aws:glue:<aws region, e.g. us-east-1>:<aws account id>:database/*"
  ]
}

You can optionally write your data to an output AWS Glue database by specifying one in Tonic. If you do this, you'll need to add the permissions below as well.

{
  "Sid": "VisualEditor2",
  "Effect": "Allow",
  "Action": [
    "glue:DeleteDatabase"
  ],
  "Resource": [
    "arn:aws:glue:<aws region>:<aws account>:catalog",
    "arn:aws:glue:<aws region>:<aws account>:database/<output database name>",
    "arn:aws:glue:<aws region>:<aws account>:userDefinedFunction/<output database name>/*",
    "arn:aws:glue:<aws region>:<aws account>:table/<output database name>/*"
  ]
}

Server-Side Encryption with KMS

You'll need to add decrypt and encrypt permissions to your account for both the source and destination paths.

Additionally, if you are using the EMR Steps API, then the EMR role assigned to your cluster must be given Decrypt access to the KMS key used on the output bucket.

Logs

Logging of Spark jobs is more limited than for other databases Tonic supports. This is due to the distributed and managed nature of Spark clusters.

Logging jobs launched via EMR Steps

The Jobs page, available on Tonic's left sidebar, will indicate whether a job was successfully submitted to the EMR cluster, but it will not report the status of the job itself. To monitor the job's status, use the EMR console.

Logging jobs launched via an SSH connection (including to EMR)

The Jobs page, available on Tonic's left sidebar, will provide information on the job's status as it runs and will additionally provide a Tracking URL once the job has started. You can follow this Tracking URL to Spark's management portal, where you can find additional, more detailed logs.