System requirements for Amazon EMR
The following table lists the supported versions of Amazon EMR, along with the corresponding versions of Spark.
We recommend EMR-6.1.0 with Spark 3.0.0 or EMR-6.2.0 with Spark 3.0.1.
Amazon EMR version    Spark version
6.2.x                 3.0.1
6.1.x                 3.0.0
6.0.x                 2.4.4
5.36.x                2.4.8
5.35.x                2.4.8
5.34.0                2.4.8
5.33.x                2.4.7
5.32.x                2.4.7
5.31.x                2.4.6
5.30.x                2.4.5
5.29.0                2.4.4
5.28.x                2.4.4
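As an illustration only, the table above can be expressed as a small lookup that checks an EMR release label before a cluster is created. This is a minimal sketch: the names `EMR_TO_SPARK`, `spark_version_for`, and `is_recommended` are hypothetical and are not part of Structural or any AWS SDK. It assumes release labels of the form `emr-6.2.0`.

```python
# Supported Amazon EMR release series mapped to the Spark version,
# transcribed from the table above. Names here are illustrative only.
EMR_TO_SPARK = {
    "6.2": "3.0.1",
    "6.1": "3.0.0",
    "6.0": "2.4.4",
    "5.36": "2.4.8",
    "5.35": "2.4.8",
    "5.34": "2.4.8",
    "5.33": "2.4.7",
    "5.32": "2.4.7",
    "5.31": "2.4.6",
    "5.30": "2.4.5",
    "5.29": "2.4.4",
    "5.28": "2.4.4",
}

# The table lists these series as x.y.0 only, not x.y.x.
ZERO_PATCH_ONLY = {"5.34", "5.29"}

# Releases that this page recommends.
RECOMMENDED = {"6.1.0", "6.2.0"}


def _normalize(release_label):
    """Strip whitespace and an optional 'emr-' prefix, case-insensitively."""
    version = release_label.strip().lower()
    if version.startswith("emr-"):
        version = version[4:]
    return version


def spark_version_for(release_label):
    """Return the Spark version for a release label such as 'emr-6.2.0',
    or None if the release is not in the supported table."""
    parts = _normalize(release_label).split(".")
    if len(parts) < 2:
        return None
    series = ".".join(parts[:2])
    patch = parts[2] if len(parts) > 2 else None
    if series in ZERO_PATCH_ONLY and patch not in (None, "0"):
        return None
    return EMR_TO_SPARK.get(series)


def is_recommended(release_label):
    """True only for the two releases that this page recommends."""
    return _normalize(release_label) in RECOMMENDED
```

For example, `spark_version_for("emr-6.2.0")` returns `"3.0.1"`, and a release that is absent from the table, such as `emr-6.3.0`, returns `None`.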
Structural supports the following data providers:
Parquet
CSV
Avro
JSON
ORC
Structural requires a metadata catalog when connecting to your data. Currently, AWS Glue is the only supported catalog when working with Amazon EMR.
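For context, AWS documents a `spark-hive-site` configuration classification that makes Spark on EMR use the AWS Glue Data Catalog as its metastore. The sketch below builds that classification as JSON, which can be supplied when a cluster is created. It is an assumption that this is the relevant cluster setting for Structural; this page does not say which cluster configuration Structural expects. The names `GLUE_CATALOG_CONFIGURATION` and `configurations_json` are hypothetical.

```python
import json

# EMR configuration classification that AWS documents for using the
# AWS Glue Data Catalog as the Spark SQL metastore. Whether Structural
# needs additional settings is not stated on this page.
GLUE_CATALOG_CONFIGURATION = [
    {
        "Classification": "spark-hive-site",
        "Properties": {
            "hive.metastore.client.factory.class": (
                "com.amazonaws.glue.catalog.metastore."
                "AWSGlueDataCatalogHiveClientFactory"
            ),
        },
    }
]


def configurations_json(configurations=None):
    """Serialize the configuration list to JSON for cluster creation."""
    if configurations is None:
        configurations = GLUE_CATALOG_CONFIGURATION
    return json.dumps(configurations, indent=2)


if __name__ == "__main__":
    print(configurations_json())
```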
Structural writes data to Amazon S3 only. Structural does not write output data back into a catalog.
If your S3 buckets have server-side encryption enabled through AWS KMS, then your Spark cluster must have Hadoop 2.8.1 or later installed.
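The Hadoop version requirement can be checked with a short comparison. The following is a minimal sketch, assuming the version string comes from something like the first line of `hadoop version` output on the cluster (for example `Hadoop 2.10.1-amzn-0`). The function names are hypothetical and are not part of Structural.

```python
import re

# Minimum Hadoop version that this page requires for KMS-encrypted buckets.
MIN_HADOOP = (2, 8, 1)


def parse_hadoop_version(text):
    """Extract (major, minor, patch) from a string such as
    'Hadoop 2.10.1-amzn-0' or '2.8.1'. A missing patch counts as 0."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError("no version number found in %r" % text)
    major, minor, patch = match.groups()
    return (int(major), int(minor), int(patch or 0))


def meets_kms_requirement(hadoop_version):
    """True if the Hadoop version is 2.8.1 or later."""
    return parse_hadoop_version(hadoop_version) >= MIN_HADOOP
```

Comparing integer tuples rather than raw strings avoids ordering `2.10.1` before `2.8.1`, which a plain string comparison would do.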