#spark
-
A Deep Dive into the Latest Performance Improvements of Stateful Pipelines in Apache Spark Structured Streaming
-
Use Result Fragment Caching with EMR runtime for Apache Spark to boost query performance by up to 15x
-
Amazon EMR 6.6 adds support for Apache Spark 3.2, HUDI 0.10.1, Iceberg 0.13, Trino 0.367, PrestoDB 0.267, and more
-
Amazon SageMaker Processing now supports built-in Spark containers for big data processing
-
Amazon EMR now provides up to 30% lower cost and up to 15% improved performance for Spark workloads on Graviton2-based instances
-
AWS Glue now supports workload partitioning to further improve the reliability of Spark applications