Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Default compression method for flat files #36

Closed
Labels
enhancementNew feature or requestquestionFurther information is requested
@DyfanJones

Description

@DyfanJones

CurrentlyRAthena andnoctua support gzip compression when uploading data to S3 and Athena. Is there a better compression algorithm for flat files?Top 10 Performance Tuning Tips for Amazon Athena

AlgorithmSplittable?Compression ratioCompress + Decompress speed
Gzip (DEFLATE)NoHighMedium
bzip2YesVery highSlow
LZONoLowFast
SnappyNoLowVery fast

For Athena, we recommend using either Apache Parquet or Apache ORC, which compress data by default and are splittable. When they are not an option, then try BZip2 or Gzip with an optimal file size.

From this it looks like BZIP2/GZIP are currently recommended. Might need to benchmark speed of BZip2 and GZIP files when reading from Athena

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestquestionFurther information is requested

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions


      [8]ページ先頭

      ©2009-2025 Movatter.jp