Breaking News: Grepper is joining You.com. Read the official announcement!
Check it out

Athena Performance optimization

Pragya Keshap answered on February 21, 2023 Popularity 2/10 Helpfulness 1/10

Contents


More Related Answers

  • database optimization
  • Indexing and Query Optimization:

  • Athena Performance optimization

    0

    top 10 performance tips (same best practices can be applied to EMR too)

    Storage optimisations:

    Partition your data (define virtual columns at table creation to keep related data together and reduce the amount of scanned data per query)

    Bucket your data within a single partition

    Use compression ( format like Apache Parquet or ORC are recommended because they compress data by default and are splittable - meaning that can be read in parallel by the execution engine)

    Optimise file sizes (files smaller than 128 MB could take longer due to the overhead of opening s3 files, listing directories , getting object metadata and so on).

    Optimise columnar data store generation

    Query optimisations:

    Optimise ORDER BY (by using Limit)

    Optimise joins (specifying larger table on the left side of the joins)

    Optimise GROUP BY (reducing columns in select to reduce amount of memory and by ordering the coluns by the highest cardinality - most unique values )

    Use approximate functions

    Only include the columns that you need  

    Popularity 2/10 Helpfulness 1/10 Language whatever
    Source: Grepper
    Link to this answer
    Share Copy Link
    Contributed on Feb 21 2023
    Pragya Keshap
    0 Answers  Avg Quality 2/10


    X

    Continue with Google

    By continuing, I agree that I have read and agree to Greppers's Terms of Service and Privacy Policy.
    X
    Grepper Account Login Required

    Oops, You will need to install Grepper and log-in to perform this action.