My Study Notes on Amazon Athena

(Photo by Alexander Sinn on Unsplash)


Briefing

Athena helps you analyze unstructured, semi-structured, and structured data stored in Amazon S3. Examples include CSV, JSON, or columnar data formats such as Apache Parquet and Apache ORC. You can use Athena to run ad-hoc queries using ANSI SQL, without the need to aggregate or load the data into Athena.
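As a rough illustration of what an ad-hoc query looks like in practice, here is a minimal sketch that submits SQL through boto3's Athena client and polls until the query finishes. The database name, S3 result location, and the helper names `build_query_request` and `run_query` are assumptions made for this example and are not part of Athena itself. It also assumes AWS credentials and a default region are already configured.

```python
import time


def build_query_request(sql, database, output_location):
    """Build the keyword arguments for Athena's StartQueryExecution call."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        # Athena writes result files to this S3 prefix.
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def run_query(sql, database, output_location, poll_seconds=1.0):
    """Submit a query, wait for it to finish, and return the raw result rows."""
    import boto3  # imported lazily so the helper above works without boto3

    client = boto3.client("athena")
    request = build_query_request(sql, database, output_location)
    query_id = client.start_query_execution(**request)["QueryExecutionId"]

    # Queries run asynchronously, so poll until a terminal state is reached.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query {query_id} ended in state {state}")
    # Only the first page of results is returned here.
    return client.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]
```

A call might look like `run_query("SELECT * FROM logs LIMIT 10", "demo_db", "s3://example-bucket/results/")`, where the table, database, and bucket are placeholders. The data stays in S3 the whole time; only the query text and the result location are sent to Athena.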

  • Serverless
  • Interactive query platform
  • Supported formats:
    • CSV
    • JSON
    • Avro
    • Apache Parquet (columnar)
    • Apache ORC (columnar)
  • Uses Presto (a distributed SQL query engine for big data)
  • Integrates with AWS Glue Data Catalog
  • Integrates with QuickSight for data visualization
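To make the format list more concrete, the sketch below shows how the table definition changes with the file format. It generates a `CREATE EXTERNAL TABLE` statement for data already sitting in S3. The helper name `build_create_table_ddl`, the table and column names, and the bucket path are all hypothetical. Avro is left out because it typically also needs the Avro schema supplied as a table or SerDe property.

```python
# Storage clause per file format (a small subset of what Athena accepts).
STORAGE_CLAUSES = {
    "csv": "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\nSTORED AS TEXTFILE",
    "json": "ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'",
    "parquet": "STORED AS PARQUET",
    "orc": "STORED AS ORC",
}


def build_create_table_ddl(table, columns, data_format, s3_location):
    """Return a CREATE EXTERNAL TABLE statement for files under s3_location.

    columns is a list of (name, type) pairs, e.g. [("id", "bigint")].
    """
    clause = STORAGE_CLAUSES[data_format.lower()]
    cols = ",\n  ".join(f"`{name}` {dtype}" for name, dtype in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n  {cols}\n)\n"
        f"{clause}\n"
        f"LOCATION '{s3_location}'"
    )


ddl = build_create_table_ddl(
    "logs",
    [("id", "bigint"), ("message", "string")],
    "parquet",
    "s3://example-bucket/logs/",  # placeholder bucket and prefix
)
print(ddl)
```

The table is only metadata (stored in the Glue Data Catalog when that integration is used); dropping it does not delete the underlying files in S3.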

Comparison

