My Study Notes on Amazon Athena



Athena helps you analyze unstructured, semi-structured, and structured data stored in Amazon S3. Supported data includes CSV, JSON, and columnar formats such as Apache Parquet and Apache ORC. You can use Athena to run ad-hoc queries in ANSI SQL directly against the data in S3, without having to aggregate it or load it into Athena first.

  • Serverless
  • Interactive query platform
  • Supported formats:
    • CSV
    • JSON
    • Avro
    • Apache Parquet (columnar)
    • Apache ORC (columnar)
  • Uses Presto (a distributed SQL query engine for big data)
  • Integrates with AWS Glue Data Catalog
  • Integrates with QuickSight for data visualization
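
To make the serverless, ad-hoc query model above more concrete, here is a minimal Python sketch of submitting a SQL statement and polling for its result. It assumes a client object exposing the Athena API operations `start_query_execution`, `get_query_execution`, and `get_query_results` (for example one created with `boto3.client("athena")`). The helper names `build_query_request` and `run_query` are hypothetical, made up for these notes, and the sketch only reads the first page of results.

```python
import time


def build_query_request(sql, database, output_location):
    """Assemble keyword arguments for Athena's StartQueryExecution call.

    `database` is a database name in the data catalog; `output_location`
    is an S3 URI where Athena writes the result files.
    """
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Start a query, poll until it finishes, and return (state, rows)."""
    request = build_query_request(sql, database, output_location)
    query_id = client.start_query_execution(**request)["QueryExecutionId"]

    # Queries run asynchronously, so poll until a terminal state is reached.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        return state, []

    # First page of results only; every cell comes back as a string or None.
    result = client.get_query_results(QueryExecutionId=query_id)
    rows = [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in result["ResultSet"]["Rows"]
    ]
    return state, rows
```

With boto3 installed and AWS credentials configured, a call might look like `run_query(boto3.client("athena"), "SELECT * FROM my_table LIMIT 10", "my_database", "s3://my-bucket/athena-results/")`, where the table, database, and bucket names are placeholders. Passing the client in as a parameter keeps the polling logic easy to exercise with a stub instead of a live AWS account.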





Articles & Talks

  • How Small and Medium Businesses Can Develop a Modern Data Strategy, 2023-06-14, by John Walker, Dimple Dhar, and Kunle Adeleke
    • In the era of big data, small and medium-sized businesses (SMBs) often find themselves wrestling with a deluge of data from an ever-growing range of sources. According to Gartner, 60 percent of organizations do not measure the costs of poor data quality. A lack of measurement results in reactive responses to data quality issues, missed business growth opportunities, and increased risks.

