What is AWS Athena
What is AWS Athena?
Amazon Athena is an interactive query service that makes it simple to analyze data in Amazon S3 applying standard SQL. Athena is serverless, so there is no infrastructure to handle, and you pay just for the queries that you run.
Athena is simple to use. You simply had to point to your data in Amazon S3, establish the schema, and begin querying utilizing standard SQL. Most outcomes are conveyed in practically no time. With Athena, there's no requirement for complex ETL jobs to set up your data for analysis. This makes it easier for anybody with SQL skills to rapidly analyze large-scale datasets.
Athena is out-of-the-box integrated with AWS Glue Data Catalog, permitting you to generate a unified metadata storehouse across different services, crawl data sources to explore schemas, and populate your Catalog with new and remodeled table and partition definitions, and sustain schema versioning.
When should you use Athena?
Athena helps you examine unstructured, semi-structured, and structured data put stored in Amazon S3. Models include CSV, JSON, or columnar data formats like Apache Parquet and Apache ORC. You can utilize Athena to run ad-hoc utilizing ANSI SQL, without the need to aggregate or load the data into Athena.
Athena combines with Amazon QuickSight for simple data visualization. You can utilize Athena to produce reports or to investigate data with business intelligence tools or SQL clients associated with a JDBC or an ODBC driver.
Athena combines with the AWS Glue Data Catalog, which offers a determined metadata store for your data in Amazon S3. This enables you to make tables and inquiry data in Athena dependent on a central metadata store available throughout your AWS account and combined with the ETL and data discovery features of AWS Glue.
Benefits of Amazon Athena –
Start querying instantly: Serverless, no ETL
Athena is serverless. You can rapidly query your information without having to set up and deal with any servers or data distribution centers. Simply point to your data in Amazon S3, define the schema, and begin querying applying the built-in query editor. Amazon Athena permits you to tap into all your data in S3 without the need to set up complex procedures to extract, modify, and load the data (ETL).
Pay per query: Only pay for data scanned
With Amazon Athena, you pay just for the queries that you run. You are charged $5 per terabyte examined by your queries. You can spare from 30% to 90% on your per-query costs and get improved performance by compressing, partitioning, and changing your data into columnar formats. Athena queries data legitimately in Amazon S3. There are no extra storage charges beyond S3.
Open, powerful, standard: Built on Presto, run standard SQL
Amazon Athena uses Presto with ANSI SQL assistance and works with a variety of standard data formats, including CSV, JSON, ORC, Avro, and Parquet. Athena is perfect for swift, ad-hoc querying but it can also deal with complex analysis, including massive joins, window functions, and arrays. Amazon Athena is profoundly accessible and executes queries utilizing estimated resources over various facilities and various devices in every facility. Amazon Athena utilizes Amazon S3 as its underlying data store, making your data profoundly accessible and durable.
Fast, Really fast
Interactive performance even for large datasets
With Amazon Athena, you don't need to stress over having enough computed resources to get speedy, interactive query performance. Amazon Athena automatically executes inquiries parallelly, so most outcomes return in practically no time.
Partners
Upsolver is an industry-driving Data Lake Platform that engages any developer to manage, coordinate, and structure gushing data for analysis at extraordinary simplicity.
It is a data lake ETL service. It provides a visual, SQL-based interface for creating real-time tables in Athena with little engineering overhead and according to performance best practices. Upsolver's ETL also enables updates/deletes to tables in Athena for common CDC and compliance use cases.
Customers:
Movable Ink uses Amazon Athena to query seven years’ worth of historical data and get results in no time, with the flexibility to explore data for deeper insights.
Atlassian built a self-service data lake using Amazon Athena and other AWS Analytics services.
OLX improved time and reduced costs to the market by deploying Athena across their organization.