5 docs tagged with "data"

Airflow

Apache Airflow is an open-source Python framework used to programmatically author, schedule, and monitor workflows. It is primarily used for extract, transform, and load (ETL) data pipelines.

AWS Athena

AWS Athena is a serverless query service that runs SQL queries directly against files held in S3.

Azure Data Factory

Azure Data Factory is a managed, serverless ETL tool with a drag-and-drop UI for use in the Azure cloud. It is a good product but lacks maturity in some areas, mainly the UI itself.