Glue scripts for converting AWS Service Logs for use in Athena
-
Updated
Feb 1, 2024 - Python
Glue scripts for converting AWS Service Logs for use in Athena
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
Build and deploy a serverless data pipeline on AWS with no effort.
Extract, transform, and load data for analytic processing using AWS Glue
This is a data pipeline built with the purpose of serving a business team.
Terraform configuration that creates several AWS services, uploads data in S3 and starts the Glue Crawler and Glue Job.
Terraform module which creates Glue Job resources on AWS.
This project outlines the final project requirements for Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.
Pipeline ETL na AWS
Terraform module to create and manage a AWS Glue job
Data Engineering project using data streaming produced by python applications, ETL process and availability for ad-hoc SQL queries in the AWS cloud
This project creates a serverless data pipeline to extract data from the Colombo Stock Market ASI Index API using AWS Lambda, Kinesis Firehose, and S3. An AWS Glue workflow processes and transforms the data, storing it in an Apache Iceberg table via Athena and Glue ETL jobs.
IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
DeepLearning.AI & AWS Data Engineering Course Exercises
This project is an end-to-end, fully automated warehouse management solution designed to tackle real-world inventory challenges in the FMCG sector. From real-time data ingestion and predictive analytics to interactive dashboards, this project combines cutting-edge technologies and an event-driven architecture to simulate a business-ready system.
ETL using application streaming and creating a Data Lake
Add a description, image, and links to the glue-job topic page so that developers can more easily learn about it.
To associate your repository with the glue-job topic, visit your repo's landing page and select "manage topics."