This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS CDK applications.
-
Updated
Dec 21, 2021 - Python
This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS CDK applications.
This repository contains source code for the AWS Database Blog Post Reduce data archiving costs for compliance by automating RDS snapshot exports to Amazon S3
Automation framework to catalog AWS data sources using Glue
ETL Data pipeline using aws services
Terraform configuration that creates several AWS services, uploads data in S3 and starts the Glue Crawler and Glue Job.
Smart City Realtime Data Engineering Project
It is a project build using ETL(Extract, Transform, Load) pipeline using Spotify API on AWS.
AWS Athena, Glue Database, Glue Crawler and S3 buckets deployment through AWS GUI console.
Analyzed a multicategory e-commerce store using big data techniques on a Kaggle dataset with the help of AWS EC2, AWS S3, PySpark, AWS Glue ETL, AWS Athena, AWS CloudFormation, AWS Lambda and Power BI!
In this project I have used the Trending YouTube Video Statistics data from Kaggle to analyze and prepare it for usage.
This project automates the extraction, transformation, and loading (ETL) of Reddit data into a Redshift data warehouse using Airflow. Key technologies include Celery, PostgreSQL, S3, Glue, Athena, and Redshift, providing a complete data pipeline solution.
An end-to-end data pipeline built with AWS S3, Glue, Crawler, Athena, Tableau visulization
Creating an audit table for a DynamoDB table using CloudTrail, Kinesis Data Stream, Lambda, S3, Glue and Athena and CloudFormation
Unlocking Spotify insights with an AWS data pipeline: S3 data lake, Glue ETL, Athena queries, and QuickSight dashboards.
AWS Athena, Glue Database, Glue Crawler and S3 buckets deployment through CloudFormation stack on AWS console.
An end-to-end solution for managing and analyzing YouTube video data from Kaggle, leveraging AWS services and visualized through Quicksight and Tableau
Working with Glue Data Catalog and Running the Glue Crawler On Demand
Unveiling job market trends with Scrapy and AWS
Implementing data pipeline using AWS services for airlines data
Working with Glue Data Catalog and running the using S3 Event Notification and creating the entire stack using AWS CloudFormation
Add a description, image, and links to the aws-glue-crawler topic page so that developers can more easily learn about it.
To associate your repository with the aws-glue-crawler topic, visit your repo's landing page and select "manage topics."