- Overview
- Audience
- Prerequisites
- Curriculum
Description:
This 5-day AWS Associate Data Engineer training is designed to provide participants with the skills required to design, build, and manage data pipelines on AWS. The program covers the end-to-end lifecycle of data engineering, from ingestion and storage to processing, analytics, and governance, leveraging AWS-native services.
The training begins with an overview of AWS global infrastructure, IAM roles, and storage fundamentals with S3 and CloudFront. Participants will then progress to databases and data warehousing with RDS and Redshift, ETL with Glue, querying with Athena, and data lake governance using Lake Formation. Advanced modules include real-time streaming with Kinesis, batch processing with AWS Batch, big data processing with EMR and Spark, and migration with DMS.
Hands-on labs are embedded throughout, ensuring participants gain practical experience in setting up secure environments, building data lakes, performing ETL, running analytics, and optimizing costs. By the end of the program, participants will be able to confidently implement scalable, secure, and cost-efficient AWS-based data engineering solutions, while preparing for AWS Associate-level certification paths
Duration: 5 Days
Course Code: BDT 516
Learning Objectives:
After this training, participants will be able to:
- Develop, deploy, and secure applications using AWS compute, storage, and database services.
- Build and manage serverless applications with AWS Lambda and Step Functions.
- Implement messaging, workflow, and integration services such as SQS, SNS, and SWF.
- Apply best practices for scalability, performance, and cost optimization in AWS applications.
This course is ideal for:
- Aspiring and practicing data engineers who want to build AWS cloud-native data pipelines
- Data analysts and developers transitioning into data engineering roles
- Cloud engineers working on storage, data processing, and analytics workloads
- Professionals preparing for AWS Associate-level Data Analytics or Data Engineering certifications
- Basic programming knowledge (Python, Node.js, or Java)
- Familiarity with fundamental cloud computing concepts
- Prior exposure to AWS Free Tier services (helpful but not mandatory)
- Comfort with command-line interfaces and scripting
Course Outline:
Module 1: Introduction to AWS
- Getting Started
- AWS at 10,000 feet
- History of AWS
- Overview of AWS tools and services
Module 2: Identity and Access Management (IAM)
- IAM 101
- Security Tokens
- Active Directory Federation
- Web Identity Federation
Module 3: EC2 and Setup
- EC2 basics
- Launching the first EC2 instances
- Using PuTTY with EC2
- Security groups
- Upgrading storage
Module 4: AWS CLI and SDKs
- CLI Commands
- Bash Scripting
- Using PHP SDK
Module 5: Load Balancing and Containers
- Elastic Load Balancer
- Amazon ECR
- Amazon ECS
Module 6: AWS Lambda
- Essentials of Lambda
- Creating and Testing a Node.js Lambda Function
- Creating and Testing a Python Lambda Function
- Building an AWS Lambda-enabled Browser Application
- Creating Node.js and Python State Machines with AWS Step Functions
Module 7: Amazon S3 and Storage Services
- S3 essentials
- Creating S3 buckets
- Hosting static websites on S3
- Building serverless webpages
- S3 versioning, lifecycle management, and cross-region replication
- Glacier for archiving
- CloudFront (CDN)
- S3 Security and Encryption
- Storage Gateway, Snowball, Transfer Acceleration
Module 8: Databases
- Database concepts and Amazon RDS
- Using MySQL as a DB
- Amazon Redshift
- DynamoDB essentials (tables, indexes, scan vs query)
Module 9: AWS IoT
- Introduction to AWS IoT
- Implementing IoT solutions
Module 10: Messaging and Workflow Services
- Simple Queue Service (SQS)
- Simple Notification Service (SNS)
- Simple Workflow Service (SWF)
Module 11: Application Deployment
- Elastic Beanstalk
- AWS CloudFormation
- AWS Data Pipeline
Module 12: Analytics
- AWS EMR
- AWS Kinesis
- AWS Redshift
- AWS Data Pipeline (advanced usage)
Module 13: Networking and Routing
- DNS and Route53
- Routing Policies: Simple, Weighted, Latency, Failover, Geo-location
- VPC overview and setup
- Custom VPC design
- NATs vs Bastion
- VPC Flow Logs
Training material provided: Yes (Digital format)




