Big Data on AWS

Price
Net
VAT

Price
Price on Request

Duration
3 days

For companies and job seekers:
this course is 100% fundable!
 

Location

Course Language
English

Training Solutions
Online Live

Data volumes are growing rapidly and require scalable, secure, and intelligent cloud solutions. Modern data processing on AWS combines big data, analytics, and AI to form a powerful basis for informed decisions and automated processes.

Key topics

  • Big data architectures and data pipelines on AWS.
  • Processing large amounts of data with cloud-native services.
  • Scalability, security, and cost optimization.
  • Integration of analytics, machine learning, and AI.
  • Data visualization and data-based decision models.

Prerequisites
Basic knowledge of IT, data processing, or cloud environments is helpful.

Target audience
Suitable for specialists and managers from IT, data, analytics, and architecture, as well as anyone who wants to classify and evaluate data-driven solutions in the cloud.

The content taught strengthens understanding of modern big data strategies on AWS and supports the secure, efficient, and AI-oriented use of data as a strategic success factor.
 

Print as PDF
Course content
  • Definition and significance of big data
  • Setting up a data pipeline
  • Principles of modern data architecture
  • Overview: Data collection
  • Data transmission
  • Real-time streams with mass data
  • Amazon's Kinesis platform
  • Data flow via Kinesis Firehose
  • Video transmission via Kinesis Streams
  • Analytics with Kinesis Data Analytics
  • Overview of AWS storage types
  • Fundamentals of modern storage strategies
  • Criteria for choosing the right storage
  • Analysis of large amounts of data
  • Amazon Athena
  • Basics of Amazon EMR & Hadoop
  • Best practices for data collection
  • Using Amazon EMR
  • Architecture of Amazon EMR
  • Create and operate a program
  • Start and set up clusters
  • Evaluate results of completed jobs
  • Hadoop libraries
  • Alternative frameworks on Amazon EMR
  • Hue in Amazon EMR
  • Cluster Monitoring
  • Spark Basics
  • Using Spark
  • Basics of AWS Glue
  • Orchestrating Glue jobs
  • Data warehouses vs. traditional databases
  • Amazon Redshift Platform
  • Architecture of Amazon Redshift
  • Security for Amazon deployments
  • Overview Amazon EMR Security
  • Overview of AWS Identity & Access Management
  • Protecting Data Assets
  • Overview of Amazon Kinesis Security
  • Overview of Amazon DynamoDB Security
  • Overview of Amazon Redshift Security
  • Total cost with Amazon EMR
  • Amazon EC2 pricing options
  • Amazon Kinesis cost models
  • Budget focus with Amazon DynamoDB
  • Price overview for Amazon Redshift
  • AWS cost optimization
  • Data visualization in the big data environment
  • Amazon QuickSight in use
  • Workflow orchestration for big data
  • Central structures
  • What happens next?

Frequently asked questions

  • Big Data on AWS means storing, processing, and analyzing large amounts of data using Amazon Web Services. Scalable tools such as Amazon EMR, Redshift, and Kinesis are used for this purpose.
  • Particularly relevant for professionals in IT, data science, or business intelligence. Ideal for anyone who wants to analyze data professionally or make data-driven decisions.
  • Topics covered include data processing with Amazon EMR, data visualization with QuickSight, data storage in S3, streaming with Kinesis, security, and cost control.
  • Basic knowledge of cloud computing and a fundamental understanding of data processing are helpful. Experience with AWS or programming is an advantage, but not essential.
  • Demanded skills in AWS and big data tools increase your chances of finding jobs in areas such as cloud architecture, data engineering, or analytics—in companies of all sizes.
  • Yes, the training program concludes with an official AWS certification. This is internationally recognized as proof of practical knowledge in the field of big data and cloud technology.
  • AWS enables flexible scaling, high reliability, numerous analysis functions, and a wide range of services for different data formats, workflows, and real-time applications.

Do you have any further questions? Please contact us.