Loading...

Terminy kursów:

17.12.2018 - 21.12.2018
18.02.2019 - 22.02.2019
03.06.2019 - 07.06.2019

Oficjalny konspekt Microsoft Pozostałe szkolenia

MS-20775 Performing Data Engineering on Microsoft HD Insight

Czas trwania kursu: 5 dni
Cena: 2299zł

KLUCZOWE ZAGADNIENIA

  • wprowadzenie do HDInsight
  • wdrażanie klastrów HDInsight
  • przyznawanie dostepu do zasobów
  • ETL
  • analiza danych przy pomocy Spark SQL
  • przetwarzanie danych w czasie rzeczywistym
            Przemysław Rosłon, Microsoft Certified Trainer

 

 CERTYFIKACJE ZAWODOWE

Certyfikacja podstawowa CertyfIkacja specjalistyczna Certyfikacja ekspercka
FUNDAMENTALS ASSOCIATE EXPERT

 

Tematyka szkolenia pokrywa zagadnienia wymagane na ścieżce MCSA: Data Engineering with Azure
Dowiedz się więcej o certyfikacjach Microsoft


POZIOM SZKOLENIA

podstawowy średniozaawansowany zaawansowany ekspercki
100 200 300 400

 

TEMATYKA ZAJĘĆ

  • Module 1: Getting Started with HDInsight
    What is Big Data?
    Introduction to Hadoop
    Working with MapReduce Function
    Introducing HDInsight
    Lab : Working with HDInsight

 

  • Module 2: Deploying HDInsight Clusters
    Identifying HDInsight cluster types
    Managing HDInsight clusters by using the Azure portal
    Managing HDInsight Clusters by using Azure PowerShell
    Lab : Managing HDInsight clusters with the Azure Portal

 

  • Module 3: Authorizing Users to Access Resources
    Non-domain Joined clusters
    Configuring domain-joined HDInsight clusters
    Manage domain-joined HDInsight clusters
    Lab : Authorizing Users to Access Resources

 

  • Module 4: Loading data into HDInsight
    Storing data for HDInsight processing
    Using data loading tools
    Maximising value from stored data
    Lab : Loading Data into your Azure account

 

  • Module 5: Troubleshooting HDInsight
    Analyze HDInsight logs
    YARN logs
    Heap dumps
    Operations management suite
    Lab : Troubleshooting HDInsight

 

  • Module 6: Implementing Batch Solutions
    Apache Hive storage
    HDInsight data queries using Hive and Pig
    Operationalize HDInsight
    Lab : Implement Batch Solutions

 

  • Module 7: Design Batch ETL solutions for big data with Spark
    What is Spark?
    ETL with Spark
    Spark performance
    Lab : Design Batch ETL solutions for big data with Spark.

 

  • Module 8: Analyze Data with Spark SQL
    Implementing iterative and interactive queries
    Perform exploratory data analysis
    Lab : Performing exploratory data analysis by using iterative and interactive queries

 

  • Module 9: Analyze Data with Hive and Phoenix
    Implement interactive queries for big data with interactive hive.
    Perform exploratory data analysis by using Hive
    Perform interactive processing by using Apache Phoenix
    Lab : Analyze data with Hive and Phoenix

 

  • Module 10: Stream Analytics
    Stream analytics
    Process streaming data from stream analytics
    Managing stream analytics jobs
    Lab : Implement Stream Analytics

 

  • Module 11: Implementing Streaming Solutions with Kafka and HBase
    Building and Deploying a Kafka Cluster
    Publishing, Consuming, and Processing data using the Kafka Cluster
    Using HBase to store and Query Data
    Lab : Implementing Streaming Solutions with Kafka and HBase

 

  • Module 12: Develop big data real-time processing solutions with Apache Storm
    Persist long term data
    Stream data with Storm
    Create Storm topologies
    Configure Apache Storm
    Lab : Developing big data real-time processing solutions with Apache Storm

 

  • Module 13: Create Spark Streaming Applications
    Working with Spark Streaming
    Creating Spark Structured Streaming Applications
    Persistence and Visualization
    Lab : Building a Spark Streaming Application