Analyzing Big Data with Microsoft R

About This Course

Course Code
M20773

Course Type
Specialist

Vendor
Microsoft

Duration
3 Days

RRP
£1,656.00

Special Notices

Please note: Attend from Anywhere customers require an additional screen. The additional screen must be at least 19 inches in size with a minimum resolution of 1280x1024; the vertical resolution of 1024 is the most critical requirement.

Overview

The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis on a large dataset, and to show how to use it in big data environments such as a Hadoop or Spark cluster or a SQL Server database.

Audience profile

Objectives

After completing this course, students will be able to:

Course Outline

Module 1: Microsoft R Server and R Client
Explain how Microsoft R Server and Microsoft R Client work.

Lessons

Lab : Exploring Microsoft R Server and Microsoft R Client

Module 2: Exploring Big Data
At the end of this module, the student will be able to use R Client with R Server to explore big data held in different data stores.

Lessons

Lab : Exploring Big Data

Module 3: Visualizing Big Data
Explain how to visualize data by using graphs and plots.

Lessons

Lab : Visualizing data

Module 4: Processing Big Data
Explain how to transform and clean big data sets.

Lessons

Lab : Processing big data

Module 5: Parallelizing Analysis Operations
Explain how to implement options for splitting analysis jobs into parallel tasks.

Lessons

Lab : Using rxExec and RevoPemaR to parallelize operations

Module 6: Creating and Evaluating Regression Models
Explain how to build and evaluate regression models generated from big data.

Lessons

Lab : Creating a linear regression model

Module 7: Creating and Evaluating Partitioning Models
Explain how to create and score partitioning models generated from big data.

Lessons

Lab : Creating and evaluating partitioning models

Module 8: Processing Big Data in SQL Server and Hadoop
Explain how to transform and clean big data sets.

Lessons

Lab : Processing big data in SQL Server and Hadoop

Prerequisites

In addition to their professional experience, students who attend this course should have:

It is recommended that delegates review the following self-paced content to gain an introduction to the R language:

https://www.edx.org/course/introduction-r-data-science-microsoft-dat204x-5
