Skip to content

Latest commit

 

History

History
76 lines (55 loc) · 3.48 KB

Lab1.md

File metadata and controls

76 lines (55 loc) · 3.48 KB

Lab 1: Load Data into Azure SQL Data Warehouse using Azure Data Factory Pipelines

In this lab you will configure the Azure environment to allow relational data to be transferred from a SQL Server 2017 database to an Azure SQL Data Warehouse database using Azure Data Factory. The dataset you will use contains data about motor vehicle collisions that happened in New York City from 2012 to 2019. You will use Power BI to visualise collision data loaded from Azure SQL Data Warehouse.

The estimated time to complete this lab is: 60 minutes.

Lab Architecture

Lab Architecture

IMPORTANT: Some of the Azure services provisioned require globally unique name and a “-suffix” has been added to their names to ensure this uniqueness. Please take note of the suffix generated as you will need it for the following resources:

Name Type
mdwcosmosdb-suffix Cosmos DB account
MDWDataFactory-suffix Data Factory (V2)
mdwdatalakesuffix Storage Account
MDWEventHubs-suffix Event Hubs Namespace
MDWKeyVault-suffix Key vault
mdwsqlvirtualserver-suffix SQL server
MDWStreamAnalytics-suffix Stream Analytics job

What is your suffix?
suffix: suffix here

Connect to MDWDesktop

In this section you are going to establish a Remote Desktop Connection to MDWDesktop virtual machine. This Azure VM is where we will install Azure Data Studio and Power BI.

IMPORTANT
Execute these steps on your host computer
  1. In the Azure Portal, navigate to the MDW-Lab resource group and click the MDWDesktop virtual machine.

  2. On the MDWDesktop blade, from the Overview menu, click the Connect button.

  3. On the Connect to virtual machine blade, click Download RDP File. This will download a .rdp file that you can use to establish a Remote Desktop Connection with the virtual machine.

Install required software onto MDWDesktop

In this section you are going to install Power BI Desktop and Azure Data Studio on MDWDesktop vm. If you have this software on your laptop already you can skip this section. You can also install this software on your laptop instead.

![](./Media/Lab1-Image04.jpg)
IMPORTANT
Execute these steps inside the MDWDesktop remote desktop connection
  1. Once the RDP file is downloaded, click on it to establish an RDP connection with MDWDesktop

  2. User the following credentials to authenticate:
    - User Name: MDWAdmin
    - Password: P@ssw0rd123!

  3. Once logged in, accept the default privacy settings.

  4. Using the browser, download and install the latest version of following software. During the setup, accept all default settings:

    Azure Data Studio (User Installer)
    https://docs.microsoft.com/en-us/sql/azure-data-studio/download
    we will use this to query the data. You could also download SSMS instead, if that is more familiar for you.


    Power BI Desktop (64-bit)
    https://www.microsoft.com/en-us/download/details.aspx?id=45331
    we will use this for visualization and querying

    Azure Storage Explorer
    https://azure.microsoft.com/en-us/features/storage-explorer/
    we can use this to connect to our source datasets to understand how they are laid out in their storage account