title | titleSuffix | description | services | author | ms.service | ms.subservice | ms.topic | ms.date | ms.author |
---|---|---|---|---|---|---|---|---|---|
Tutorial: Copy to Blob storage via REST APIs |
Azure Data Box |
In this tutorial, learn how to connect to Azure Data Box Blob storage by using REST APIs over http or https, then copy data from Azure Data Box. |
databox |
alkohli |
databox |
pod |
tutorial |
07/02/2020 |
alkohli |
This tutorial describes procedures to connect to Azure Data Box Blob storage via REST APIs over http or https. Once connected, the steps required to copy the data to Data Box Blob storage and prepare the Data Box to ship, are also described.
In this tutorial, you learn how to:
[!div class="checklist"]
- Prerequisites
- Connect to Data Box Blob storage via http or https
- Copy data to Data Box
Before you begin, make sure that:
- You've completed the Tutorial: Set up Azure Data Box.
- You've received your Data Box and the order status in the portal is Delivered.
- You've reviewed the system requirements for Data Box Blob storage and are familiar with supported versions of APIs, SDKs, and tools.
- You've access to a host computer that has the data that you want to copy over to Data Box. Your host computer must
- Run a Supported operating system.
- Be connected to a high-speed network. We strongly recommend that you have at least one 10-GbE connection. If a 10-GbE connection isn't available, a 1-GbE data link can be used but the copy speeds will be impacted.
- Download AzCopy 7.1.0 on your host computer. You'll use AzCopy to copy data to Azure Data Box Blob storage from your host computer.
You can connect to Data Box Blob storage over http or https.
- Https is the secure and recommended way to connect to Data Box Blob storage.
- Http is used when connecting over trusted networks.
The steps to connect are different when you connect to Data Box Blob storage over http or https.
Connection to Data Box Blob storage REST APIs over http requires the following steps:
- Add the device IP and blob service endpoint to the remote host
- Configure third-party software and verify the connection
Each of these steps is described in the following sections.
[!INCLUDE data-box-add-device-ip]
[!INCLUDE data-box-configure-partner-software]
[!INCLUDE data-box-verify-connection]
Connection to Azure Blob storage REST APIs over https requires the following steps:
- Download the certificate from Azure portal
- Import the certificate on the client or remote host
- Add the device IP and blob service endpoint to the client or remote host
- Configure third-party software and verify the connection
Each of these steps is described in the following sections.
Use the Azure portal to download certificate.
-
Sign into the Azure portal.
-
Go to your Data Box order and navigate to General > Device details.
-
Under Device credentials, go to API access to device. Click Download. This action downloads a <your order name>.cer certificate file. Save this file. You will install this certificate on the client or host computer that you will use to connect to the device.
Accessing Data Box Blob storage over HTTPS requires a TLS/SSL certificate for the device. The way in which this certificate is made available to the client application varies from application to application and across operating systems and distributions. Some applications can access the certificate after it is imported into the system's certificate store, while other applications do not make use of that mechanism.
Specific information for some applications is mentioned in this section. For more information on other applications, consult the documentation for the application and the operating system used.
Follow these steps to import the .cer
file into the root store of a Windows or Linux client. On a Windows system, you can use Windows PowerShell or the Windows Server UI to import and install the certificate on your system.
-
Start a Windows PowerShell session as an administrator.
-
At the command prompt, type:
Import-Certificate -FilePath C:\temp\localuihttps.cer -CertStoreLocation Cert:\LocalMachine\Root
-
Right-click the
.cer
file and select Install certificate. This action starts the Certificate Import Wizard. -
For Store location, select Local Machine, and then click Next.
-
Select Place all certificates in the following store, and then click Browse. Navigate to the root store of your remote host, and then click Next.
-
Click Finish. A message that tells you that the import was successful appears.
The method to import a certificate varies by distribution.
Several, such as Ubuntu and Debian, use the update-ca-certificates
command.
- Rename the Base64-encoded certificate file to have a
.crt
extension and copy it into the/usr/local/share/ca-certificates directory
. - Run the command
update-ca-certificates
.
Recent versions of RHEL, Fedora, and CentOS use the update-ca-trust
command.
- Copy the certificate file into the
/etc/pki/ca-trust/source/anchors
directory. - Run
update-ca-trust
.
Consult the documentation specific to your distribution for details.
Follow the same steps to add device IP address and blob service endpoint when connecting over http.
Follow the steps to Configure partner software that you used while connecting over http. The only difference is that you should leave the Use http option unchecked.
Once you are connected to the Data Box Blob storage, the next step is to copy data. Prior to data copy, review the following considerations:
- While copying data, ensure that the data size conforms to the size limits described in the Azure storage and Data Box limits.
- If data, which is being uploaded by Data Box, is concurrently uploaded by other applications outside of Data Box, this may result in upload job failures and data corruption.
Important
Make sure that you maintain a copy of the source data until you can confirm that the Data Box has transferred your data into Azure Storage.
In this tutorial, AzCopy is used to copy data to Data Box Blob storage. You can also use Azure Storage Explorer (if you prefer a GUI-based tool) or a partner software to copy the data.
The copy procedure has the following steps:
- Create a container
- Upload contents of a folder to Data Box Blob storage
- Upload modified files to Data Box Blob storage
Each of these steps is described in detail in the following sections.
The first step is to create a container, because blobs are always uploaded into a container. Containers organize groups of blobs like you organize files in folders on your computer. Follow these steps to create a blob container.
-
Open Storage Explorer.
-
In the left pane, expand the storage account within which you wish to create the blob container.
-
Right-click Blob Containers, and from the context menu, select Create Blob Container.
-
A text box appears below the Blob Containers folder. Enter the name for your blob container. See the Create the container and set permissions for information on rules and restrictions on naming blob containers.
-
Press Enter when done to create the blob container, or Esc to cancel. Once the blob container is successfully created, it is displayed under the Blob Containers folder for the selected storage account.
Use AzCopy to upload all files in a folder to Blob storage on Windows or Linux. To upload all blobs in a folder, enter the following AzCopy command:
azcopy \
--source /mnt/myfolder \
--destination https://data-box-storage-account-name.blob.device-serial-no.microsoftdatabox.com/container-name/files/ \
--dest-key <key> \
--recursive
AzCopy /Source:C:\myfolder /Dest:https://data-box-storage-account-name.blob.device-serial-no.microsoftdatabox.com/container-name/files/ /DestKey:<key> /S
Replace <key>
with your account key. To get your account key, in the Azure portal, go to your storage account. Go to Settings > Access keys, select a key, and paste it into the AzCopy command.
If the specified destination container does not exist, AzCopy creates it and uploads the file into it. Update the source path to your data directory, and replace data-box-storage-account-name
in the destination URL with the name of the storage account associated with your Data Box.
To upload the contents of the specified directory to Blob storage recursively, specify the --recursive
(Linux) or /S
(Windows) option. When you run AzCopy with one of these options, all subfolders and their files are uploaded as well.
Use AzCopy to upload files based on their last-modified time. To try this, modify or create new files in your source directory for test purposes. To upload only updated or new files, add the --exclude-older
(Linux) or /XO
(Windows) parameter to the AzCopy command.
If you only want to copy source resources that do not exist in the destination, specify both --exclude-older
and --exclude-newer
(Linux) or /XO
and /XN
(Windows) parameters in the AzCopy command. AzCopy uploads only the updated data, based on its time stamp.
azcopy \
--source /mnt/myfolder \
--destination https://data-box-storage-account-name.blob.device-serial-no.microsoftdatabox.com/container-name/files/ \
--dest-key <key> \
--recursive \
--exclude-older
AzCopy /Source:C:\myfolder /Dest:https://data-box-storage-account-name.blob.device-serial-no.microsoftdatabox.com/container-name/files/ /DestKey:<key> /S /XO
If there are any errors during the connect or copy operation, see Troubleshoot issues with Data Box Blob storage.
Next step is to prepare your device to ship.
In this tutorial, you learned about Azure Data Box topics such as:
[!div class="checklist"]
- Prerequisites
- Connect to Data Box Blob storage via http or https
- Copy data to Data Box
Advance to the next tutorial to learn how to ship your Data Box back to Microsoft.
[!div class="nextstepaction"] Ship your Azure Data Box to Microsoft