Skip to content

Latest commit

 

History

History
28 lines (22 loc) · 1.56 KB

machine-learning-data-science-ingest-data.md

File metadata and controls

28 lines (22 loc) · 1.56 KB
title description services documentationcenter author manager editor ms.assetid ms.service ms.workload ms.tgt_pltfrm ms.devlang ms.topic ms.date ms.author
Load data into storage environments for analytics | Microsoft Docs
Move Data to and from Azure Blob Storage
machine-learning,storage
bradsev
jhubbard
cgronlun
b8fbef77-3e80-4911-8e84-23dbf42c9bee
machine-learning
data-services
na
na
article
12/16/2016
bradsev

Load data into storage environments for analytics

The Team Data Science Process requires that data be ingested or loaded into a variety of different storage environments to be processed or analyzed in the most appropriate way in each stage of the process. Data destinations commonly used for processing include Azure Blob Storage, SQL Azure databases, SQL Server on Azure VM, HDInsight (Hadoop), and Azure Machine Learning.

[!INCLUDE cap-ingest-data-selector]

This menu links to topics that describe how to ingest data into these target environments where the data is stored and processed.

Technical and business needs, as well as the initial location, format and size of your data will determine the target environments into which the data needs to be ingested to achieve the goals of your analysis. It is not uncommon for a scenario to require data to be moved between several environments to achieve the variety of tasks required to construct a predictive model. This sequence of tasks can include, for example, data exploration, pre-processing, cleaning, down-sampling, and model training.