machine-learning-data-science-for-beginners-copy-other-peoples-work-to-do-data-science.md

title: Copy others' data science work - machine learning example | Microsoft Docs
description: Trade secret of data science: Get others to do your work for you. See examples in Cortana Analytics Gallery such as a clustering algorithm example.
keywords: data science examples,machine learning example,clustering algorithm,clustering algorithm example
services: machine-learning
documentationcenter: na
author: cjgronlund
manager: jhubbard
editor: cjgronlund
ms.assetid: ec2be823-c325-4ad8-b8b2-3e664f1a44b4
ms.service: machine-learning
ms.devlang: na
ms.topic: article
ms.tgt_pltfrm: na
ms.workload: na
ms.date: 10/20/2016
ms.author: cgronlun;garye

Copy other people's work to do data science

Video 5: Data Science for Beginners series

One of the trade secrets of data science is getting other people to do your work for you. Find a clustering algorithm example to use for your own machine learning experiment.

To get the most out of the series, watch them all. Go to the list of videos.

[!VIDEO https://channel9.msdn.com/Blogs/Windows-Azure/data-science-for-beginners-series-copy-other-peoples-work-to-do-data-science/player]

Other videos in this series

Data Science for Beginners is a quick introduction to data science in five short videos.

Transcript: Copy other people's work to do data science

Welcome to the fifth video in the series “Data Science for Beginners.”

In this one, you’ll discover a place to find examples that you can borrow from as a starting point for your own work. You’ll get the most out of this video if you first watch the earlier videos in this series.

One of the trade secrets of data science is getting other people to do your work for you.

Find examples in the Cortana Intelligence Gallery

Microsoft has a cloud-based service called Azure Machine Learning that you're welcome to try for free. It provides you with a workspace where you can experiment with different machine learning algorithms, and, when you've got your solution worked out, you can launch it as a web service.

Part of this service is something called the Cortana Intelligence Gallery. It contains a variety of resources, one of which is a collection of Azure Machine Learning experiments, or models, that people have built and contributed for others to use. These experiments are a great way to leverage the thought and hard work of others to get you started on your own solutions.

You can find the gallery at aka.ms/CortanaIntelligenceGallery. Everyone is welcome to browse through it.

Cortana Intelligence Gallery

If you click Experiments at the top, you'll see a number of the most recent and popular experiments in the gallery. You can search through the rest of the experiments by clicking Browse All at the top of the screen, where you can enter search terms and choose search filters.

Find and use a clustering algorithm example

For instance, let's say you want to see an example of how clustering works, so you search for "clustering" experiments.

Search for clustering experiments

Here's an interesting one that someone contributed to the gallery.

Clustering experiment

Click on that experiment and you get a web page that describes the work that this contributor did, along with some of their results.

Clustering experiment description page

Notice the link that says Open in Studio.

Open in Studio button

I can click on that and it takes me right to Azure Machine Learning Studio. It creates a copy of the experiment and puts it in my own workspace. This includes the contributor's dataset, all the processing that they did, all of the algorithms that they used, and how they saved out the results.

Open a Gallery experiment in Machine Learning Studio - clustering algorithm example

And now I have a starting point. I can swap out their data for my own and do my own tweaking of the model. This gives me a running start, and it lets me build on the work of people who really know what they’re doing.
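If you are curious what a clustering experiment does under the hood, the following is a minimal sketch in Python of k-means, one common clustering algorithm. It is an illustration only: it is not the code from the gallery experiment, it does not use Azure Machine Learning, and the function name `kmeans`, its parameters, and the `my_points` data are hypothetical.

```python
import math
import random


def kmeans(points, k, iterations=20, seed=0):
    """Group 2-D points into k clusters with a basic k-means loop."""
    rng = random.Random(seed)
    # Start from k distinct points chosen at random as the initial centers.
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest center.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centers[c]))
            for p in points
        ]
        # Update step: move each center to the mean of its assigned points.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # keep the old center if a cluster ends up empty
                centers[c] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return labels, centers


# Hypothetical stand-in for "your own data": two loose groups of points.
my_points = [
    (1.0, 1.2), (0.8, 1.0), (1.1, 0.9),
    (5.0, 5.1), (5.2, 4.8), (4.9, 5.3),
]
labels, centers = kmeans(my_points, k=2)
print(labels)
```

Swapping in your own data here means replacing `my_points`, which mirrors the idea of replacing the contributor's dataset in a copied experiment. Tweaking the model corresponds to changing values such as `k`.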

Find experiments that demonstrate machine learning techniques

There are other experiments in the Cortana Intelligence Gallery that were contributed specifically to provide how-to examples for people new to data science. For instance, there's an experiment in the gallery that demonstrates how to handle missing values (Methods for handling missing values). It walks you through 15 different ways of substituting empty values, and talks about the benefits of each method and when to use it.

Gallery experiment open in Machine Learning Studio - methods for missing values
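To give a feel for what "substituting empty values" means, here is a small Python sketch of three simple substitution strategies: mean, median, and a fixed constant. These three are assumptions chosen for illustration and are not necessarily among the methods the gallery experiment covers. The function `fill_missing` and the `ages` column are hypothetical.

```python
import statistics


def fill_missing(values, method="mean", constant=0.0):
    """Return a copy of values with None entries replaced."""
    # Only the values that are actually present inform the fill value.
    present = [v for v in values if v is not None]
    if method == "mean":
        fill = statistics.mean(present)
    elif method == "median":
        fill = statistics.median(present)
    elif method == "constant":
        fill = constant
    else:
        raise ValueError(f"unknown method: {method}")
    return [fill if v is None else v for v in values]


# Hypothetical column with two empty values.
ages = [20, None, 30, 70, None]
print(fill_missing(ages, "mean"))
print(fill_missing(ages, "median"))
print(fill_missing(ages, "constant", constant=-1))
```

In this example the mean is pulled upward by the single large value 70, while the median is not. That kind of trade-off is what determines when each method is appropriate.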

Cortana Intelligence Gallery is a place to find working experiments that you can use as a starting point for your own solutions.

Be sure to check out the other videos in “Data Science for Beginners” from Microsoft Azure Machine Learning.

Next steps