Update H2O1 PCA tutorial
jessica0xdata committed Mar 19, 2015
1 parent dcd2658 commit c4829b9
Showing 4 changed files with 6 additions and 9 deletions.
Binary file modified h2o-docs/source/tutorial/PCAoutput.png
Binary file modified h2o-docs/source/tutorial/PCAparse.png
Binary file modified h2o-docs/source/tutorial/PCArequest.png
15 changes: 6 additions & 9 deletions h2o-docs/source/tutorial/pca.rst
@@ -47,26 +47,23 @@ Before modeling, parse data into H2O:
Building a Model
""""""""""""""""

#. Once data are parsed, a horizontal menu displays at the top
of the screen reading "Build model using ... ". Select
PCA here, or go to the drop-down **Model** menu and
select PCA.
#. Click the drop-down **Model** menu and select *PCA*.


#. In the "source" field, enter the .hex key for the Arrhythmia data set.
#. In the **source** field, enter the .hex key for the Arrhythmia data set.


#. In the "Ignored Columns" field, select the set of columns to
omit from the analysis.
omit from the analysis. For this example, do not select any columns.

**Note**: PCA ignores categorical variables and constant columns. Categoricals can be included by expanding the categorical into a set of binomial indicators.


#. To specify the maximum number of principal components to
be returned, enter a value in the "max pc" field. In this example, the maximum number of components is 100.
be returned, enter a value in the **max pc** field. For this example, enter `100`.


#. To omit components exhibiting low standard deviation (which indicates a lack of contribution to the overall variance observed in the data), enter a value in the "tolerance" field. In this example, set Tolerance to .5.
#. To omit components exhibiting low standard deviation (which indicates a lack of contribution to the overall variance observed in the data), enter a value in the **tolerance** field. For this example, enter `.5`.


#. To standardize, check the "standardize" checkbox. Standardizing is highly
@@ -75,7 +72,7 @@ Building a Model
variances relative to other attributes purely as a matter of scale,
rather than true contribution.


#. To generate the model, click the **Submit** button.

.. image:: PCArequest.png
:width: 70%
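
The effect of the **standardize**, **max pc**, and **tolerance** settings described above can be illustrated outside of H2O with a minimal NumPy sketch. This is not H2O's implementation: the function name ``pca_sketch`` and the toy data are hypothetical, and the sketch assumes that the tolerance is applied relative to the standard deviation of the first component (the rule R's ``prcomp`` uses for ``tol``), which may differ from H2O's exact behavior.

```python
import numpy as np


def pca_sketch(X, max_pc=100, tolerance=0.5, standardize=True):
    """Illustrative PCA (hypothetical helper, not H2O's code)."""
    X = np.asarray(X, dtype=float)

    # Drop constant columns: they contribute no variance.
    X = X[:, X.std(axis=0, ddof=1) > 0]

    # Center each column; optionally scale it to unit variance so that
    # no attribute dominates purely because of its units.
    X = X - X.mean(axis=0)
    if standardize:
        X = X / X.std(axis=0, ddof=1)

    # Eigen-decompose the covariance matrix (the correlation matrix
    # when the data were standardized).
    cov = np.atleast_2d(np.cov(X, rowvar=False))
    eigvals, eigvecs = np.linalg.eigh(cov)

    # eigh returns ascending eigenvalues; sort components by
    # decreasing standard deviation instead.
    order = np.argsort(eigvals)[::-1]
    sdev = np.sqrt(np.clip(eigvals[order], 0.0, None))
    eigvecs = eigvecs[:, order]

    # ASSUMPTION: keep components whose standard deviation exceeds
    # tolerance * (standard deviation of the first component).
    n_keep = int(np.sum(sdev > tolerance * sdev[0]))

    # Never return more than max_pc components.
    n_keep = min(n_keep, max_pc)

    loadings = eigvecs[:, :n_keep]
    return sdev[:n_keep], loadings, X @ loadings


# Toy data: two nearly collinear columns plus one constant column.
data = [
    [1.0, 2.0, 5.0],
    [2.0, 4.1, 5.0],
    [3.0, 5.9, 5.0],
    [4.0, 8.0, 5.0],
]

sdev, loadings, scores = pca_sketch(data, max_pc=100, tolerance=0.5)
print("components kept:", len(sdev))
print("standard deviations:", sdev)
```

In this toy data the two non-constant columns are almost perfectly correlated, so nearly all of the variance falls on the first component. Under the stated assumption, calling the function again with ``tolerance=0`` retains the low-variance second component as well, and a smaller ``max_pc`` caps how many components are returned.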