Skip to content

Commit

Permalink
[SPARK-1874][MLLIB] Clean up MLlib sample data
Browse files Browse the repository at this point in the history
1. Added synthetic datasets for `MovieLensALS`, `LinearRegression`, `BinaryClassification`.
2. Embedded instructions in the help message of those example apps.

Per discussion with Matei on the JIRA page, new example data is under `data/mllib`.

Author: Xiangrui Meng <[email protected]>

Closes apache#833 from mengxr/mllib-sample-data and squashes the following commits:

59f0a18 [Xiangrui Meng] add sample binary classification data
3c2f92f [Xiangrui Meng] add linear regression data
050f1ca [Xiangrui Meng] add a sample dataset for MovieLensALS example
  • Loading branch information
mengxr authored and tdas committed May 20, 2014
1 parent b0ce22e commit bcb9dce
Show file tree
Hide file tree
Showing 6 changed files with 2,138 additions and 2 deletions.
100 changes: 100 additions & 0 deletions data/mllib/sample_binary_classification_data.txt

Large diffs are not rendered by default.

501 changes: 501 additions & 0 deletions data/mllib/sample_linear_regression_data.txt

Large diffs are not rendered by default.

Loading

0 comments on commit bcb9dce

Please sign in to comment.