Skip to content

The easiest way to get started with CS246 assignments (TM)

Notifications You must be signed in to change notification settings

rcchen/cs246-baseline

Repository files navigation

cs246-baseline

An easy way to start Hadoop based assignments in CS246.

Requirements

  • Java 8 SDK
  • Eclipse (optional, project contains plugins for generating Eclipse project files)

Instructions

  1. Clone this repository
  2. Run the command ./gradlew eclipse
  3. Import the root folder into Eclipse

To run the code you have written, simply call the corresponding Gradle run task. Run tasks are defined in build.gradle. A sample run task for WordCount is included.

task runWordCount(type: JavaExec) {
  dependsOn removeOutput

  main = "edu.stanford.cs246.wordcount.WordCount"
  classpath = sourceSets.main.runtimeClasspath
  args = ["data/pg100.txt", "output"]
}

This task is executed by typing in ./gradlew runWordCount from the root directory. It will automatically delete an output folder if it exists from a previous run, then execute the WordCount program with the specified arguments.

To define your own run task, just copy that block, paste it somewhere else in build.gradle, and name the task differently. You will also want to change main and args as appropriate. Everything else can probably be left the same though.

FAQ

Q: Is the Java 8 SDK strictly necessary? I have the Java <8 SDK installed. A: No, it is not necessary. However, there are a ton of features in Java 8 that you might like to use like lambdas. If you want to use an older Java SDK, just change the sourceCompatibility line in build.gradle to match your installed SDK version.

Q: Don't we need a cluster to run things for this class? A: Nope. See https://piazza.com/class/ixhxcsxn21l1fu?cid=99

About

The easiest way to get started with CS246 assignments (TM)

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages