Word Count

The infamous word counting MapReduce example using Hadoop MapReduce.

Input

Files containing words to be counted

A list of words and their corresponding occurance in the files you provide as the input.

Instructions require prior Hadoop set up.

Clone the repository and open a terminal in the main folder where the pom.xml is located.
Make a .jar in a new /target folder as such:

mvn clean install

hadoop fs -mkdir wordcount/input
hadoop fs -copyFromLocal input.txt wordcount/input

Check that the file is in the file system with hadoop fs -ls wordcount/input, producing the output of:

Found 1 items
-rw-r--r--   1 hduser supergroup         25 2016-12-04 01:52 wordcount/input/input.txt

Run the jar using the command below where the <jar-name> is the name of your jar file and the <user-name> is the name of your user group.

hadoop jar <jar-name>.jar WordCountJob /user/<user-name>/wordcount/input /user/<user-name>/wordcount/output

hadoop fs -copyToLocal wordcount/output .

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
src/main/java		src/main/java
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml