xh2002/CSC4160_FinalProject

CSC4160 Final Project

Final Project of CSC4160 at CUHK(SZ): Cloud-Based Distributed MapReduce System

Introduction

This project implements a cloud-based distributed MapReduce system inspired by Google’s original MapReduce framework. It demonstrates scalability, fault tolerance, and integration with cloud services for large-scale data processing.

Demo

The demonstration is available via this link. We recommend turning on English subtitles when watching.

Setup Instructions

1. Prerequisites

  • Go (1.20 or later) installed on your system.
  • Access to a cloud environment such as AWS EC2 for multi-node execution (optional).
  • Source code cloned from the GitHub repository.

2. Build Locally and Run WordCount

  1. Navigate to the src/main directory:

    cd src/main  
  2. Build the WordCount plugin:

    go build -buildmode=plugin ../mrapps/wc.go  
  3. Clean up previous outputs (if any):

    rm -f mr-out*
  4. Run the WordCount task in sequential mode:

    go run mrsequential.go wc.so pg*.txt  
  5. View the output:

    more mr-out-0  
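
To make the sequential run above more concrete, here is a minimal, self-contained sketch of what a WordCount job does when everything runs in a single process: map each input, sort the intermediate pairs by key, then reduce each group of equal keys. It is an illustration only, not the code in this repository. The `KeyValue` type, the `Map` and `Reduce` signatures, and the `RunSequential` helper are assumptions made for this example, and the inputs are in-memory strings rather than the `pg*.txt` files.

```go
package main

import (
	"fmt"
	"sort"
	"strconv"
	"strings"
	"unicode"
)

// KeyValue is a hypothetical intermediate pair type used only in this sketch.
type KeyValue struct {
	Key   string
	Value string
}

// Map splits contents into words and emits one (word, "1") pair per word.
// The filename argument is unused by WordCount but kept to mirror a typical
// map-function shape (an assumption, not the repository's confirmed API).
func Map(filename string, contents string) []KeyValue {
	isSeparator := func(r rune) bool { return !unicode.IsLetter(r) }
	words := strings.FieldsFunc(contents, isSeparator)
	pairs := make([]KeyValue, 0, len(words))
	for _, w := range words {
		pairs = append(pairs, KeyValue{Key: w, Value: "1"})
	}
	return pairs
}

// Reduce receives every value emitted for one key and returns the count.
func Reduce(key string, values []string) string {
	return strconv.Itoa(len(values))
}

// RunSequential is a hypothetical driver: it maps every input, sorts the
// intermediate pairs by key, and calls Reduce once per distinct key.
func RunSequential(inputs map[string]string) []KeyValue {
	var intermediate []KeyValue
	for name, contents := range inputs {
		intermediate = append(intermediate, Map(name, contents)...)
	}
	sort.Slice(intermediate, func(i, j int) bool {
		return intermediate[i].Key < intermediate[j].Key
	})

	var out []KeyValue
	for i := 0; i < len(intermediate); {
		// Collect the run of pairs that share the same key.
		j := i
		var values []string
		for j < len(intermediate) && intermediate[j].Key == intermediate[i].Key {
			values = append(values, intermediate[j].Value)
			j++
		}
		key := intermediate[i].Key
		out = append(out, KeyValue{Key: key, Value: Reduce(key, values)})
		i = j
	}
	return out
}

func main() {
	inputs := map[string]string{
		"a.txt": "the quick brown fox",
		"b.txt": "the lazy dog and the fox",
	}
	// Print one "word count" line per distinct word, in sorted key order.
	for _, kv := range RunSequential(inputs) {
		fmt.Printf("%v %v\n", kv.Key, kv.Value)
	}
}
```

In a distributed run, the same two functions would be applied by separate worker processes, with the sort-and-group step replaced by partitioning intermediate pairs across reduce tasks.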
