Skip to content
/ glow Public
forked from chrislusf/glow

Glow is an easy-to-use distributed computation system written in Go, an alternative to Hadoop Map Reduce, Spark, Flint, etc. Currently just started and not feature rich yet.

Notifications You must be signed in to change notification settings

wxdublin/glow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

glow

Examples are in this repo https://github.com/chrislusf/glow_examples

Purpose

Glow is providing a library to easily compute in parallel threads or distributed to clusters of machines.

One minute tutorial

Simple Start

Here is an simple full example:

package main

import (
	"flag"
	"strings"

	"github.com/chrislusf/glow/flow"
)

func main() {
	flag.Parse()

	flow.NewContext().TextFile(
		"/etc/passwd", 3,
	).Filter(func(line string) bool {
		return !strings.HasPrefix(line, "#")
	}).Map(func(line string, ch chan string) {
		for _, token := range strings.Split(line, ":") {
			ch <- token
		}
	}).Map(func(key string) int {
		return 1
	}).Reduce(func(x int, y int) int {
		return x + y
	}).Map(func(x int) {
		println("count:", x)
	})
}

Try it.

./word_count

It will run the input text file, '/etc/passwd', in 3 go routines, filter/map/map, and then reduced to one number in one goroutine (not exactly correct, but let's skip the details for now.) and print it out.

This is useful already, saving lots of idiomatic but repetitive code on channels, sync wait, etc.

However, there is one more thing!

Scale it out

We need to setup the cluster first. We do not need experts on Zookeeper/HDFS/Mesos/YARN etc. Just need to download one binary file.

Setup the cluster

  // fetch and install via go, or just download it from somewhere
  go get github.com/chrislusf/glow
  // start a leader on one computer
  glow leader
  // run one or more agents on computers
  glow agent --dir . --max.executors=16 --memory=2048 --leader="localhost:8930" --port 8931

Start the driver program

To leap from one computer to clusters of computers, add this line to the import list:

	_ "github.com/chrislusf/glow/driver"

This will "steroidize" the code to run in cluster mode!

./word_count -driver -driver.leader="localhost:8930"

About

Glow is an easy-to-use distributed computation system written in Go, an alternative to Hadoop Map Reduce, Spark, Flint, etc. Currently just started and not feature rich yet.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 99.1%
  • Protocol Buffer 0.9%