dxxConan/DexiangXu-Scala

Project description

A project to play with Akka, Kafka, Scala, ElasticSearch, and REST (micro)services by building the data processing pipeline illustrated below.

[Diagram: data processing pipeline]

Installation

Docker

Kafka

Tasks

Phase 1

Implements the first part of the data pipeline: read the data from a CSV file and send it to a Kafka topic with a producer, then check that the topic has been set up properly by building a consumer subscribed to it.
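A minimal sketch of this phase using Alpakka Kafka (akka-stream-kafka) on Akka 2.6+. The CSV path, the topic name test, and the broker address are illustrative assumptions, not values taken from the project:

```scala
import java.nio.file.Paths

import akka.actor.ActorSystem
import akka.kafka.ProducerSettings
import akka.kafka.scaladsl.Producer
import akka.stream.scaladsl.{FileIO, Framing}
import akka.util.ByteString
import org.apache.kafka.clients.producer.ProducerRecord
import org.apache.kafka.common.serialization.StringSerializer

object Phase1Producer extends App {
  implicit val system: ActorSystem = ActorSystem("phase1")

  val producerSettings =
    ProducerSettings(system, new StringSerializer, new StringSerializer)
      .withBootstrapServers("192.168.99.100:9092")

  // Stream the CSV file line by line and publish each line as a Kafka record.
  FileIO.fromPath(Paths.get("data/input.csv")) // hypothetical path; the real one is hard-coded in the project
    .via(Framing.delimiter(ByteString("\n"), maximumFrameLength = 8192, allowTruncation = true))
    .map(_.utf8String)
    .map(line => new ProducerRecord[String, String]("test", line))
    .runWith(Producer.plainSink(producerSettings))
}
```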

Phase 2

Sets up a committable source subscribed to the topic configured in Phase 1, optionally does some asynchronous data processing through a microservice, and uses another producer to send the processed data to a new topic via an Akka flow.
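A sketch of what the committable-source-to-producer flow could look like with Alpakka Kafka 2.x; the topic names, group id, and the process stand-in for the optional microservice call are assumptions:

```scala
import akka.actor.ActorSystem
import akka.kafka.scaladsl.{Consumer, Producer}
import akka.kafka.{CommitterSettings, ConsumerSettings, ProducerMessage, ProducerSettings, Subscriptions}
import org.apache.kafka.clients.consumer.ConsumerConfig
import org.apache.kafka.clients.producer.ProducerRecord
import org.apache.kafka.common.serialization.{StringDeserializer, StringSerializer}

import scala.concurrent.Future

object Phase2Flow extends App {
  implicit val system: ActorSystem = ActorSystem("phase2")
  import system.dispatcher

  val consumerSettings =
    ConsumerSettings(system, new StringDeserializer, new StringDeserializer)
      .withBootstrapServers("192.168.99.100:9092")
      .withGroupId("phase2-group")
      .withProperty(ConsumerConfig.AUTO_OFFSET_RESET_CONFIG, "earliest")

  val producerSettings =
    ProducerSettings(system, new StringSerializer, new StringSerializer)
      .withBootstrapServers("192.168.99.100:9092")

  // Stand-in for the optional asynchronous microservice call.
  def process(value: String): Future[String] = Future(value.toUpperCase)

  Consumer.committableSource(consumerSettings, Subscriptions.topics("test"))
    .mapAsync(parallelism = 4) { msg =>
      process(msg.record.value()).map { processed =>
        // Pair the outgoing record with the offset so it is committed once produced.
        ProducerMessage.single(
          new ProducerRecord[String, String]("processed", processed),
          msg.committableOffset)
      }
    }
    .runWith(Producer.committableSink(producerSettings, CommitterSettings(system)))
}
```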

Phase 3

Builds an Akka stream that listens on a Kafka consumer for the topic created in Phase 2 and sends the data to ElasticSearch over its REST API through a connection flow based on HttpRequest. Note that the data coming out of Phase 2 is a String, so it must be converted into a JSON object before being sent to ElasticSearch.
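One way this could look, assuming Akka HTTP's cachedHostConnectionPool as the connection flow; the index name mydata, the topic name, and the CSV-to-JSON mapping are hypothetical:

```scala
import akka.actor.ActorSystem
import akka.http.scaladsl.Http
import akka.http.scaladsl.model._
import akka.kafka.scaladsl.Consumer
import akka.kafka.{ConsumerSettings, Subscriptions}
import org.apache.kafka.clients.consumer.ConsumerConfig
import org.apache.kafka.common.serialization.StringDeserializer

import scala.util.{Failure, Success}

object Phase3Indexer extends App {
  implicit val system: ActorSystem = ActorSystem("phase3")

  val consumerSettings =
    ConsumerSettings(system, new StringDeserializer, new StringDeserializer)
      .withBootstrapServers("192.168.99.100:9092")
      .withGroupId("phase3-group")
      .withProperty(ConsumerConfig.AUTO_OFFSET_RESET_CONFIG, "earliest")

  // Hypothetical String-to-JSON conversion; real field names depend on the CSV schema.
  def toJson(line: String): String = {
    val cols = line.split(",", -1)
    s"""{"col0":"${cols(0)}","col1":"${cols.lift(1).getOrElse("")}"}"""
  }

  // Connection flow to ElasticSearch: each element is an (HttpRequest, context) pair.
  val esFlow = Http().cachedHostConnectionPool[String]("192.168.99.100", 9200)

  Consumer.plainSource(consumerSettings, Subscriptions.topics("processed"))
    .map(_.value())
    .map { line =>
      HttpRequest(
        method = HttpMethods.POST,
        uri = "/mydata/_doc", // hypothetical index name
        entity = HttpEntity(ContentTypes.`application/json`, toJson(line))
      ) -> line
    }
    .via(esFlow)
    .runForeach {
      case (Success(response), _) => response.discardEntityBytes() // drain to keep the pool healthy
      case (Failure(ex), line)    => system.log.error(s"Failed to index '$line': $ex")
    }
}
```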

Phase 4

Implements a REST service backed by ElasticSearch that gives users methods to search the data originally stored in the CSV file. Uses the Akka HTTP DSL to listen for incoming user requests, query ElasticSearch, and send the results back to the user.
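A sketch of the service under Akka HTTP 10.2+ (newServerAt); the route shape, the index name, and the pass-through query strategy are assumptions rather than the project's actual design. Port 8200 matches the usage notes below:

```scala
import akka.actor.ActorSystem
import akka.http.scaladsl.Http
import akka.http.scaladsl.model._
import akka.http.scaladsl.server.Directives._
import akka.http.scaladsl.unmarshalling.Unmarshal

import scala.concurrent.Future

object Phase4Service extends App {
  implicit val system: ActorSystem = ActorSystem("phase4")
  import system.dispatcher

  // Forward a simple term query to ElasticSearch's search endpoint (hypothetical index name).
  def search(term: String): Future[HttpResponse] =
    Http()
      .singleRequest(HttpRequest(uri = s"http://192.168.99.100:9200/mydata/_search?q=$term"))
      .flatMap(resp => Unmarshal(resp.entity).to[String])
      .map(body => HttpResponse(entity = HttpEntity(ContentTypes.`application/json`, body)))

  // Akka HTTP routing DSL: GET /search?q=... queries ElasticSearch and relays the result.
  val route =
    path("search") {
      get {
        parameter("q") { q =>
          complete(search(q))
        }
      }
    }

  Http().newServerAt("localhost", 8200).bind(route)
}
```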

Usage and test

  • Start each project with sbt run
  • The CSV file path is hard-coded in Phase 1
  • The producer should be started before the consumer begins loading
  • Kafka topic status can be checked with a Kafka GUI tool
  • Or from the command line: `bin/kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic test --from-beginning`
  • For Docker the IP is set to 192.168.99.100
  • Kafka port: 9092, ElasticSearch port: 9200
  • HTTP requests for the ElasticSearch-backed service go to localhost:8200
