spark-emr-ddb-writetoddb

This example helps you to write Dataset/query output to DynamoDB table.

Steps taken:

Create external tables using Hive
Run queries on s3 data and save result to genreRatingsCount DataSet
Convert the DataSet to RDD and run map function on it to create ITEMs
Using saveAsHadoopDataset and DDBConf(emr-ddb-hadoop), write the ITEMs to DynamoDB table.

Command to run

spark-submit --jars /usr/share/aws/emr/ddb/lib/emr-ddb-hadoop.jar --class com.chappidm.spark_emr_ddb.writetoddb.UserRatingCountDDB spark_emr_ddb.writetoddb-0.0.1-SNAPSHOT.jar

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src/main/scala/com/chappidm/spark_emr_ddb/writetoddb		src/main/scala/com/chappidm/spark_emr_ddb/writetoddb
.classpath		.classpath
.gitignore		.gitignore
.project		.project
README.md		README.md
myHive.hql		myHive.hql
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spark-emr-ddb-writetoddb

About

Releases

Packages

Languages

chappidim/spark-emr-ddb-writetoddb

Folders and files

Latest commit

History

Repository files navigation

spark-emr-ddb-writetoddb

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages