Skip to content
View cakiem8x's full-sized avatar

Block or report cakiem8x

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

8 results for source starred repositories written in Scala
Clear filter

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,415 28,472 Updated Jan 29, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,780 1,754 Updated Jan 29, 2025

Simple and Distributed Machine Learning

Scala 5,091 836 Updated Jan 10, 2025

State of the Art Natural Language Processing

Scala 3,906 719 Updated Jan 29, 2025

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala 1,238 750 Updated Jan 28, 2025

Examples for High Performance Spark

Scala 506 234 Updated Nov 3, 2024

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment anal…

Scala 501 128 Updated Aug 24, 2022

This repository contains code for Spark Streaming

Scala 21 12 Updated Mar 11, 2021