Skip to content
View guihaojin's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report guihaojin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

25 stars written in Scala
Clear filter

Source code for Twitter's Recommendation Algorithm

Scala 62,947 12,176 Updated Jul 10, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,596 28,516 Updated Feb 24, 2025

Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

Scala 14,358 3,115 Updated Feb 20, 2025

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,112 3,586 Updated Feb 19, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,827 1,769 Updated Feb 20, 2025

ZIO — A type-safe, composable library for async and concurrent programming in Scala

Scala 4,165 1,322 Updated Feb 23, 2025

REST job server for Apache Spark

Scala 2,834 992 Updated Jan 4, 2025

JSON library

Scala 1,486 331 Updated Feb 22, 2025

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,006 315 Updated Oct 5, 2022

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 903 605 Updated Nov 12, 2024

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Scala 887 249 Updated Jan 6, 2025

Lightweight real-time big data streaming engine over Akka

Scala 763 152 Updated Mar 1, 2022

Mirror of Apache Toree (Incubating)

Scala 741 224 Updated Feb 20, 2025

Data Lineage Tracking And Visualization Solution

Scala 613 156 Updated Feb 6, 2025

Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apa…

Scala 612 118 Updated Jan 8, 2020

XML data source for Spark SQL and DataFrames

Scala 509 226 Updated Aug 11, 2024

Supporting code for the tutorials on https://www.baeldung.com/scala

Scala 342 213 Updated Feb 23, 2025

Snowflake Data Source for Apache Spark.

Scala 224 101 Updated Dec 3, 2024

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 185 34 Updated Feb 12, 2023

A simple hello world using Apache Spark

Scala 27 14 Updated Dec 26, 2024

Distributed. Columnar. Versioned. Streaming. SQL.

Scala 10 2 Updated Aug 3, 2020

Mirror of Apache Toree (Incubating)

Scala 1 Updated Feb 11, 2020

Mirror of Apache livy (Incubating)

Scala 1 Updated May 22, 2021

Apache Spark

Scala 1 Updated Apr 11, 2022