Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
PredictionIO, a machine learning server for developers and ML engineers.
State of the Art Natural Language Processing
Spark: The Definitive Guide's Code Repository
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Byzer (formerly MLSQL): A low-code open-source programming language for data pipelines, analytics and AI.
A connector for Spark that allows reading and writing to/from Redis cluster
CTR prediction model based on Spark (LR, GBDT, DNN)
The code examples used in Programming Scala, 2nd and 3rd Editions (O'Reilly)
Learning Apache Spark, including code and data. Most parts can run locally.
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in the Scala language
Abstractions from Category theory with simple description & implementation, links to further resources.
The official repository for the Rock the JVM Scala 2 for beginners course
Spark, Spark Streaming and Spark SQL unit testing strategies
Spark learning path, including study notes on Spark Core, Spark SQL, Spark Streaming, and Spark MLlib
Apache Spark Course Material