Stars
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Distributed lock for your scheduled tasks
Free monospaced font with programming ligatures
Java DSL for easy testing of REST services
Very spicy additions to the Java programming language.
Apache Druid: a high-performance real-time analytics database.
The Java gRPC implementation. HTTP/2-based RPC
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A Wiki containing helpful information for new and existing students of the Machine Learning Nanodegree at Udacity. Written by students of the Nanodegree.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large-scale machine learning
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"
A TestNG-like dataprovider runner for JUnit with many additional features
Clean and simple clipboard manager for developers
Guice (pronounced 'juice') is a lightweight dependency injection framework for Java 11 and above, brought to you by Google.