Starred repositories
Free and Open Source, Distributed, RESTful Search Engine
Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
A tool for reverse engineering Android apk files
High Performance Inter-Thread Messaging Library
Logstash - transport and process your logs, events, or other data
Apache Druid: a high performance real-time analytics database.
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
A Flexible and Powerful Parameter Server for large-scale machine learning
BTrace - a safe, dynamic tracing tool for the Java platform
A Spring Framework based, pragmatic style JavaEE application reference architecture.
Reformats Java source code to comply with Google Java Style.
Apache Kafka® running on Kubernetes
A high performance scripting language hosted on the JVM.
The official AWS SDK for Java 1.x (In Maintenance Mode, End-of-Life on 12/31/2025). The AWS SDK for Java 2.x is available here: https://github.com/aws/aws-sdk-java-v2/
Deep Learning (Python, C, C++, Java, Scala, Go)
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …
中文自然语言处理工具包 Toolkit for Chinese natural language processing
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
P6Spy is a framework that enables database data to be seamlessly intercepted and logged with no code changes to the application.
Java serialization library, proto compiler, code generator
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived vital statistics - E2E latency, service produce/consume avai…
Secor is a service implementing Kafka log persistence
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Apache InLong - a one-stop, full-scenario integration framework for massive data
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
An extensible distributed system for reliable nearline data streaming at scale