Starred repositories
Free and Open Source, Distributed, RESTful Search Engine
Alibaba Java Diagnostic Tool Arthas
A tool for reverse engineering Android apk files
High Performance Inter-Thread Messaging Library
Logstash - transport and process your logs, events, or other data
Apache Druid: a high performance real-time analytics database.
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
A Flexible and Powerful Parameter Server for large-scale machine learning
BTrace - a safe, dynamic tracing tool for the Java platform
A Spring Framework based, pragmatic style JavaEE application reference architecture.
Reformats Java source code to comply with Google Java Style.
Apache Kafka® running on Kubernetes
A high performance scripting language hosted on the JVM.
The official AWS SDK for Java 1.x (In Maintenance Mode, End-of-Life on 12/31/2025). The AWS SDK for Java 2.x is available here: https://github.com/aws/aws-sdk-java-v2/
Deep Learning (Python, C, C++, Java, Scala, Go)
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …
Toolkit for Chinese natural language processing
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…
P6Spy is a framework that enables database data to be seamlessly intercepted and logged with no code changes to the application.
XPrivacy - The ultimate, yet easy to use, privacy manager
Java serialization library, proto compiler, code generator
Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived vital statistics - E2E latency, service produce/consume avai…
Secor is a service implementing Kafka log persistence
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Apache InLong - a one-stop, full-scenario integration framework for massive data
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.