Starred repositories
Free and Open Source, Distributed, RESTful Search Engine
APM, Application Performance Monitoring System
A Java serialization/deserialization library to convert Java Objects into JSON and back
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Open-source IoT Platform - Device management, data collection, processing and visualization.
QuestDB is a high performance, open-source, time-series database
Apache Doris is an easy-to-use, high performance and unified analytics database.
Guice (pronounced 'juice') is a lightweight dependency injection framework for Java 11 and above, brought to you by Google.
OpenRefine is a free, open source power tool for working with messy data and improving it
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Activiti is a light-weight workflow and Business Process Management (BPM) Platform targeted at business people, developers and system admins. Its core is a super-fast and rock-solid BPMN 2 process …
The Metadata Platform for your Data and AI Stack
A simple, expressive web framework for Java. Spark has a Kotlin DSL: https://github.com/perwendel/spark-kotlin
Apache Beam is a unified programming model for Batch and Streaming data processing.
Pentaho Data Integration (ETL), a.k.a. Kettle
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
Micronaut Application Framework
Apache Pinot - A real-time distributed OLAP datastore
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...