Stars
Free and Open Source, Distributed, RESTful Search Engine
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
📈 Capturing JVM- and application-level metrics. So you know what's going on.
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
A pure-Java Markdown processor based on a parboiled PEG parser supporting a number of extensions
Elegant parsing in Java and Scala - lightweight, easy-to-use, powerful.
Hollow is a Java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high-performance read-only access.
Contains the code used in the HBase: The Definitive Guide book.
Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
An open-source columnar data format designed for fast, realtime analytics on big data.
A lightweight platform monitoring tool for Java VMs
Eclipse Editor for the Swagger-OpenAPI Description Language
A Java library that manages component action/event bindings for MVC patterns
Riak data as input to, and output of, Hadoop MapReduce jobs
medcl / elasticsearch
Forked from elastic/elasticsearch
Open Source, Distributed, RESTful Search Engine
hammer / hbase-trunk-with-avro
Forked from kovyrin/hbase
Apache HBase with an Avro service definition and Java server implementation
INACTIVE - http://mzl.la/ghe-archive - Clustering worker for the grouperfish engine
x1B / grouperfish
Forked from mozilla-metrics/grouperfish
Text clustering service for the web
A small Solr plugin exposing basic geographical search functionality provided by lucene-spatial in Solr 1.4