- San Francisco, CA
Highlights
- Pro
Starred repositories
OpenRefine is a free, open source power tool for working with messy data and improving it
A machine learning software for extracting information from scholarly documents
Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
Fess is very powerful and easily deployable Enterprise Search Server.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Generic Data Ingestion & Dispersal Library for Hadoop
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP
GraphAware Framework Module for Integrating Neo4j with Elasticsearch
Elasticsearch plugin offering Neo4j integration for Personalized Search
SARL Agent-Oriented Programming Language http://www.sarl.io
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
A simple scoring plugin for vector in Elasticsearch.
A machine learning plugin for Elasticsearch providing aggregations to compute multiple linear regression on search results in real-time for predictive analytics.
Lucene Auto Phrase TokenFilter implementation
A cookiecutter template for an elasticsearch ingest processor plugin
Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay
Elasticsearch plugin for Sentiment Analysis using Stanford CoreNLP