dimitryslavin
  • San Francisco, CA

Starred repositories

23 stars written in Java

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 11,049 2,004 Updated Jan 21, 2025

AI + Data, online. https://vespa.ai

Java 5,920 612 Updated Jan 22, 2025

A machine learning software for extracting information from scholarly documents

Java 3,726 461 Updated Jan 19, 2025

Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch

Java 1,492 369 Updated Nov 4, 2024

Fess is a very powerful and easily deployable Enterprise Search Server.

Java 1,013 168 Updated Jan 18, 2025

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 972 123 Updated Jan 22, 2025

Generic Data Ingestion & Dispersal Library for Hadoop

Java 478 111 Updated Mar 19, 2023

NLP Capabilities in Neo4j

Java 336 82 Updated May 5, 2021

Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)

Java 310 48 Updated Feb 4, 2022

Carrot2 plugin for ElasticSearch

Java 292 56 Updated Jan 2, 2023

An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP

Java 270 63 Updated Nov 5, 2022

GraphAware Framework Module for Integrating Neo4j with Elasticsearch

Java 261 57 Updated May 5, 2021
Java 184 60 Updated Nov 21, 2018

Elasticsearch plugin offering Neo4j integration for Personalized Search

Java 155 40 Updated May 5, 2021

SARL Agent-Oriented Programming Language http://www.sarl.io

Java 144 46 Updated Dec 15, 2024

The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation

Java 142 50 Updated Apr 14, 2023

A simple scoring plugin for vectors in Elasticsearch.

Java 69 24 Updated Apr 5, 2017

A machine learning plugin for Elasticsearch providing aggregations to compute multiple linear regression on search results in real-time for predictive analytics.

Java 64 21 Updated Oct 7, 2018

Lucene Auto Phrase TokenFilter implementation

Java 59 63 Updated Jul 11, 2018

A cookiecutter template for an elasticsearch ingest processor plugin

Java 47 21 Updated Aug 18, 2022

Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay

Java 36 17 Updated Sep 12, 2015

Elasticsearch plugin for Sentiment Analysis using Stanford CoreNLP

Java 11 3 Updated Oct 17, 2018