Stars
A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
ClickHouse® is a real-time analytics DBMS
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
A technical report on convolution arithmetic in the context of deep learning
jgrapht / jgrapht
Forked from lingeringsocket/jgrapht
Master repository for the JGraphT project
Unsupervised text tokenizer for Neural Network-based text generation.
PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)
Java library and command-line application for converting Scikit-Learn pipelines to PMML
Python package for Korean natural language processing.
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
OpenResty Lua client for Redis Cluster.
A library for efficient similarity search and clustering of dense vectors.
Lucene based secondary indexes for Cassandra
TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
SQL powered operating system instrumentation, monitoring, and analytics.
Fastsocket is a highly scalable socket and underlying networking implementation for the Linux kernel. With straight linear scalability, Fastsocket can provide extremely good performance in multi…
Mcrouter is a memcached protocol router for scaling memcached deployments.