Starred repositories
Tesseract Open Source OCR Engine (main repository)
GoogleTest - Google Testing and Mocking Framework
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Cloud-native high-performance edge/middle/service proxy
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Unsupervised text tokenizer for Neural Network-based text generation.
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…
Rapid fuzzy string matching in Python using various string metrics
A lightning fast Finite State machine and REgular expression manipulation library.
A deep learning package for many-body potential energy representation and molecular dynamics
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
2021/3/30 ~ 2021/7/12 に行われる企画「競プロ典型 90 問」の問題・解説・ソースコードなどの資料をアップロードしています。
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
PROPhet is a code to integrate machine learning techniques with first-principles quantum chemistry approaches