Skip to content
View atamborrino's full-sized avatar

Block or report atamborrino

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal keyword extraction with BERT

Python 3,689 363 Updated Feb 11, 2025

Segment documents into coherent parts using word embeddings.

Jupyter Notebook 149 29 Updated Mar 6, 2022

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,233 401 Updated Nov 18, 2024

A multilingual version of MS MARCO passage ranking dataset

Python 143 9 Updated Oct 19, 2023
Jupyter Notebook 54 10 Updated Apr 10, 2022

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,095 775 Updated Jan 22, 2025

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

Python 41 9 Updated Oct 30, 2020

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 19,143 2,045 Updated Feb 12, 2025

A collection of Magnolia add-on modules

Scala 173 26 Updated Feb 12, 2025

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 683 101 Updated Feb 3, 2025

gentle forced aligner

Python 1,509 299 Updated Apr 25, 2024

Manipulate audio with a simple and easy high level interface

Python 9,173 1,071 Updated Jul 25, 2024

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Python 956 246 Updated Feb 9, 2025

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

Python 488 78 Updated Jul 11, 2023

Automatic differentiation with weighted finite-state transducers.

C++ 454 38 Updated Sep 20, 2021

Fast Block Sparse Matrices for Pytorch

C++ 546 35 Updated Jan 21, 2021

Python bindings for FFmpeg - with complex filtering support

Python 10,292 898 Updated Aug 4, 2024

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,521 355 Updated Dec 20, 2024

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,646 351 Updated Feb 11, 2025

AIStore: scalable storage for AI applications

Go 1,384 192 Updated Feb 12, 2025

PyTorch elastic training

Python 730 100 Updated Jun 15, 2022

Resources for the MRQA 2019 Shared Task

Python 292 30 Updated Aug 5, 2021

Simple text to phones converter for multiple languages

Python 1,321 181 Updated Sep 26, 2024

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 833 158 Updated Oct 10, 2023

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,023 206 Updated Jan 9, 2024

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 195 21 Updated Jul 3, 2020
Python 1,618 317 Updated Jul 20, 2023

A resource to create a multi domain Dialog Act Tagger for conversational agents using publicly available data

HTML 51 11 Updated Sep 6, 2021

BLEURT is a metric for Natural Language Generation based on transfer learning.

Python 714 85 Updated Aug 4, 2023
Next