Skip to content
View tmoroney's full-sized avatar
🙃
🙃

Highlights

  • Pro

Block or report tmoroney

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 653 67 Updated Dec 11, 2024

Audio transcription using mlx whisper and vad silence processing

Python 12 1 Updated Oct 14, 2024

Implementation of the Descript Audio Codec in MLX

Python 8 1 Updated Oct 28, 2024
Python 344 25 Updated Nov 5, 2024
Python 83 7 Updated Jul 30, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 251 15 Updated Aug 11, 2024

TTS with kokoro and onnx runtime

Python 1,340 113 Updated Jan 29, 2025

Super simple MLX (apple silicon) CLIP based photo similarity web app

Python 465 35 Updated May 17, 2024

MLX implementation of xLSTM model by Beck et al. (2024)

Python 26 2 Updated Jun 5, 2024

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.

Python 93 7 Updated Oct 17, 2024
Python 740 64 Updated Jan 24, 2025

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Python 107 6 Updated Sep 25, 2024

A fast multimodal LLM for real-time voice

Python 3,310 213 Updated Jan 22, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 797 52 Updated Jan 22, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,879 472 Updated Dec 26, 2024
Python 114 8 Updated Jul 12, 2024

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 575 42 Updated Jan 29, 2025

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 25,438 627 Updated Jan 29, 2025

mlx image models for Apple Silicon machines

Jupyter Notebook 70 4 Updated Nov 24, 2024

Efficient framework-agnostic data loading

C++ 392 43 Updated Jan 25, 2025

Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022

Python 148 11 Updated Nov 30, 2022

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…

Python 241 13 Updated Jan 15, 2025

MLX support for the Open Neural Network Exchange (ONNX)

43 5 Updated Feb 21, 2024

Large Concept Models: Language modeling in a sentence representation space

Python 1,813 152 Updated Jan 29, 2025

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

Jupyter Notebook 232 17 Updated Oct 31, 2024

Blazing fast whisper turbo for ASR (speech-to-text) tasks

Python 186 9 Updated Oct 20, 2024

MLX: An array framework for Apple silicon

C++ 18,660 1,070 Updated Jan 29, 2025

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,525 130 Updated Jan 27, 2025
Next