-
Hvar Consulting
- Tatuapé - São Paulo - Brasil
-
11:02
(UTC -03:00) - valeriocardoso.github.io
- in/valeriocardoso
Lists (32)
Sort Name ascending (A-Z)
Agent LLMs
airflow
asr
awesone
backlog
causal machine learning
Certification
Courses
cyber security
devops
diffusion models
Interpretability ML
interview
kaggle
mlops
🚀 My stack
NERF
newsletter
NLP
Productivity
Projects
Reinforcement Learning
staff
survival-analysis
system design
tech leader
time series
tools
TTS
ui llm
video analytics
vision
Starred repositories
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
🚀 Fast and simple Node.js version manager, built in Rust
End-to-end Generative Optimization for AI Agents
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
A course on aligning smol models.
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
This repository holds the whole code structure and documentation for a ML application focused on Demand Forecasting using GCP and Kedro Framework as main features.
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Simple, unified interface to multiple Generative AI providers
Examples and guides for using the Gemini API
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
High quality resources & applications for LLMs, multi-modal models and VectorDBs
VPTQ, A Flexible and Extreme low-bit quantization algorithm
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
first base model for full-duplex conversational audio
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Robust recipes to align language models with human and AI preferences
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
A library for advanced large language model reasoning