- Mexico City
- https://www.linkedin.com/in/carlosleyson/
- @cleysonl
Stars
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Slides, scripts and materials for the Machine Learning in Finance Course at NYU Tandon, 2022
A native Rust library for Delta Lake, with bindings into Python
An Awesome List of Open-Source Data Engineering Projects
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured โฆ
An example of using uv in Docker images
Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.
Polars Cookbook, Published by Packt
This is a repo with links to everything you'd ever want to learn about data engineering
Running load tests on a FastAPI application using Locust
DuckDB is an analytical in-process SQL database management system
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Roadmap and Resource Compilation for System Design Fight Club
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Learn how to design systems at scale and prepare for system design interviews
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
A curated collection of approaches to designing large scale distributed systems.
Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
Robust recipes to align language models with human and AI preferences
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Ask Me Anything language model prompting
What's in your data? Extract schema, statistics and entities from datasets
๐ Online machine learning resources
๐ Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations
An API Client package to access the APIs for NBA.com
Leveraging BERT and c-TF-IDF to create easily interpretable topics.