Skip to content
View JESUSC1's full-sized avatar

Block or report JESUSC1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 19,858 1,616 Updated Feb 23, 2025

The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"

Python 388 18 Updated Mar 10, 2025
Jupyter Notebook 4,256 824 Updated Mar 10, 2025

ICML 2024 "From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation"

4 Updated Oct 13, 2024

We write your reusable computer vision tools. 💜

Python 26,132 1,972 Updated Mar 10, 2025

Behavioral probing of language acquisition models at the lexical and syntactic level

Python 15 1 Updated Jul 17, 2023

Pytorch code of for our CVPR 2018 paper "Neural Baby Talk"

Python 524 123 Updated Mar 27, 2019

Scrape data from Sephora and do product Analysis

HTML 25 10 Updated Aug 18, 2020

A web scraper that gets product names, brands, formatted ingredients, images, and available sizes from Sephora's makeup category, and inserts them into a relational DB with several foreign key rela…

Ruby 5 Updated Jul 8, 2023

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,574 766 Updated Aug 12, 2024

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 23,360 6,394 Updated Jun 7, 2024

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 7,238 1,143 Updated Feb 24, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 19,159 2,213 Updated Mar 7, 2025

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…

Python 24,330 4,637 Updated Oct 15, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,294 4,285 Updated Mar 10, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 954 60 Updated Feb 25, 2025

Code for paper, 'Extracting Entities of Interest from Comparative Product Reviews', CIKM'17

Python 6 1 Updated Sep 13, 2018

A curated list of awesome LLVM (including Clang, etc) related resources.

Python 593 44 Updated Nov 25, 2024

Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics

Scala 39 24 Updated Feb 13, 2016

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,226 1,362 Updated Mar 5, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,002 1,998 Updated Sep 26, 2024

PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhan…

Python 14 Updated Apr 3, 2024

Visual Speech Recognition for Multiple Languages

Python 388 60 Updated Aug 17, 2023

[CVPR2020] "Detecting Attended Visual Targets in Video"

Python 189 49 Updated May 24, 2021

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025)

Python 513 61 Updated Mar 7, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,143 494 Updated Feb 26, 2025

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's …

Jupyter Notebook 2,626 215 Updated Mar 5, 2025

Lightning ⚡️ fast forecasting with statistical and econometric models.

Python 4,178 302 Updated Mar 3, 2025

👕 Open-source course on architecting, building and deploying a real-time personalized recommender for H&M fashion articles.

Jupyter Notebook 276 55 Updated Mar 6, 2025
Next