Skip to content
View lizhaoliu-Lec's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Tencent
  • Shenzhen/China

Block or report lizhaoliu-Lec

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3 Updated Feb 17, 2025

AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs

Python 88 9 Updated Aug 16, 2024

Robust and Efficient Occupancy Prediction

17 Updated Jan 3, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,718 416 Updated Jan 22, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,279 1,466 Updated Dec 25, 2024

Enable macOS HiDPI and have a native setting.

Shell 9,364 1,035 Updated Jul 3, 2024

official implementation for ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"

Python 27 2 Updated Sep 25, 2024
11 Updated Jul 16, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,134 38 Updated Oct 22, 2024

This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR2024).

Python 40 2 Updated Aug 12, 2024

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

1,479 89 Updated Feb 14, 2025

Grok open release

Python 50,187 8,365 Updated Aug 30, 2024
Jupyter Notebook 378 47 Updated Dec 5, 2023

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,243 510 Updated May 3, 2024

SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation

Python 117 8 Updated Jan 12, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 771 45 Updated Jul 29, 2024

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Python 200 11 Updated Sep 7, 2023

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 831 41 Updated Nov 23, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 11,182 1,597 Updated Jan 19, 2025
1 Updated Jan 19, 2024

RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2

Python 262 52 Updated Aug 20, 2024

Dual Regression Compression for SR Models

Python 5 1 Updated Jan 8, 2024
Python 16 Updated Dec 13, 2023

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Python 267 18 Updated Jun 7, 2023
Jupyter Notebook 769 71 Updated Aug 7, 2024

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Python 522 27 Updated Jun 11, 2024
Python 390 14 Updated Jul 29, 2024
Next