-
SCSE, Beihang University
- Beijing
-
19:44
(UTC -12:00) - https://larry0454.github.io/
Stars
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
[Arxiv 2024] Adversarial attacks on multimodal agents
Fetch citations and abstracts of a Google Scholar paper and generate prompt for LLM
It's not a list of papers, but a list of paper reading lists...
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Align Anything: Training All-modality Model with Feedback
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
An easy-to-use reverse engineered text to image generation Python API wrapper for Lexica.art
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
Official repository of Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
PyTorch code and models for the DINOv2 self-supervised learning method.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
一个计算机视觉、机器学习与深度学习相关的项目,看课程的笔记还有自己做的程序
Official Code for DragGAN (SIGGRAPH 2023)
📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"