We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of multimodal foundation models (MFMs). This plug-and-play module can be easily integrated into …

Python 308 29 Updated Jan 26, 2025

sophia-ai-agent / sophia

JavaScript 111 16 Updated Feb 21, 2025

DCDmllm / HealthGPT

Official repo for the paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"

Python 351 48 Updated Mar 8, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,689 355 Updated Mar 2, 2025

zhiyuancui / Creative-AIGC-Suite

AIGC Creative Suite

Go 202 30 Updated Feb 18, 2025

hku-mars / FAST-LIVO2

FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry

C++ 1,912 254 Updated Mar 5, 2025

SSSYDYSSS / TransProPy

A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classification and attribute them accordingly.

Python 220 29 Updated Dec 11, 2024

AsterCass / Tomoyo

Tomoyo is a Kotlin Compose Multiplatform sample app that demonstrates common functionality such as navigation, sockets (for chat), video, audio, and a database.

Kotlin 62 8 Updated Feb 20, 2025

yuechenedu / EduLearn

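The 辅学院 enterprise training system is a full-featured enterprise training platform covering video on demand, training, exams, in-person instruction, and reporting. The open-source edition is an online learning system implemented as a streamlined version of the enterprise edition. It aims to provide an online training system, employee training platform, internal corporate training system, online education system, and open-source training system suitable for every industry.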

JavaScript 293 53 Updated Feb 21, 2025

bird-bench / BIRD-CRITIC-1

BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?

Python 391 29 Updated Feb 20, 2025

bcmi / Awesome-Object-Insertion

A curated list of papers, code, and resources pertaining to image composition/compositing or object insertion/addition/compositing, which aims to generate realistic composite images.

494 81 Updated Feb 2, 2025

adonis-dym / memory_reduced_optimizer

Python 462 49 Updated Feb 1, 2025

FellouAI / eko

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 2,718 175 Updated Mar 8, 2025

facebookresearch / uco3d

Uncommon Objects in 3D dataset

Python 1,110 170 Updated Mar 8, 2025

om-ai-lab / OmAgent

Build multimodal language agents for fast prototyping and production

Python 2,184 230 Updated Mar 4, 2025

gdswcxzljj / ai_paper

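🔥🔥🔥 AI paper generation and AI paper writing: one-click paper generation; AI-written graduation theses, thesis proposals, literature reviews, and course papers; AI-written reports and plans; AIGC-rate reduction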

292 8 Updated Mar 4, 2025

SSSYDYSSS / MetaTrx

MetaTrx: Comprehensive Cross-Species Transcriptome Analysis

R 110 6 Updated Jun 4, 2024

jinchengyang98 / Re-ccscaner

Go 446 50 Updated Dec 14, 2024

microsoft / VidTok

A family of versatile and state-of-the-art video tokenizers.

Python 350 19 Updated Jan 15, 2025

fanjunkai1 / DCL

[AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

Python 122 7 Updated Feb 16, 2025

alinGmail / LiveMock

LiveMock is a comprehensive tool for API development and testing, offering mock data, request proxying, and logging, to streamline workflows and track traffic.

TypeScript 456 70 Updated Dec 6, 2024

Docta-ai / docta

A Doctor for your data

Python 2,742 207 Updated Jan 14, 2025

Everlyn-Labs / Everlyn-1

The first open autoregressive foundational video AI model.

2,875 487 Updated Oct 14, 2024

fudan-generative-vision / hallo2

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,502 505 Updated Feb 27, 2025

sunvim / mq

Embedded message queue for Golang

Go 63 19 Updated Oct 20, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,579 1,774 Updated Mar 8, 2025

hengwei-chan / AttentionDTA

Python 7 Updated Aug 10, 2021

Kiddos Wyndham Wyndhan

Stars

Wiselnn570 / VideoRoPE

ai-decentralized / BloomBee

360CVGroup / FancyVideo

xid32 / NAACL_2025_TWM