Stars
跨专业补计算机基础知识(Physics --> Computer science)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
AI人工智能、深度学习领域,2025年全网最全即插即用模块,包含各种卷积变种、最新注意力机制、特征融合模块、上下采样模块,持续更新中......
A benchmark fault diagnosis dataset comprises vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Task oriented AI agent framework for digital workers and vertical AI agents
[CVPR 2024] Generalizable Tumor Synthesis - Realistic Synthetic Tumors in Liver, Pancreas, and Kidney
PyTorch Tutorial for Deep Learning Researchers
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
pengwei-iie / llama_bugs
Forked from meta-llama/llamaInference code for LLaMA models
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
A fast medical imaging analysis library in Python with algorithms for registration, segmentation, and more.
Official repository of Agent Attention (ECCV2024)
The official implementation of "EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa"
[ACL'19] [PyTorch] Multimodal Transformer
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
A baseline for Weibo User Depression Detection Dataset (WU3D)
deep learning for image processing including classification and object-detection etc.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Extending MaxViT (Multi-Axis Vision Transformer) to 3D Space
Curated list of project-based tutorials