Starred repositories
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…
Pytorch0.4.1 codes for InsightFace
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
SPIGA: Shape Preserving Facial Landmarks with Graph Attention Networks.
MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;
Research and Materials on Hardware implementation of Transformer Model
The official implementation of "Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting"
Multi-class semantic segmentation performed on "Semantic Drone Dataset."
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
Simple PyTorch U-Net for semantic segmentation of fish images.
A face detector model for both real-life and comic images based on RetinaFace model.
jerichosu / MoViNet-pytorch
Forked from Atze00/MoViNet-pytorchMoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;
Used Generative Adversarial Networks (GANs) to generate new images of human faces.