Stars
We want to create a repo to illustrate the usage of transformers in Chinese
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; developed in Python 3 and ready to use out of the box.
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
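text2vec, text to vector. A text vector representation tool that converts text into vector matrices; implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and is ready to use out of the box.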
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor.
PointRend for instance segmentation on TensorFlow
PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), ResNetV2, EfficientNetV2, NeRF, SegFormer, MixTransformer, (pla…
This is the source code for segformer-pytorch, which can be used to train your own models.
EasyPortrait - Face Parsing and Portrait Segmentation Dataset
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
FaRL for Facial Representation Learning [Official, CVPR 2022]
PyTorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"
Use lama cleaner before inpainting inside stable-diffusion-webui
DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Speechless at the original stable-diffusion