Lists (1)
Sort Name ascending (A-Z)
Stars
Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
Automating the Search for Artificial Life with Foundation Models!
An implementation of the transformer architecture onto an Nvidia CUDA kernel
ICDE2023-CLDG: Contrastive Learning on Dynamic Graphs
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
Code for the paper "Language Models are Unsupervised Multitask Learners"
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."
Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.
The TensorFlow-specific implementation of the Keras API, which was the default Keras from 2019 to 2023.
Keras documentation, hosted live at keras.io
Making text a first-class citizen in TensorFlow.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Reference implementation of Megalodon 7B model
Open weights language model from Google DeepMind, based on Griffin.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
clone from https://github.com/karpathy/nanoGPT.git
Source codes of Discovering Modern C++
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""