Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 51 Updated Dec 20, 2024

Automating the Search for Artificial Life with Foundation Models!

Jupyter Notebook 315 27 Updated Jan 12, 2025

An implementation of the transformer architecture onto an Nvidia CUDA kernel

Cuda 167 10 Updated Sep 24, 2023

ICDE2023-CLDG: Contrastive Learning on Dynamic Graphs

Python 22 Updated Dec 18, 2024

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Python 1,400 229 Updated Jan 3, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,773 5,558 Updated Aug 14, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,064 2,318 Updated Aug 12, 2024

A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)

3,419 401 Updated Jan 7, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,541 573 Updated Jan 11, 2025

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

Jupyter Notebook 816 194 Updated Jan 18, 2024

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

Jupyter Notebook 5,391 666 Updated Dec 19, 2024

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Python 5,608 1,155 Updated May 27, 2024

Pretrained model hub for Keras 3.

Python 828 245 Updated Jan 8, 2025

A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.

2,586 250 Updated Aug 8, 2024

The TensorFlow-specific implementation of the Keras API, which was the default Keras from 2019 to 2023.

Python 66 33 Updated Jan 8, 2025

Keras documentation, hosted live at keras.io

Jupyter Notebook 2,808 2,057 Updated Jan 9, 2025

Making text a first-class citizen in TensorFlow.

C++ 1,241 346 Updated Jan 11, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,266 1,025 Updated Sep 26, 2024

Reference implementation of Megalodon 7B model

Cuda 512 54 Updated Apr 18, 2024

Open weights language model from Google DeepMind, based on Griffin.

Python 615 28 Updated Jul 9, 2024

Mamba SSM architecture

Python 13,767 1,181 Updated Jan 6, 2025

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,014 773 Updated Jun 24, 2024

clone from https://github.com/karpathy/nanoGPT.git

Python 1 1 Updated Mar 26, 2023

Source codes of Discovering Modern C++

C++ 207 85 Updated Nov 12, 2017

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Python 1,307 160 Updated Nov 21, 2024

Pico TensorFlow Lite Port

C++ 654 99 Updated Dec 27, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Python 3,717 279 Updated Nov 25, 2024

Network traffic in Python.

Jupyter Notebook 25 5 Updated Mar 14, 2023