Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 1,002 61 Updated Jan 7, 2025

PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba

Python 45 5 Updated Nov 14, 2024

A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments between Kalaallisut and English. Currently looking to…

Jupyter Notebook 2 1 Updated Mar 15, 2022

Unsupervised Language Model Pre-training for French

Python 247 32 Updated Apr 11, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,733 2,593 Updated Feb 6, 2025

Collections of CS PhD Application Fee Waivers of schools in North America

376 30 Updated Nov 27, 2023

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 474 48 Updated Feb 29, 2024
Jupyter Notebook 460 81 Updated Apr 4, 2021

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,264 761 Updated Sep 20, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,583 1,192 Updated Feb 11, 2025

Head tracking software for MS Windows, Linux, and Apple OSX

C++ 3,922 472 Updated Dec 26, 2024

Data augmentation for NLP, presented at EMNLP 2019

Python 1,620 316 Updated Mar 19, 2023

Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

Python 20 6 Updated Jun 25, 2022

Code for the ACL 2022 paper "Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning"

Python 36 3 Updated Dec 5, 2022

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

HTML 98 37 Updated Feb 2, 2023

Graph Convolutional Networks for Text Classification. AAAI 2019

Python 1,378 438 Updated Dec 29, 2021
Python 10 6 Updated Aug 2, 2023

A simple asynchronous multi-backend bot framework for Python

Python 147 13 Updated Feb 4, 2025

An Incremental Learning, Continual Learning, and Life-Long Learning Repository

568 50 Updated May 11, 2024

Awesome Incremental Learning

3,928 581 Updated Jan 2, 2025

Facebook Low Resource (FLoRes) MT Benchmark

Python 719 125 Updated Nov 20, 2023

painting-the-world, an image generator built on Minecraft RCON

Python 2 1 Updated Jun 3, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,293 6,413 Updated Dec 9, 2024

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,818 2,243 Updated Jan 8, 2025

Byte-Pair Encoding (BPE) (subword-based tokenization) algorithm implementations from scratch with Python

Python 12 Updated Jan 30, 2023

Kolmogorov Arnold Networks

Jupyter Notebook 15,377 1,446 Updated Jan 19, 2025

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 344 54 Updated Oct 1, 2024

🎂 your waifu, right on your desktop!

C# 91 7 Updated Aug 2, 2024