Skip to content
View shyyhs's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Organizations

@NLPforCOVID-19

Block or report shyyhs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric Evaluation of Machine Translation with a Densely Annotated P…

Python 75 9 Updated Sep 21, 2023

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,017 475 Updated May 3, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 22 3 Updated Dec 6, 2024

A library for minimum Bayes risk (MBR) decoding

Python 31 4 Updated Dec 15, 2024

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Python 535 17 Updated Dec 4, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,676 870 Updated Oct 3, 2024
C++ 799 111 Updated May 24, 2023

Compact Language Detector 2

C++ 845 130 Updated May 22, 2021
Python 9 Updated Nov 6, 2024

Streamlit — A faster way to build and share data apps.

Python 36,195 3,122 Updated Dec 15, 2024

String-to-String Algorithms for Natural Language Processing

Jupyter Notebook 540 29 Updated Jul 26, 2024

Go ahead and axolotl questions

Python 8,089 892 Updated Dec 13, 2024

Translation models for 22 scheduled languages of India

Python 242 66 Updated Oct 17, 2024
Python 10 Updated Apr 2, 2024
Jupyter Notebook 9,410 648 Updated Jul 29, 2024

The FLORES+ Machine Translation Benchmark

99 15 Updated Nov 12, 2024

📋 A list of open LLMs available for commercial use.

11,324 753 Updated Jul 5, 2024

日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark

26 2 Updated Feb 27, 2024

PyTorch native finetuning library

Python 4,479 456 Updated Dec 15, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,737 1,193 Updated Dec 12, 2024

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Scala 615 123 Updated Mar 10, 2024

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,117 576 Updated Sep 23, 2024

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,240 92 Updated Nov 29, 2024

Tools for merging pretrained large language models.

Python 4,952 458 Updated Dec 15, 2024

A repo for resources for our EAMT 2024 tutorial

6 Updated Jun 30, 2024

The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"

49 1 Updated Oct 7, 2024

Editing Models with Task Arithmetic

Python 436 38 Updated Jan 11, 2024
Next