Skip to content
View shtosti's full-sized avatar

Highlights

  • Pro

Block or report shtosti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
16 stars written in Python
Clear filter

Inference code for Llama models

Python 57,201 9,657 Updated Aug 18, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,016 2,104 Updated Jan 11, 2025

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Python 6,561 4,401 Updated Jan 13, 2025

Aligning pretrained language models with instruction data generated by themselves.

Python 4,238 496 Updated Mar 27, 2023

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,448 248 Updated Apr 24, 2024

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,213 464 Updated Aug 7, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,852 206 Updated May 20, 2024

Minimalist NMT for educational purposes

Python 683 213 Updated Jan 29, 2024

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 479 55 Updated Mar 20, 2023

The package connects to Telegram's API to generate JSON files containing data for channels, including information and posts. It allows you to search for specific channels or a set of channels provi…

Python 345 60 Updated Aug 8, 2024

An OpenAI-like LLaMA inference API

Python 113 9 Updated Sep 17, 2023

An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.

Python 81 16 Updated May 21, 2024

Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models". ACL 2023. Best Paper Award.

Python 31 3 Updated Mar 7, 2024

Repository for data and evaluation of 2024 Shared Task on SDG classification held by the Swiss Text Conference.

Python 5 2 Updated Jul 9, 2024

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 3 Updated Jun 30, 2023