Skip to content
View shtosti's full-sized avatar

Highlights

  • Pro

Block or report shtosti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Aligning pretrained language models with instruction data generated by themselves.

Python 4,238 496 Updated Mar 27, 2023

Package to extract connotation frames

Jupyter Notebook 80 8 Updated Dec 19, 2023

Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models". ACL 2023. Best Paper Award.

Python 31 3 Updated Mar 7, 2024

Curated papers on Large Language Models in Healthcare and Medical domain

273 32 Updated Jan 13, 2025

Entry for the shared task at SwissText 2024 - Automatic Classification of the united nations’ sustainable development goals in scientific abstracts

Jupyter Notebook 1 Updated May 31, 2024

Repository for data and evaluation of 2024 Shared Task on SDG classification held by the Swiss Text Conference.

Python 5 2 Updated Jul 9, 2024

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Python 6,560 4,399 Updated Jan 13, 2025

Tutorial for surrogate gradient learning in spiking neural networks

Jupyter Notebook 299 78 Updated Jul 25, 2024

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 3 Updated Jun 30, 2023

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,016 2,104 Updated Jan 11, 2025

Python package of Tomoto, the Topic Modeling Tool

C++ 569 64 Updated Aug 7, 2024
HTML 1 Updated Sep 25, 2024

A Python wrapper around the topic modeling functions of MALLET.

Jupyter Notebook 100 17 Updated Nov 1, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,886 2,313 Updated Jan 14, 2025

Inference code for Llama models

Python 57,189 9,658 Updated Aug 18, 2024

An OpenAI-like LLaMA inference API

Python 113 9 Updated Sep 17, 2023

The package connects to Telegram's API to generate JSON files containing data for channels, including information and posts. It allows you to search for specific channels or a set of channels provi…

Python 345 60 Updated Aug 8, 2024

Tutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis

Jupyter Notebook 375 180 Updated Nov 3, 2023

A vocoder framework which had been widely used in research community since 1999.

MATLAB 178 44 Updated Dec 24, 2018

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,851 206 Updated May 20, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,929 3,401 Updated Jul 23, 2024

repo for "Natural Language Processing for Law and Social Science" @ ETH Zurich, Spring 2022

Jupyter Notebook 56 53 Updated Feb 18, 2024

An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.

Python 80 16 Updated May 21, 2024

Reading list for research topics in multimodal machine learning

6,201 859 Updated Aug 20, 2024

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,213 464 Updated Aug 7, 2024

Minimalist NMT for educational purposes

Python 683 213 Updated Jan 29, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,893 1,258 Updated Apr 7, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,448 248 Updated Apr 24, 2024

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…

Python 479 55 Updated Mar 20, 2023

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 52,377 5,085 Updated Jan 9, 2025
Next