View vukosim's full-sized avatar

Highlights

  • Pro

Organizations

@dsfsi

Starred repositories

Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, supporting formats like binary, base64, leetspeak, special ch…

TypeScript 435 32 Updated Jan 11, 2025

How are topics encoded in semantic space? Repository to accompany PNAS article: https://www.pnas.org/doi/10.1073/pnas.2108801119

Jupyter Notebook 27 9 Updated Jun 18, 2023

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,659 430 Updated Jan 24, 2025

A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.

102 21 Updated Apr 26, 2024

Distribute and run LLMs with a single file.

C++ 21,875 1,149 Updated Mar 4, 2025

SERENGETI: Massively Multilingual Language Models for Africa

Jupyter Notebook 15 1 Updated Oct 26, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,213 798 Updated Mar 4, 2025

StableLM: Stability AI Language Models

Jupyter Notebook 15,835 1,034 Updated Apr 8, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,821 379 Updated Mar 14, 2024

S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/

Python 881 67 Updated Apr 26, 2024

MinT: Minimal Transformer Library and Tutorials

Python 253 14 Updated Jul 26, 2022

Run a conference from your backyard.

JavaScript 535 109 Updated Jun 25, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,093 2,112 Updated Mar 4, 2025

Download pictures (or videos) along with their captions and other metadata from Instagram.

Python 9,458 1,250 Updated Jan 29, 2025

🤖 Top-rated tools to scrape all major public sections from Facebook, Instagram, and Twitter (X) including posts (likes/comments), photos/videos, contact information, followers, following and much m…

2,993 743 Updated Jan 30, 2025

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 2,015 171 Updated Aug 15, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 33,423 3,778 Updated Mar 3, 2025

RootMyTV is a user-friendly exploit for rooting/jailbreaking LG webOS smart TVs.

HTML 2,307 65 Updated Jan 5, 2025

Embedding Evaluation Data for South African Languages

1 Updated Oct 26, 2023

Repo containing code for Towards Data Science articles

HTML 29 21 Updated Aug 13, 2020

Draw pretty maps from OpenStreetMap data! Built with osmnx + matplotlib + shapely

Jupyter Notebook 11,551 546 Updated Mar 4, 2025

Data Lineage Tracking And Visualization Solution

Scala 613 156 Updated Mar 4, 2025

Visual analysis and diagnostic tools to facilitate machine learning model selection.

Python 4,316 562 Updated Feb 19, 2025

A delightful community-driven framework for managing your bash configuration, and an auto-update tool that makes it easy to keep up with the latest updates from the community.

Shell 6,345 698 Updated Feb 19, 2025

WebRTC/RTSP/RTMP/LL-HLS bridge for Wyze cams in a docker container

Python 2,847 197 Updated Feb 8, 2025

A collective list of free APIs

Python 328,996 34,880 Updated Oct 31, 2024

Must-read Papers on pre-trained language models.

3,352 439 Updated Nov 6, 2022

Text autoencoder with LSTMs

Python 262 93 Updated May 27, 2019

Backup and Migrate IMAP Email Accounts

Ruby 1,490 76 Updated Nov 15, 2024