- Somewhere between South Africa and somewhere else
- http://www.vima.co.za
- https://orcid.org/0000-0002-6731-6267
- @vukosi
- in/vukosimarivate
- https://linktr.ee/vukosim
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Parseltongue is a powerful prompt hacking tool/browser extension for real-time tokenization visualization and seamless text conversion, supporting formats like binary, base64, leetspeak, special ch…
How are topics encoded in semantic space? Repository to accompany PNAS article: https://www.pnas.org/doi/10.1073/pnas.2108801119
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
Distribute and run LLMs with a single file.
SERENGETI: Massively Multilingual Language Models for Africa
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
StableLM: Stability AI Language Models
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Download pictures (or videos) along with their captions and other metadata from Instagram.
🤖 Top-rated tools to scrape all major public sections from Facebook, Instagram, and Twitter (X) including posts (likes/comments), photos/videos, contact information, followers, following and much m…
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
A library for efficient similarity search and clustering of dense vectors.
RootMyTV is a user-friendly exploit for rooting/jailbreaking LG webOS smart TVs.
Embedding Evaluation Data for South African Languages
Repo containing code for Towards Data Science articles
Draw pretty maps from OpenStreetMap data! Built with osmnx +matplotlib + shapely
Visual analysis and diagnostic tools to facilitate machine learning model selection.
A delightful community-driven framework for managing your bash configuration, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.
WebRTC/RTSP/RTMP/LL-HLS bridge for Wyze cams in a docker container