
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 942 43 Updated Nov 26, 2024

Data processing with ML, LLM and Vision LLM

Python 4,039 395 Updated Dec 18, 2024

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

312 17 Updated Dec 16, 2024

potato: portable text annotation tool

Jupyter Notebook 300 51 Updated Oct 23, 2024

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 23,500 4,031 Updated Dec 19, 2024

AICI: Prompts as (Wasm) Programs

Rust 1,972 78 Updated Nov 10, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,413 4,486 Updated Dec 19, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,109 156 Updated Dec 18, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

470 49 Updated Jul 10, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 15,249 1,789 Updated Dec 19, 2024

AgentSearch is a framework for powering search agents and enabling customizable local search.

Python 450 47 Updated Apr 22, 2024

library supporting NLP and CV research on scientific papers

Python 718 57 Updated Nov 8, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,081 575 Updated Apr 16, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,486 797 Updated Dec 19, 2024

AI Observability & Evaluation

Jupyter Notebook 4,327 319 Updated Dec 19, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,768 2,287 Updated Aug 12, 2024

Python 105 5 Updated Jul 8, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,151 838 Updated Dec 19, 2024

A curation of awesome tools, documents and projects about LLM Security.

987 99 Updated Nov 21, 2024

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Python 259 12 Updated Jun 12, 2024

Reading list of Instruction-tuning. A trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

757 24 Updated Jul 20, 2023

the LLM vulnerability scanner

Python 3,071 263 Updated Dec 19, 2024

LLM Prompt Injection Detector

TypeScript 1,144 82 Updated Aug 7, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,561 736 Updated May 31, 2024

Multi-tool for semantic search

Python 2,527 143 Updated Aug 27, 2024

Reimplementation of the task generation part from the Alpaca paper

Python 119 8 Updated Apr 4, 2023

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 983 74 Updated Aug 21, 2024

Tools for understanding how transformer predictions are built layer-by-layer

Python 446 48 Updated Jun 2, 2024