Skip to content
View sameeravithana's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report sameeravithana

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scientific multimodal instruction tuning with large language and vision models.

Python 3 1 Updated Jan 22, 2025

Synthetic data curation for post-training and structured data extraction

Python 937 64 Updated Mar 7, 2025

React + Next.js template for research websites (for PhD students, researchers, etc)

TypeScript 146 52 Updated Jan 12, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,045 55 Updated Mar 7, 2025

Data processing with ML, LLM and Vision LLM

Python 4,401 436 Updated Mar 7, 2025

A large-scale information-rich web dataset, featuring millions of real clicked query-document labels

316 17 Updated Dec 16, 2024

potato: portable text annotation tool

Jupyter Notebook 323 53 Updated Mar 7, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 26,932 5,490 Updated Feb 22, 2025

AICI: Prompts as (Wasm) Programs

Rust 2,004 83 Updated Jan 22, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,426 5,316 Updated Mar 7, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,276 169 Updated Mar 4, 2025

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

530 51 Updated Dec 31, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,529 2,289 Updated Mar 8, 2025

AgentSearch is a framework for powering search agents and enabling customizable local search.

Python 474 46 Updated Apr 22, 2024

library supporting NLP and CV research on scientific papers

Python 747 59 Updated Nov 8, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,306 600 Updated Feb 21, 2025

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 10,416 867 Updated Mar 8, 2025

AI Observability & Evaluation

Jupyter Notebook 4,982 364 Updated Mar 8, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,726 2,387 Updated Aug 12, 2024
Python 114 5 Updated Jul 8, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,160 909 Updated Mar 5, 2025

A curation of awesome tools, documents and projects about LLM Security.

1,098 121 Updated Mar 7, 2025

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Python 265 14 Updated Jun 12, 2024

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

765 24 Updated Jul 20, 2023

the LLM vulnerability scanner

Python 3,988 368 Updated Mar 6, 2025

LLM Prompt Injection Detector

TypeScript 1,199 95 Updated Aug 7, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,757 756 Updated May 31, 2024

Multi-tool for semantic search

Python 2,579 152 Updated Aug 27, 2024
Next