Sina Semnani s-jse

PhD student @ Stanford NLP

23 followers · 1 following

Stanford
SF Bay Area
@sina_semnani

Stars

DS4SD / docling

Get your documents ready for gen AI

Python 14,389 716 Updated Dec 13, 2024

milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications

Go 31,293 2,967 Updated Dec 14, 2024

DS4SD / deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

Python 140 21 Updated Dec 9, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,885 4,847 Updated Dec 14, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 6,515 578 Updated Dec 14, 2024

ej0cl6 / TextEE

A standardized, fair, and reproducible benchmark for evaluating event extraction approaches

Python 45 12 Updated Jul 2, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 32,955 3,581 Updated Dec 3, 2024

lkiesow / python-feedgen

Python module to generate ATOM feeds, RSS feeds and Podcasts.

Python 742 124 Updated Jul 4, 2024

idiap / News-Media-Reliability

Reliability Estimation of News Media Sources: Birds of a Feather Flock Together

HTML 5 Updated Oct 31, 2024

amosjyng / langchain-visualizer

Visualization and debugging tool for LangChain workflows

Python 724 52 Updated Mar 6, 2024

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 7,929 583 Updated Dec 6, 2024

huggingface / llm-vscode

LLM powered development for VSCode

TypeScript 1,243 134 Updated Jul 17, 2024

wbolster / plyvel

Plyvel, a fast and feature-rich Python interface to LevelDB

Cython 536 75 Updated May 15, 2024

AndyTheFactory / newspaper4k

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

HTML 521 53 Updated Jun 5, 2024

AutoGPTQ / AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,560 492 Updated Nov 27, 2024

tatsu-lab / gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Python 492 120 Updated Mar 26, 2024

NathanPB / progress.js

Library for creating highly customizable CLI-like progress bars in javascript

TypeScript 87 1 Updated Sep 1, 2023

spencermountain / dumpster-dip

parse a wikipedia dump into tiny files

JavaScript 7 9 Updated Dec 28, 2023

spencermountain / dumpster-dive

roll a wikipedia dump into mongo

JavaScript 242 45 Updated Jul 1, 2024

eth-sri / lmql

A language for constraint-guided and efficient LLM programming.

Python 3,728 203 Updated Jun 3, 2024

adbar / trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,727 268 Updated Dec 11, 2024

erikbern / ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,011 749 Updated Oct 29, 2024

huggingface / text-embeddings-inference

A blazing fast inference solution for text embeddings models

Rust 2,932 191 Updated Dec 13, 2024

gabriben / awesome-generative-information-retrieval

630 49 Updated Oct 15, 2024

HITsz-TMG / awesome-llm-attributions

A Survey of Attributions for Large Language Models

174 8 Updated Aug 24, 2024

THU-KEG / EvaluationPapers4ChatGPT

Resource, Evaluation and Detection Papers for ChatGPT

455 24 Updated Mar 21, 2024

Tongji-KGLLM / RAG-Survey

1,879 124 Updated May 8, 2024

HillZhang1999 / llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

955 52 Updated Nov 21, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,015 474 Updated May 3, 2024

naver / splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 795 86 Updated May 3, 2024