
Large Concept Models: Language modeling in a sentence representation space

Python 1,793 149 Updated Jan 23, 2025
Python 12 1 Updated May 16, 2024

PyTorch native post-training library

Python 4,751 499 Updated Jan 25, 2025

PyTorch per-step fault tolerance (actively under development)

Python 223 17 Updated Jan 25, 2025

Bringing BERT into modernity via both architecture changes and scaling

Python 1,103 64 Updated Jan 21, 2025
Jupyter Notebook 8 8 Updated Jan 8, 2025

Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".

Python 16 7 Updated Nov 21, 2024
Python 8 Updated Dec 8, 2024
Python 1 Updated Nov 28, 2024

[SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval

Python 30 1 Updated Oct 18, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 7,724 749 Updated Jan 25, 2025

A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index build, search, and distributed online serving toolkits for large scale vector search sc…

C++ 4,857 580 Updated Jan 7, 2025

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,347 896 Updated Jan 17, 2025

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 818 88 Updated May 3, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 138,003 27,665 Updated Jan 24, 2025
Python 26 1 Updated May 27, 2024

Official repository of the xLSTM.

Python 1,650 120 Updated Jan 14, 2025

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 4,268 375 Updated Jan 22, 2025

An Efficient Transfer Learning Based Configuration Adviser for Database Tuning

Python 6 Updated Apr 2, 2024

This is the GitHub repo for our CoLM 2024 paper "Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness".

Jupyter Notebook 7 1 Updated Jan 22, 2025

PyTorch Extension Library of Optimized Scatter Operations

Python 1,591 182 Updated Jan 10, 2025

XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval

Jupyter Notebook 40 3 Updated Jun 20, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,845 3,420 Updated Jan 22, 2025
Python 213 19 Updated Jun 11, 2024

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)

Jupyter Notebook 311 14 Updated Jan 22, 2025

Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.

Python 1,263 86 Updated Jan 25, 2025

[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624

Python 604 75 Updated Jun 15, 2022

CLIR version of ColBERT

Python 67 12 Updated Sep 26, 2024

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

TypeScript 2,878 373 Updated Aug 21, 2024