sourjyadip

Follow

Sourjyadip Ray sourjyadip

Follow

NLP + CV

15 followers · 29 following

IIT Kharagpur
Kolkata

Achievements

Achievements

Highlights

Pro

Stars

156 results for source starred repositories

domoritz / domoritz.github.io

My homepage

JavaScript 39 51 Updated Dec 6, 2024

Kapi2910 / kapi2910.github.io

Personal Website

HTML 2 Updated Nov 10, 2023

nyquist / awesome-networking

Curated list of awesome computer networking resources

479 56 Updated Nov 22, 2023

0xFA11 / GameNetworkingResources

A Curated List of Multiplayer Game Network Programming Resources

C 7,539 482 Updated Sep 29, 2024

Computer-Vision-in-the-Wild / CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,223 58 Updated Mar 14, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,036 712 Updated Aug 12, 2024

Yui010206 / CREMA

☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion

Python 28 2 Updated Jun 14, 2024

lucidrains / mirasol-pytorch

Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch

Python 89 2 Updated Dec 22, 2023

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 35,679 4,434 Updated Nov 18, 2024

allenai / unified-io-2

Python 582 27 Updated Feb 15, 2024

rishikksh20 / ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Python 518 67 Updated Jun 21, 2021

robi56 / video-summarization-resources

Video Summarization Dataset, Papers, Codes

157 26 Updated Aug 17, 2018

minghu0830 / NurViD-benchmark

Python 18 1 Updated Jan 12, 2024

arjunakula / Visual-Discourse-Parsing

Discourse Processing in Videos https://arxiv.org/abs/1903.02252

Python 5 1 Updated Mar 26, 2019

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 20,379 1,538 Updated Dec 22, 2024

RenzeLou / awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Python 470 23 Updated Apr 4, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,922 506 Updated Dec 22, 2024

VideoAnalysis / EDUVSUM

EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.

Python 20 4 Updated Mar 8, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,147 3,113 Updated Dec 21, 2024

Atomic-man007 / Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…

280 18 Updated Aug 18, 2024

HaozheZhao / MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Python 338 15 Updated Dec 18, 2023

Lupin1998 / Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 312 16 Updated Oct 7, 2024

AnthonyMRios / pymetamap

Python wraper for MetaMap

Python 170 61 Updated Jul 22, 2020

cvdfoundation / kinetics-dataset

Shell 796 100 Updated May 15, 2024

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 11,593 11,356 Updated Dec 19, 2024

xwying / torchshow

Visualize PyTorch tensors with a single line of code.

Python 648 10 Updated Dec 21, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,204 419 Updated May 29, 2024

MCG-NJU / PointTAD

[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

Python 41 1 Updated Nov 24, 2023

atfortes / Awesome-Controllable-Diffusion

Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.

404 23 Updated Sep 29, 2024

atfortes / Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,130 123 Updated Dec 17, 2024