Skip to content
View sourjyadip's full-sized avatar
  • IIT Kharagpur
  • Kolkata

Highlights

  • Pro

Block or report sourjyadip

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
156 results for source starred repositories
Clear filter

My homepage

JavaScript 39 51 Updated Dec 6, 2024

Personal Website

HTML 2 Updated Nov 10, 2023

Curated list of awesome computer networking resources

479 56 Updated Nov 22, 2023

A Curated List of Multiplayer Game Network Programming Resources

C 7,539 482 Updated Sep 29, 2024

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,223 58 Updated Mar 14, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,036 712 Updated Aug 12, 2024

☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion

Python 28 2 Updated Jun 14, 2024

Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch

Python 89 2 Updated Dec 22, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 35,679 4,434 Updated Nov 18, 2024
Python 582 27 Updated Feb 15, 2024

Implementation of ViViT: A Video Vision Transformer

Python 518 67 Updated Jun 21, 2021

Video Summarization Dataset, Papers, Codes

157 26 Updated Aug 17, 2018
Python 18 1 Updated Jan 12, 2024

Discourse Processing in Videos https://arxiv.org/abs/1903.02252

Python 5 1 Updated Mar 26, 2019

DSPy: The framework for programming—not prompting—language models

Python 20,379 1,538 Updated Dec 22, 2024

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Python 470 23 Updated Apr 4, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,922 506 Updated Dec 22, 2024

EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important temporal segments in educational videos.

Python 20 4 Updated Mar 8, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,147 3,113 Updated Dec 21, 2024

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…

280 18 Updated Aug 18, 2024

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Python 338 15 Updated Dec 18, 2023

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 312 16 Updated Oct 7, 2024

Python wraper for MetaMap

Python 170 61 Updated Jul 22, 2020

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 11,593 11,356 Updated Dec 19, 2024

Visualize PyTorch tensors with a single line of code.

Python 648 10 Updated Dec 21, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,204 419 Updated May 29, 2024

[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

Python 41 1 Updated Nov 24, 2023

Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.

404 23 Updated Sep 29, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,130 123 Updated Dec 17, 2024
Next