rumanxyz

👋

Hi there, thanks for stopping by my profile

Ruman rumanxyz

👋

Hi there, thanks for stopping by my profile

Computer Vision and NLP

6 followers · 6 following

India
https://rumn.medium.com/

Achievements

Lists (20)

Sort

Stars

qubvel-org / segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 10,011 1,704 Updated Feb 7, 2025

aleju / imgaug

Image augmentation for machine learning experiments.

Python 14,502 2,451 Updated Jul 30, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,931 1,407 Updated Dec 25, 2024

vikhyat / moondream

tiny vision language model

Python 7,253 565 Updated Feb 7, 2025

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,196 7,982 Updated Feb 6, 2025

MultimediaTechLab / YOLO

An MIT License of YOLOv9, YOLOv7, YOLO-RD

Python 992 121 Updated Feb 6, 2025

browser-use / browser-use

Make websites accessible for AI agents

Python 24,945 2,496 Updated Feb 7, 2025

codeforany / montly_expenses_trackizer_app_flutter

Dart 37 18 Updated Jul 18, 2023

jameskokoska / Cashew

💸 An app created to help users manage a budget and purchases

Dart 2,392 324 Updated Oct 9, 2024

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 68,065 8,328 Updated Feb 4, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 39,387 5,193 Updated Feb 6, 2025

feder-cr / Jobs_Applier_AI_Agent_AIHawk

Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 26,997 4,003 Updated Feb 2, 2025

siddhantdubey / PDFSpacedRepetition

TypeScript 17 5 Updated Aug 19, 2024

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,793 279 Updated Nov 18, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 4,357 259 Updated Feb 6, 2025

pragatiunna / License-Plate-Number-Detection

A project where the license plate number is extracted from image of a vehicle using Object detection and Character recognition techniques.

Jupyter Notebook 89 29 Updated Jun 9, 2021

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,311 491 Updated Apr 24, 2024

Tiiiger / bert_score

BERT score for text generation

Jupyter Notebook 1,666 225 Updated Jul 30, 2024

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 91,734 8,767 Updated Feb 7, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 66,165 7,070 Updated Feb 7, 2025

yuguochencuc / BAE-Net

BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION

Python 66 3 Updated Aug 20, 2024

haoheliu / versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,304 136 Updated Jan 28, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 45,433 5,022 Updated Feb 7, 2025

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,306 104 Updated Sep 24, 2023

sh-lee-prml / HierSpeechpp

The official implementation of HierSpeech++

Python 1,200 137 Updated Feb 20, 2024

Kartik-3004 / facexformer

Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis

Python 217 22 Updated Jan 5, 2025

kyegomez / VisionMamba

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 424 21 Updated Feb 3, 2025

myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 5,529 740 Updated Dec 24, 2024

relari-ai / continuous-eval

Data-Driven Evaluation for LLM-Powered Applications

Python 468 32 Updated Jan 22, 2025

shibuiwilliam / mixture_of_experts_keras

Mixture of experts on convolutional neural network using Keras and Cifar10

HTML 25 6 Updated Dec 7, 2017

Ruman rumanxyz

Lists (20)

art_with_ai

computer-vision-models

Deep dive paper

deep fakes

expense-tracker-app

face-attribute-analysis

face-detect-repos

face-spoof-detection

genAI, LLM & LVM

learning

mac app

mixture of experts

ml-ds-interview

number-plate-ocr

Reinforcement Learning

speech-analysis

text to speech (TTS)

vision - Paper, Arch, Repos,Demo

voice spoof detection

yolo-projects

Stars