-
Computer Vision Enginner
- Genoa, Italy
- https://[email protected]
- in/rollov
- https://t.me/rollovd
Stars
Learn Low Level Design (LLD) and prepare for interviews using free resources.
OMGGGGG / mmdg
Forked from ZitongYu/Flex-Modal-FASSuppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing
[AAAI 2025] DepthFM: Fast Monocular Depth Estimation with Flow Matching
Productive, portable, and performant GPU programming in Python.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
When it comes to optimizers, it's always better to be safe than sorry
OpenMMLab Pose Estimation Toolbox and Benchmark.
This repository contains code for the paper "Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs", RecSys '24
This is a list of awesome paper about optical flow and related work.
CoTracker is a model for tracking any point (pixel) on a video.
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Model for watermark classification implemented with PyTorch
Implementation of "TrackFormer: Multi-Object Tracking with Transformers”. [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
Run PyTorch LLMs locally on servers, desktop and mobile
Magnificent app which corrects your previous console command.
real time face swap and one-click video deepfake with only a single image
Extensions to YAML syntax for better python interaction
An extremely fast Python package and project manager, written in Rust.
High-Resolution Image Synthesis with Latent Diffusion Models
A simple and light-weight camera image processing pipeline
The repository provides code for the paper RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders, CIKM'24
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Utilities intended for use with Llama models.
An extremely fast Python linter and code formatter, written in Rust.