CSC @ Unipd
- Padua, Italy
- matteospanio.github.io
- https://orcid.org/0000-0002-2436-7208
- in/matteo-spanio
Starred repositories
Python packaging and dependency management made easy
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
A simple library for Fréchet Audio Distance (FAD) calculation
PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
Symbolic Music NLP Artificial Intelligence Toolkit
Initial experiment for an iteratively refined sign language model.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
A full-featured, hackable Next.js AI chatbot built by Vercel
A comprehensive database of foods in the United States.
The official codebase for Capturing label characteristics in VAEs
Official implementation of Diffusion Autoencoders
Code for the paper "Jukebox: A Generative Model for Music"
A family of diffusion models for text-to-audio generation.
Statistical Rethinking course and book package
FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.
Visualisation, analysis, and annotation of music audio recordings
🪄 Create rich visualizations with AI
Code supporting the CVPR 2017 paper "Learning Cross-modal Embeddings for Cooking Recipes and Food Images"
im2recipe PyTorch implementation