Stars
C++23 solutions to Advent of Code puzzles, all years complete.
Data pipeline and training pipeline for 🎵 music genre classification from the FMA dataset
Tutorial on Multicore OCaml parallel programming with domainslib
Code for the Million Song Dataset. The dataset contains metadata and audio analysis for a million tracks and is a collaboration between The Echo Nest and LabROSA. See website for details.
A pipeline for music similarity search
This library provides common speech features for ASR including MFCCs and filterbank energies.
Audio fingerprinting and recognition in Python
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Advice on how to get hired for the 2 most popular SWE-oriented Microsoft internships
Collection of Summer 2025 tech internships!
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Minimal and clean example implementations of machine learning algorithms
Minimal yet powerful ReasonReact template