Skip to content
Change the repository type filter

All

    Repositories list

    • vosk-api

      Public
      Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
      C++
      Apache License 2.0
      1.2k000Updated Apr 1, 2021Apr 1, 2021
    • Facebook AI Research's Automatic Speech Recognition Toolkit
      C++
      Other
      1k000Updated Mar 31, 2021Mar 31, 2021
    • nama

      Public
      multitrack recorder and digital audio workstation
      OpenEdge ABL
      GNU General Public License v3.0
      6000Updated Mar 24, 2021Mar 24, 2021
    • Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
      Python
      Apache License 2.0
      1.2k000Updated Mar 23, 2021Mar 23, 2021
    • StreamSaver writes stream to the filesystem directly asynchronous
      JavaScript
      MIT License
      426000Updated Mar 16, 2021Mar 16, 2021
    • WavPack

      Public
      WavPack encode/decode library, command-line programs, and several plugins
      C
      BSD 3-Clause "New" or "Revised" License
      71000Updated Mar 13, 2021Mar 13, 2021
    • Program receives the time the button was pressed.
      C++
      GNU General Public License v3.0
      1000Updated Mar 9, 2021Mar 9, 2021
    • RE-VERB

      Public
      speaker diarization system using an LSTM
      Python
      MIT License
      8000Updated Mar 9, 2021Mar 9, 2021
    • crunker

      Public
      Simple way to merge or concatenate audio files with the Web Audio API.
      JavaScript
      MIT License
      60000Updated Mar 9, 2021Mar 9, 2021
    • An HTML5 saveAs() FileSaver implementation
      JavaScript
      Other
      4.4k000Updated Mar 1, 2021Mar 1, 2021
    • Recorder

      Public
      html5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、ios部分浏览器、和Hybrid App(提供Android IOS App源码),微信也是支持的,提供H5版语音通话聊天示例 和DTMF编解码
      JavaScript
      MIT License
      1k000Updated Feb 15, 2021Feb 15, 2021
    • JavaScript
      1000Updated Feb 9, 2021Feb 9, 2021
    • React app using the Watson Speech to Text service to transform voice audio into written text.
      JavaScript
      Apache License 2.0
      36000Updated Feb 8, 2021Feb 8, 2021
    • A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
      JavaScript
      Other
      166000Updated Feb 6, 2021Feb 6, 2021
    • Stopwatch with lap counter
      JavaScript
      1000Updated Feb 3, 2021Feb 3, 2021
    • DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
      C++
      Mozilla Public License 2.0
      4k000Updated Feb 2, 2021Feb 2, 2021
    • A C# library to generate .RTF text files from any string object data, stylized and ready for any text processor. No fancy dependencies or restrictive licenses.
      C#
      Other
      6000Updated Feb 2, 2021Feb 2, 2021
    • PDF Annotation Tool
      JavaScript
      24000Updated Jan 27, 2021Jan 27, 2021
    • Pure-python library for adding annotations to PDFs
      Python
      MIT License
      46000Updated Jan 25, 2021Jan 25, 2021
    • rich-text

      Public
      Libraries for handling and rendering Rich Text 📄
      TypeScript
      MIT License
      109000Updated Jan 21, 2021Jan 21, 2021
    • sppas

      Public
      SPPAS: the automatic annotation and analysis of speech software
      8000Updated Jan 20, 2021Jan 20, 2021
    • dover-lap

      Public
      Method for combining overlap-aware diarization system outputs.
      Perl
      MIT License
      12000Updated Jan 16, 2021Jan 16, 2021
    • PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
      C
      Other
      725000Updated Jan 8, 2021Jan 8, 2021
    • Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
      Python
      BSD 2-Clause "Simplified" License
      340000Updated Dec 31, 2020Dec 31, 2020
    • Croppr.js

      Public
      A vanilla JavaScript image cropper that's lightweight, awesome, and has absolutely zero dependencies.
      JavaScript
      MIT License
      92000Updated Dec 14, 2020Dec 14, 2020
    • 🎤 Sample Node.js Application for the IBM Watson Speech to Text Service
      JavaScript
      Apache License 2.0
      707000Updated Dec 10, 2020Dec 10, 2020
    • Timing Labels Assistant for use with Audacity to produce timings for SAB
      AutoHotkey
      MIT License
      2000Updated Dec 1, 2020Dec 1, 2020
    • A testing server for a speech to text service based on mozilla deepspeech
      Python
      Mozilla Public License 2.0
      71000Updated Sep 3, 2020Sep 3, 2020
    • An Image upload and client-side resize javascript library using only HTML5 APIs
      JavaScript
      Other
      42000Updated Sep 2, 2020Sep 2, 2020
    • MoM.ai

      Public
      AI-enabled web app for Speaker Diarization, Audio transcription with time stamps and placeholder for every speakers' name for manual tagging, and MoM summary.
      HTML
      0000Updated Aug 30, 2020Aug 30, 2020