-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Voice input limit handling #2536
Comments
Thanks for trying aider and filing this issue. Unfortunately converting to mp3 requires the user to have ffmpeg or libav. So it's tricky to make that the default. You can certainly configure your aider with |
@paul-gauthier Agreed though that it is a bit sucky when you record a voice and realize it's just slightly longer than the limit and then copy the WAV from tmp and upload it to https://replicate.com/openai/whisper Would be nicer if there were some fallback option that allowed you to record very long voice and use it without leaving aider. [edit: That replicate gave me nonsense strangely, and I used this instead: https://replicate.com/cjwbw/whisper] |
I'm labeling this issue as stale because it has been open for 2 weeks with no activity. If there are no additional comments, I will close it in 7 days. Note: A bot script made these updates to the issue. |
The main branch now checks if the wav file is too large, and tries to convert it to mp3 if so. The change is available in the main branch. You can get it by installing the latest version from github:
If you have a chance to try it, let me know if it works better for you. |
I'm closing this enhancement request since it has been marked as 'fixed' for over 3 weeks. The requested feature should now be available in recent versions of aider. If you find that this enhancement is still needed, please feel free to reopen this issue or create a new one. Note: A bot script made these updates to the issue. |
Issue
I just take a long time to explain all goals, features, todos, issues in detail using voice input. Took over 7 minutes and was really happy that I now gave it all relevant information, only for the transcription to fail. Would be very glad if this could be avoided, having minutes of voice input going down the drain really hurts. I managed to copy the tmp audio recording file and hope to be able to transcribe it myself, but this obviously isn't a good solution.
data:image/s3,"s3://crabby-images/a89ec/a89ec6f77c4377943b99d1771f530994ce5bdc99" alt="image"
Version and model info
Aider v0.66.0
Main model: claude-3-5-sonnet-20241022 with diff edit format, prompt cache, infinite output
Weak model: claude-3-5-haiku-20241022
Git repo: .git with 5 files
Repo-map: using 1024 tokens, files refresh
Added README.md to the chat.
Added experiment_fetch_recent_stars.py to the chat.
Added scrape_github.py to the chat.
Added test_scrape_github.py to the chat.
Restored previous conversation history.
Command Line Args: --deepseek --vim --analytics --analytics-log analytics.log
--cache-prompts --max-chat-history-tokens 10000 --voice-language de
Environment Variables:
OPENAI_API_KEY: ...U_UA
ANTHROPIC_API_KEY: ...egAA
Config File (/home/tom/.aider.conf.yml):
model: sonnet
editor: vim
cache-keepalive-pings:1
Defaults:
--model-settings-file:.aider.model.settings.yml
--model-metadata-file:.aider.model.metadata.json
--env-file: /home/tom/git/github_star_scraping/.env
--map-refresh: auto
--map-multiplier-no-files:2
--input-history-file:/home/tom/git/github_star_scraping/.aider.input.history
--chat-history-file:/home/tom/git/github_star_scraping/.aider.chat.history.md
--user-input-color:#00cc00
--tool-error-color:#FF2222
--tool-warning-color:#FFA500
--assistant-output-color:#0088ff
--code-theme: default
--aiderignore: /home/tom/git/github_star_scraping/.aiderignore
--lint-cmd: []
--test-cmd: []
--encoding: utf-8
--voice-format: wav
Option settings:
The text was updated successfully, but these errors were encountered: