7 comments

  • jdiff 16 hours ago
    Appreciate the concept, seems deeply useful if a bit underbaked at present.

    Active STT allows a "No STT loaded" option that mentions it requires a multimodal LLM like Gemma 4. Except even when I use Gemma 4 features, Ctrl+S to dictate doesn't work. Unless I Voice Edit then quickly Dictate as soon as it processes the silence. Sometimes if the Dictation is triggered on silence, it'll just choose to paste whatever text is on screen. There's no way to dismiss the popup with the text before it's ready to vanish on its own. There's no way to preview what the TTS voices sound like without triggering something to be said manually.

    It seems like this will be a great tool soon, but currently there are very many rough edges that would benefit greatly from a nice heavy sanding pass.

    • lostathome 7 hours ago
      On it! Thanks for the great feedback.
  • joey9prints 15 hours ago
    Love that it's local ai, I think that's the future.
  • amanzi 20 hours ago
    You might want to mention this is Mac-only
  • ghostly_s 18 hours ago
    So it's a dictation tool? Then why does "voice to text" barely appear on the page? Why are you describing it here as an AI assistant but the page doesn't say anything about that? "Understands my screen"? Why does my dictation software need to understand my screen? I don't know what "text generation", "AI editing" or "AI writing" even mean.
  • linggen 6 hours ago
    [flagged]
  • ipotapov 12 hours ago
    [dead]
  • phillip_xyz 2 hours ago
    [dead]