I’m not ready to talk about it in detail. Even my boss doesn’t know. But you’re in the right ballpark.
I’m actually building a proof-of-concept prototype for what I want to work on… and I’m using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.
That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
So what are you building? A browser STT interface for chatting with GPT and other LLMs?
I’m not ready to talk about it in detail. Even my boss doesn’t know. But you’re in the right ballpark.
I’m actually building a proof-of-concept prototype for what I want to work on… and I’m using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.
That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
Have you tried Whisper from OpenAI? It’s the best I’ve ever seen. I’m curious how it would handle accents.
No, not yet. But thanks for the tip!