I work adjacent to a group that does speech recognition. There’s a massive amount of variation in regional dialects, and that’s before you get to non-native speakers. Then you have people like my mother-in-law, who doesn’t have an accent, but her diction and grammar are… unique.
If someone is speaking in sentences, you can use context clues to infer intent, but it’s a lot more challenging when you’re just getting isolated spoken commands.
I suspect it’s a training/sample gap, but it’s likely going to be really hard to get to 100%.
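To make the context-clue point concrete, here’s a minimal sketch of rescoring a recognizer’s n-best list with a language model, so surrounding words can rescue an ambiguous command. GPT-2 is just a stand-in LM, and the hypotheses are invented for illustration:

```python
# Minimal sketch: rescore an ASR n-best list with a language model so
# sentence context can pick the plausible reading of an ambiguous command.
# GPT-2 is a stand-in LM; the hypotheses below are made up for illustration.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def lm_nll(text: str) -> float:
    """Mean negative log-likelihood per token; lower means more plausible."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        return model(ids, labels=ids).loss.item()

# Hypothetical n-best output from the recognizer for one utterance.
hypotheses = [
    "turn off the lights in the den",
    "turn off the lights in the then",
    "turnip the lights in the den",
]
print(min(hypotheses, key=lm_nll))
# -> "turn off the lights in the den": context makes the other two unlikely
```

With bare commands there’s no surrounding sentence to rescore against, which is exactly why they’re harder.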
Btw, why is there no speech recognition yet that uses an LLM to recognize words and meaning better?
And I can’t really google it; the results are flooded with Alexa and Siri and co., which is the reverse (speech feeding an assistant, rather than a language model improving the recognition itself).
If I’m understanding your question correctly, here’s an example model.
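For example, with something like OpenAI’s Whisper (a transformer-based recognizer trained end to end on audio–text pairs), usage is only a few lines; the model size and file name below are placeholders:

```python
# Minimal sketch with OpenAI's Whisper (pip install openai-whisper).
# "base" and "command.wav" are placeholders; ffmpeg must be on the PATH.
import whisper

model = whisper.load_model("base")        # downloads the checkpoint on first run
result = model.transcribe("command.wav")  # decodes and resamples via ffmpeg
print(result["text"])
```

It’s plain PyTorch underneath, so the same script runs on Windows or Linux, CPU or GPU.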
Exactly something like this for Windows/Linux.