I use speech to text in apple's notepad on my phone.
-
@Phosphenes @tsturm @futurebird @canacar
"Generative ai" is a misnomer. Some useful tools use the same kind of architecture as image or video generators (text to speech for instance), and some use the same kind of transformer architecture as chatbots.But that's all implementation details. That's not important (it's like arguing what language was used to write a specific program). What matters is what it's used for, and by whom.
@jannem @Phosphenes @tsturm @canacar
I think the "generative" adjective is about if one is using the training data to correctly match, or to extrapolate.
Sometimes when I do dictation there is a loud noise or I mumble: I get a bunch of nonsense. The new nonsense is much more like normal sentences. It's doing a better job guessing in that way. I said a sentence and it only has a few sounds so it gives me a sentence (the wrong one)
But this same improvement lets it get words right more often.
-
@futurebird
@canacar Can't you just disconnect from any network (telco & wifi) and try? Does it still work offline? -
@futurebird @canacar Apple has made a big deal about having to “opt-in” to AI related services that happen off device. I am certain that the text to speech model is something like whisper running on your phone.
Quantized inference is relatively small and efficient. Most of the energy demands are on the training side.
@gdupont pointed out I could just disconnect from internet and wifi and test it again. So I did.
And it works great! It really must be doing most of the work locally, including the more fancy stuff where it goes back and fixes words as you add more context.
That makes me very happy because I like this feature.
-