I use speech to text in Apple's notepad on my phone. Since the most recent OS update I've noticed a big jump in quality.
Wondering if the text is getting brushed up by an LLM?
For example: I narrated a paragraph about ants and it got the word "integument" correct. (I fully expected it to be wrong: "in Ted, good man.")
This happens to be one of the few things I think LLMs do well -- but I would also like to know about all the water I'm destroying.
-
@futurebird @canacar Most text processing is done on device, including voice to text.
@rom Yes, Apple has been good about making sure the user has knowledge of and control over how these things work. I expect local processing, since smaller models can run on newer phones, but Apple also recently announced "Private Cloud Compute," which is supposed to offload compute-intensive tasks to the cloud while preserving privacy. Not sure which one this is, but turning off all network connectivity and checking whether the transcription quality changes should tell you.
@futurebird It seems Apple is not immune to pushing LLM hype into its devices. They did delay rolling out "LLM Siri" and have published papers on LLMs' inability to reason, so their devices may still be a better choice in that regard, especially considering how fully MS and Google are on board with all this.
-
@futurebird @canacar Apple has made a big deal about having to "opt in" to AI-related services that happen off device. I am certain that the speech-to-text model is something like Whisper running on your phone.
Quantized inference is relatively lightweight and efficient; most of the energy demand is on the training side.
How to get Apple Intelligence - Apple Support (support.apple.com)
“To get started with Apple Intelligence features on your compatible iPhone, iPad, Mac, or Apple Vision Pro, update your device to the latest software version, and ensure you have Apple Intelligence turned on under Settings > Apple Intelligence & Siri.”
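On the "something like Whisper running on your phone" point: here is a minimal sketch of fully local transcription using the open-source openai-whisper Python package on a laptop. The model choice and the audio filename are placeholders, and Apple's actual dictation model isn't public, so this only illustrates the general idea, not Apple's implementation.

```python
# pip install openai-whisper   (ffmpeg must also be installed for decoding)
import whisper

# "tiny" is a ~39M-parameter model; it downloads once, then everything
# below runs fully offline on the local CPU/GPU -- no network calls.
model = whisper.load_model("tiny")

result = model.transcribe("dictation_sample.wav")  # hypothetical file
print(result["text"])
```

Quantized builds of models like this (e.g. via whisper.cpp) shrink them further, which is why inference on a phone can be cheap even though training was not.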
-
I'm experiencing stress from wondering where exactly this tech is being injected all of the time.
This might drive me off the iPhone at last, although I did just get my screen fixed and felt like I ought to be good for two more years.
@futurebird @canacar Apple is doing that on the phone; it's a small pre-trained AI model that lives in the "neural engine" or whatever they call that part of the CPU.
There are supposedly some new iOS services that will ask nicely if they can take your data offsite for better processing, but that should be all opt-in.
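For a sense of how a small pre-trained model ends up "living in the neural engine": on Apple platforms, developers convert trained models to Core ML, and the OS schedules them onto the Neural Engine when the hardware allows. A sketch using Apple's coremltools package follows; the tiny network and its shapes are invented for illustration, and Apple's own dictation model is of course not something third parties build this way.

```python
# pip install coremltools torch
import torch
import coremltools as ct

# Stand-in for some small pre-trained network (hypothetical).
net = torch.nn.Sequential(
    torch.nn.Linear(80, 256), torch.nn.ReLU(), torch.nn.Linear(256, 64)
)
net.eval()
example = torch.rand(1, 80)
traced = torch.jit.trace(net, example)

# ComputeUnit.ALL lets iOS/macOS run the model on the Neural Engine
# when possible, falling back to GPU/CPU otherwise.
mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(shape=example.shape)],
    compute_units=ct.ComputeUnit.ALL,
    convert_to="mlprogram",
)
mlmodel.save("TinyModel.mlpackage")
```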
-
@futurebird @canacar That's how I understand it.
Apple's AI push seems to have fizzled out a bit. They will certainly try to expand Siri with more features that require a data center, but speech recognition should still be all on-device.
-
@futurebird I don't know the details of that Apple product, but it probably uses a lot less power than you're imagining.
There's a diminishing-returns problem with the intelligence of LLMs, and the companies chasing those diminishing returns are building massive data centers.
But if you're not trying to build something that can pass as a sentient being, then LLMs don't have to be power-hungry.
There are small models that are actually quite useful for common tasks.
That's getting lost in this rush to try to build the world's smartest AI that can just do anything you ask it to.
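As one concrete example of the "small models for common tasks" point, here is a sketch using Hugging Face transformers with a distilled sentiment model (a real public checkpoint, picked arbitrarily): after a one-time download it runs comfortably on a laptop CPU, no data center involved.

```python
# pip install transformers torch
from transformers import pipeline

# DistilBERT is a distilled (smaller, faster) BERT variant.
classify = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)
print(classify("Dictation on this phone got noticeably better."))
# -> [{'label': 'POSITIVE', 'score': 0.99...}]
```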
-
Why isn't functional speech to text more of a big deal? Seems like a massive tech win that could change workflows all over the place.
-
@tsturm @futurebird @canacar
I believe the Android speech recognition and translation functions are also local models, and the Firefox language translator is entirely local. There are lots of truly useful and significant tools coming from recent ML advances (one, AlphaFold, got a Nobel prize), but they're not LLM chatbots and they don't get all this public recognition or money.
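A sketch of what fully local translation looks like. (Firefox actually ships its own Bergamot/Marian models; the public MarianMT checkpoint below is chosen just to illustrate the same idea.)

```python
# pip install transformers torch sentencepiece
from transformers import MarianMTModel, MarianTokenizer

# A few-hundred-MB German-to-English model; after the one-time
# download, everything below runs offline on the local machine.
name = "Helsinki-NLP/opus-mt-de-en"
tokenizer = MarianTokenizer.from_pretrained(name)
model = MarianMTModel.from_pretrained(name)

batch = tokenizer(
    ["Spracherkennung läuft lokal auf dem Gerät."],
    return_tensors="pt", padding=True,
)
out = model.generate(**batch)
print(tokenizer.decode(out[0], skip_special_tokens=True))
# -> roughly: "Speech recognition runs locally on the device."
```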
-
@jannem @tsturm @futurebird @canacar
None of those accomplishments were by generative AI, were they? All the crappy AI is generative. Translation and speech recognition are just mapping, with no hallucinations.
-
@futurebird
@canacar Can't you just disconnect from any network (telco & wifi) and try? Does it still work offline?
-
@Phosphenes @tsturm @futurebird @canacar
"Generative ai" is a misnomer. Some useful tools use the same kind of architecture as image or video generators (text to speech for instance), and some use the same kind of transformer architecture as chatbots.But that's all implementation details. That's not important (it's like arguing what language was used to write a specific program). What matters is what it's used for, and by whom.
-
@jannem @Phosphenes @tsturm @canacar
I think the "generative" adjective is about if one is using the training data to correctly match, or to extrapolate.
Sometimes when I do dictation there is a loud noise, or I mumble, and I get a bunch of nonsense. The new nonsense is much more like normal sentences: it's doing a better job of guessing in that way. I said a sentence and it only caught a few sounds, so it gives me a sentence (the wrong one).
But this same improvement lets it get words right more often.
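That "fluent nonsense" behavior falls out of how decoders combine an acoustic score (how well a candidate matches the sounds) with a language-model score (how sentence-like the candidate is). A toy illustration with invented numbers: when the audio is clear, the acoustic evidence dominates and an unusual word like "integument" can win; when the audio is mumbled, the acoustic scores are nearly flat, so the fluency prior picks a normal-looking sentence that was never said.

```python
# Toy decoder: combined score = acoustic score + fluency (LM) score.
# All numbers are invented for illustration.

def best(candidates):
    return max(candidates, key=lambda k: sum(candidates[k]))

# Clear audio: acoustic scores strongly separate the candidates.
clear = {
    "integument":            (0.90, 0.30),  # (acoustic, fluency)
    "in Ted, good man":      (0.20, 0.10),
    "the meeting went well": (0.05, 0.80),
}

# Mumbled audio: acoustic scores are nearly flat, so fluency decides,
# and out comes a grammatical sentence you never said.
mumbled = {
    "integument":            (0.35, 0.30),
    "in Ted, good man":      (0.33, 0.10),
    "the meeting went well": (0.34, 0.80),
}

print(best(clear))    # -> integument
print(best(mumbled))  # -> the meeting went well
```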
-
@gdupont pointed out I could just disconnect from the internet and wifi and test it again. So I did.
And it works great! It really must be doing most of the work locally, including the fancier stuff where it goes back and fixes words as you add more context.
That makes me very happy because I like this feature.
-