Apple's Latest Machine Learning Journal Entry Focuses on 'Hey Siri' Trigger Phrase

Apple's latest entry in its online Machine Learning Journal focuses on the personalization process that users partake in when activating "Hey Siri" features on iOS devices. Across all Apple products, "Hey Siri" invokes the company's AI assistant, and can be followed up by questions like "How is the weather?" or "Message Dad I'm on my way."

"Hey Siri" was introduced in iOS 8 on the iPhone 6, and at that time it could only be used while the iPhone was charging. Afterwards, the trigger phrase could be used at all times thanks to a low-power and always-on processor that fueled the iPhone and iPad's ability to continuously listen for "Hey Siri."

In the new Machine Learning Journal entry, Apple's Siri team breaks down its technical approach to the development of a "speaker recognition system." The team created deep neural networks and "set the stage for improvements" in future iterations of Siri, all motivated by the goal of creating "on-device personalization" for users.

Apple's team says that "Hey Siri" was chosen as the trigger phrase because of its "natural" phrasing, and describes three scenarios in which unintended activations prove troubling for "Hey Siri" functionality. These include "when the primary user says a similar phrase," "when other users say 'Hey Siri,'" and "when other users say a similar phrase." According to the team, the last scenario is "the most annoying false activation of all."

To lessen these accidental activations of Siri, Apple leverages techniques from the field of speaker recognition. Importantly, the Siri team says that it is focused on "who is speaking" and less on "what was spoken."

The overall goal of speaker recognition (SR) is to ascertain the identity of a person using his or her voice. We are interested in “who is speaking,” as opposed to the problem of speech recognition, which aims to ascertain “what was spoken.” SR performed using a phrase known a priori, such as “Hey Siri,” is often referred to as text-dependent SR; otherwise, the problem is known as text-independent SR.
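To make the "who is speaking" idea concrete, here is a minimal sketch of a text-dependent speaker check. It assumes each "Hey Siri" utterance has already been turned into a fixed-length vector by some upstream model. The function names, the cosine score, the averaging over stored vectors, and the 0.7 threshold are all illustrative assumptions rather than details taken from this article.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector carries no speaker information
    return dot / (norm_a * norm_b)


def is_enrolled_speaker(utterance_vec, profile_vecs, threshold=0.7):
    """Accept the utterance only if it is, on average, close to the profile.

    threshold is a hypothetical value chosen for this toy example.
    """
    if not profile_vecs:
        return False  # nobody has enrolled yet
    scores = [cosine_similarity(utterance_vec, p) for p in profile_vecs]
    return sum(scores) / len(scores) >= threshold


# Toy 3-dimensional vectors standing in for real speaker embeddings.
profile = [[0.9, 0.1, 0.2], [0.8, 0.2, 0.1]]
same_speaker = [0.85, 0.15, 0.15]
other_speaker = [0.1, 0.9, 0.3]

print(is_enrolled_speaker(same_speaker, profile))   # True
print(is_enrolled_speaker(other_speaker, profile))  # False
```

Because the phrase is known in advance, the check only has to decide whether the voice matches the stored profile, not what words were spoken.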

The journal entry then explains how users enroll in a personalized "Hey Siri" profile through explicit and implicit enrollment. Explicit enrollment begins as soon as users speak the trigger phrase a few times, while the implicit profile is "created over a period of time" and made during "real-world situations."
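The difference between the two enrollment modes can be sketched in a few lines. This is a hypothetical illustration, not Apple's implementation: the SpeakerProfile class, the acceptance threshold of 0.9, and the cap of 40 stored vectors are assumptions made for the example, and utterances are assumed to arrive as fixed-length vectors.

```python
import math


def _cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norms = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norms if norms else 0.0


class SpeakerProfile:
    """Hypothetical store of one user's "Hey Siri" utterance vectors."""

    def __init__(self, max_vectors=40):
        self.vectors = []
        self.max_vectors = max_vectors  # assumed cap, for illustration only

    def _add(self, vec):
        if len(self.vectors) >= self.max_vectors:
            return False
        self.vectors.append(list(vec))
        return True

    def score(self, vec):
        """Average similarity of vec to everything stored so far."""
        if not self.vectors:
            return 0.0
        return sum(_cosine(vec, v) for v in self.vectors) / len(self.vectors)

    def explicit_enroll(self, setup_vectors):
        """Explicit enrollment: store the phrases spoken on request."""
        for vec in setup_vectors:
            self._add(vec)

    def implicit_enroll(self, vec, accept_threshold=0.9):
        """Implicit enrollment: keep a real-world utterance only if it
        already matches the profile strongly (threshold is an assumption)."""
        if self.score(vec) >= accept_threshold:
            return self._add(vec)
        return False


user_profile = SpeakerProfile()
user_profile.explicit_enroll([[0.9, 0.1, 0.2], [0.8, 0.2, 0.1]])

accepted = user_profile.implicit_enroll([0.85, 0.15, 0.15])  # similar voice
rejected = user_profile.implicit_enroll([0.1, 0.9, 0.3])     # different voice
print(accepted, rejected, len(user_profile.vectors))  # True False 3
```

The design choice shown here is that implicit enrollment only grows the profile from utterances that already score well, so the profile picks up real-world variation without absorbing other people's voices.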

The Siri team says that the remaining challenge for speaker recognition is achieving quality performance in reverberant (large room) and noisy (car) environments. You can check out the full Machine Learning Journal entry on "Hey Siri" for more details.

Since the journal began last summer, Apple has shared numerous entries on complex topics, which have already included "Hey Siri," face detection, and more. All past entries can be seen on Apple.com.

Top Rated Comments

ThatGuyInLa
87 months ago
Apple can post all the journals and blog entries they want. Siri sucks. (When compared to others)

As I’ve said before, they need to fire the entire team and hire new. Poach if needed. They need to JOBS THIS SOB.
Score: 8 Votes
barkomatic
87 months ago
Blah blah blah, neural networks, blah blah blah, set the stage for improvements, blah. They need to actually *make* those improvements and quickly. Either that or allow us to use Google assistant or Alexa natively on the iPhone. I'm tired of Siri telling me about web searches. It's hard to believe how Apple dropped the ball that they brought to the game.
Score: 6 Votes
WWPD
87 months ago
I thought they did say 'Computer' often before issuing commands in Star Trek? Maybe my memory is fuzzy though.
Computer was always the trigger word in Star Trek.
Score: 3 Votes
sundog925
87 months ago
They're whole siri system needs an overhaul.
Its lagging so far behind others, dictation is awful, through bluetooth is worse. its just all around poor.

Even spotify's new voice control is lightyears more accurate than siri :rolleyes:
Score: 3 Votes
TheBruno
87 months ago
I had to disable this trigger. I can't say 'serious' anywhere in the vicinity of my iPad Pro.
Score: 2 Votes
iamgalt
87 months ago
I can't say I've experienced too many unintended activations, but I do still wish apple would allow us to change the triggering phrase to whatever we wanted. That may help with unintended activations for each person.
Score: 2 Votes