Siri Gives Eagles 33 False Super Bowl Wins in Basic Knowledge Test

In what may not come as much of a surprise, a new test of Siri's knowledge of Super Bowl history has revealed significant accuracy issues with Apple's virtual assistant, suggesting Apple still has some way to go in overcoming challenges with Siri's ability to provide reliable information.

Should Apple Kill Siri Feature
In a methodical experiment, One Foot Tsunami's Paul Kafasis asked Siri who won each Super Bowl from I through LX and documented its responses. The results were strikingly poor, with Siri correctly identifying winners only 34% of the time – just 20 correct answers out of 58 played Super Bowls.

Perhaps most notably, Siri repeatedly and incorrectly credited the Philadelphia Eagles with 33 Super Bowl victories, despite the team having won only one championship in their history. The virtual assistant's responses ranged from providing information about wrong Super Bowls to offering completely unrelated football facts.

While Siri did manage a few streaks of accurate answers, including three consecutive correct responses for Super Bowls V through VII, it also had a remarkable string of 15 consecutive incorrect answers spanning Super Bowls XVII through XXXII.

In one telling instance, when asked about Super Bowl XVI, Siri offered to defer to ChatGPT - which then provided the correct answer. The contrast highlighted the limitations of Siri's own knowledge base compared to more advanced AI systems.

The test was conducted on iOS 18.2.1 with Apple Intelligence enabled, and similar results were found on both the upcoming iOS 18.3 beta and macOS 14.7.2, suggesting the issue extends across Apple's platforms. Kafasis generated a spreadsheet of the results in both Excel and PDF formats, which you can read here.

Separately, inspired by Kafasis' test, Daring Fireball's John Gruber tried some of his own sports queries with Siri and compared its responses to ChatGPT, Kagi, DuckDuckGo, and Google, all of which succeeded where Siri failed.

Perhaps worse for Apple, Gruber found that old Siri (i.e. before Apple Intelligence) did a better job at answering a question by declining to answer it, instead providing a list of web links. The first web result provided an accurate, if only partial, answer to the question, whereas new Siri, powered by Apple Intelligence, fared much worse. Gruber explains:

New Siri — powered by Apple Intelligence™ with ChatGPT integration enabled — gets the answer completely but plausibly wrong, which is the worst way to get it wrong. It's also inconsistently wrong — I tried the same question four times, and got a different answer, all of them wrong, each time. It's a complete failure.

"It's just incredible how stupid Siri is about a subject matter of such popularity," commented Gruber. "If you had guessed that Siri could get half the Super Bowls right, you lost, and it wasn't even that close."

Of course, this isn't the first time Siri has received heavy flak for its all-round performance, but Gruber's criticism about "plausibly wrong" answers to general knowledge questions ties back to the modern problem of hallucinating AI chatbots that spout misleading or flat-out wrong responses with complete confidence.

Apple is developing a much smarter version of Siri that utilizes advanced large language models, which should allow the personal assistant to better compete with chatbots like ChatGPT. A chatbot version of Siri would likely be able to hold ongoing conversations and provide the sort of help and insight as ChatGPT or Claude, but how well the integration will perform may be a concern, going on Siri's abysmal track record.

Apple is expected to announce LLM Siri as soon as 2025 at WWDC, but Apple won't launch it until several months after it's unveiled. That means LLM Siri would come in an update to iOS 19, with Apple planning for a spring 2026 launch.

Popular Stories

prioritize notifications ios 18 4

Everything New in iOS 18.4 Beta 1

Friday February 21, 2025 1:08 pm PST by
Apple finally released the first beta of iOS 18.4 to developers for testing purposes, and while the beta is lacking some of the Apple Intelligence features we were hoping for, there are some notable new additions. Subscribe to the MacRumors YouTube channel for more videos. Priority Notifications - Apple Intelligence There is a new Priority Notifications feature that can show you your most...
ios 18 4 ambient music

iOS 18.4 Adds New Ambient Music Feature

Friday February 21, 2025 11:06 am PST by
In iOS 18.4, there's a new Ambient Music option that can be added to Control Center. There are four different sound categories, including Sleep, Chill, Productivity, and Wellbeing. Each category can be added to Control Center separately, and tapping one plays a random selection of sounds or music from that particular category. You can't choose what's playing from Control Center, but if...
apple launch feb 2025 alt

Here Are the New Apple Products We're Still Expecting This Spring

Thursday February 20, 2025 5:06 am PST by
Now that Apple has announced its new more affordable iPhone 16e, our thoughts turn to what else we are expecting from the company this spring. There are three product categories that we are definitely expecting to get upgraded before spring has ended. Keep reading to learn what they are. If we're lucky, Apple might make a surprise announcement about a completely new product category. M4...
iPhone 16e Feature

Apple Denies Speculation Surrounding iPhone 16e's Lack of MagSafe

Friday February 21, 2025 8:01 am PST by
Apple has confirmed that its custom-designed C1 modem in the iPhone 16e has nothing to do with the device's lack of MagSafe support, according to Macworld. Following the launch of the iPhone 16e, there was some speculation online about how MagSafe magnets might have interfered with the C1 modem's cellular connectivity performance, and this was considered to be a potential reason for the...
iPhone Fold Vertical Feature

Alleged Display Sizes Leaked for Apple's Book-Style Foldable iPhone

Friday February 21, 2025 2:14 am PST by
Another week, another alleged leak regarding Apple's fabled foldable iPhone. We've been hearing rumors about an iPhone that folds in half for over eight years now. While they have lacked consistency, they do suggest that Apple has tested various prototypes, with the hinge seemingly the biggest challenge Apple has been trying to overcome. Apple wants to eliminate any crease in the screen before...
ios 18 4 carplay

iOS 18.4 Includes a Small But Useful Change for CarPlay

Sunday February 23, 2025 2:23 pm PST by
The first beta of iOS 18.4 is now available, and it includes a small but useful change for CarPlay. As we noted in our list of iOS 18.4 features, CarPlay now shows a third row of icons, up from two rows previously. However, this change is only visible in vehicles with a larger center display. For example, a MacRumors Forums member noticed the change in a Toyota Tundra, which can be equipped...
iCloud Versus UK Key Feature

Apple Pulls Encrypted iCloud Security Feature in UK Amid Government Backdoor Demands

Friday February 21, 2025 7:17 am PST by
Apple has withdrawn its Advanced Data Protection iCloud feature from the United Kingdom following government demands for backdoor access to encrypted user data, according to Bloomberg. The move comes after UK officials secretly ordered Apple to provide unrestricted access to encrypted iCloud content worldwide. Customers who are already using Advanced Data Protection, or ADP, will need to...
Apple iPhone 16e Feature

Apple Announces iPhone 16e With A18 Chip and Apple Intelligence, Pricing Starts at $599

Wednesday February 19, 2025 8:02 am PST by
Apple today introduced the iPhone 16e, its newest entry-level smartphone. The device succeeds the third-generation iPhone SE, which has now been discontinued. The iPhone 16e features a larger 6.1-inch OLED display, up from a 4.7-inch LCD on the iPhone SE. The display has a notch for Face ID, and this means that Apple no longer sells any iPhones with a Touch ID fingerprint button, marking the ...
oppo find n5 fingers

World's Thinnest Foldable Phone Launches in Europe and Asia

Thursday February 20, 2025 8:55 am PST by
Oppo has launched the Find N5, the world's thinnest foldable phone yet. When closed, the book-style foldable measures 8.93mm. That's less than a millimeter thicker than an iPhone 16 Pro, and thinner than the Honor Magic V3, which was the previous record holder. The device is barely thicker than its USB-C port. Indeed, Oppo has suggested that the obstacle to making it any thinner is now "the...

Top Rated Comments

brofkand Avatar
4 weeks ago
Siri has been and always will be useless for anything other than simple things like setting timers. Apple has not written good software in years. The fact that their platforms are still mostly usable speaks to how far ahead they were a decade+ ago.
Score: 36 Votes (Like | Disagree)
Eriamjh1138@DAN Avatar
4 weeks ago
So Siri has become as factual as TikTok and Facebook. Got it.

Deactivated.
Score: 18 Votes (Like | Disagree)
NightfallOrchid Avatar
4 weeks ago

This isn't about Apple as such. The entire idea behind this is entirely flawed and they are being dragged into the hype and relying on praying that i'll eventually work, which it won't. Every company in the LLM space is doing the same thing. It's an arms race based on swimming in excrement with the promise of a cake at the end (the cake is a lie) and the end game is drowning.
As the original post states, ChatGPT, Kagi, DuckDuckGo and Google are all capable of giving correct answers, for some reason only Siri isn’t… Siri with ChatGPT support is somehow worse than ChatGPT on its own
Score: 14 Votes (Like | Disagree)
rivalius13 Avatar
4 weeks ago
Go Birds.
Score: 13 Votes (Like | Disagree)
MVMNT Avatar
4 weeks ago


Attachment Image
Score: 13 Votes (Like | Disagree)
kiranmk2 Avatar
4 weeks ago

This isn't a Maps level fiasco, but it's not too far off.

Remember when Apple's selling point was "they're not first, but when they do something, they do it right"? Those were the days.

We've got a confluence of factors here:

* An industry-wide fixation on what is in many ways not very good technology (and an insistence on shoehorning it in everywhere).
* Apple having a particular weakness in this particular area, dating back to well before the machine learning age and showing no signs of improvement.
* FOMO on Apple's part - fear of Android/Samsung eating their lunch if they can't say they also do this stuff.

So we've got a technology where even the best implementations are pretty bad, and Apple's implementation is worse.
Exactly this. I know it's a trope on here, but this is exactly when a Steve Jobs-like personality is really needed. Famously, he wasn't interested in pandering to Wall Street, insisting that Apple would follow it's own path and wouldn't pay dividends / buy back shares, instead, relying on a constant pipeline of amazing products to grow the company and stock price.

Under Tim Cook, Apple became much more led by its investors (look at the value of dividends / buy backs over the last 10 years) and this whole AI/LLM rush to not just catch-up, but publicly announce their plans in this area almost a year in advance (context aware Siri was announced last June, but won't be available until iOS18.4 around April/May) screams that they are messaging to investors that they are following the industry trend, rather than taking that trend and creating a new trend with it as they used to do.
Score: 12 Votes (Like | Disagree)