Apple Teams Up With NVIDIA to Speed Up AI Language Models

Apple has shared details on a collaboration with NVIDIA to speed up text generation in large language models (LLMs) by implementing a new decoding technique that offers substantial performance gains for AI applications.

Apple earlier this year published and open-sourced Recurrent Drafter (ReDrafter), a speculative decoding approach that combines beam search and dynamic tree attention to accelerate text generation. Beam search explores multiple candidate text sequences at once for better results, while tree attention merges the prefixes those sequences share, removing redundant computation to improve efficiency.
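To make the prefix-sharing idea concrete, here is a minimal Python sketch. It is not Apple's implementation: the candidate sequences, the function names, and the nested-dictionary tree are all illustrative assumptions. It only shows how merging the shared prefixes of several beam candidates reduces the number of token positions that need to be processed.

```python
from typing import Dict, List, Sequence


def build_prefix_tree(beams: Sequence[Sequence[str]]) -> Dict:
    """Merge candidate sequences into a nested-dict trie keyed by token."""
    root: Dict = {}
    for beam in beams:
        node = root
        for token in beam:
            # Reuse the existing child if another beam already took this path.
            node = node.setdefault(token, {})
    return root


def count_nodes(tree: Dict) -> int:
    """Count the unique token positions left after merging shared prefixes."""
    return sum(1 + count_nodes(child) for child in tree.values())


# Hypothetical beam candidates (made up for illustration).
beams: List[List[str]] = [
    ["the", "cat", "sat", "down"],
    ["the", "cat", "sat", "up"],
    ["the", "cat", "ran", "off"],
    ["the", "dog", "ran", "off"],
]

tree = build_prefix_tree(beams)
total_tokens = sum(len(beam) for beam in beams)  # positions without merging
unique_nodes = count_nodes(tree)                 # positions after merging

print(f"{total_tokens} token positions reduced to {unique_nodes}")
```

In this toy case the four candidates contain 16 token positions, but only 10 remain once common prefixes are merged.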

Apple has now integrated the technology into NVIDIA's TensorRT-LLM framework, which optimizes LLMs running on NVIDIA GPUs, where it achieved "state of the art performance," according to Apple. In testing on a production model with tens of billions of parameters, the integrated technique delivered a 2.7x increase in tokens generated per second.
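As a rough illustration of what a 2.7x throughput gain means for response time, the short calculation below uses a hypothetical baseline of 20 tokens per second and a 500-token response. Both figures are assumptions for the sake of the example. Only the 2.7x factor comes from Apple.

```python
def generation_time(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to generate num_tokens at a given throughput."""
    return num_tokens / tokens_per_second


baseline_tps = 20.0   # hypothetical baseline throughput, not from Apple
speedup = 2.7         # figure reported by Apple
response_tokens = 500  # hypothetical response length

accelerated_tps = baseline_tps * speedup
baseline_s = generation_time(response_tokens, baseline_tps)
accelerated_s = generation_time(response_tokens, accelerated_tps)
reduction = 1 - accelerated_s / baseline_s  # fraction of time saved

print(f"baseline: {baseline_s:.1f}s, accelerated: {accelerated_s:.1f}s, "
      f"time saved: {reduction:.0%}")
```

Whatever baseline is chosen, a 2.7x throughput gain cuts generation time by about 63 percent, since 1 - 1/2.7 is roughly 0.63.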

Apple says the improved performance not only reduces user-perceived latency but also leads to decreased GPU usage and power consumption. From Apple's Machine Learning Research blog:

"LLMs are increasingly being used to power production applications, and improving inference efficiency can both impact computational costs and reduce latency for users. With ReDrafter's novel approach to speculative decoding integrated into the NVIDIA TensorRT-LLM framework, developers can now benefit from faster token generation on NVIDIA GPUs for their production LLM applications."
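The speculative decoding idea mentioned in the quote can be illustrated with a toy draft-and-verify loop. This is a generic greedy sketch under stated assumptions, not ReDrafter itself, which according to Apple also uses beam search and dynamic tree attention. The two "models" below are hypothetical stand-in functions, and each verification round is counted as one pass of the large model because a real system checks all drafted positions in a single batched forward pass.

```python
from typing import Callable, List, Tuple

NextToken = Callable[[List[str]], str]

TEXT = "the quick brown fox jumps over the lazy dog".split()


def target_model(context: List[str]) -> str:
    """Stand-in for the large model: always returns the 'correct' next token."""
    return TEXT[len(context) % len(TEXT)]


def draft_model(context: List[str]) -> str:
    """Stand-in for a cheap draft model: usually right, wrong at some positions."""
    i = len(context)
    return "???" if i % 5 == 4 else TEXT[i % len(TEXT)]


def speculative_decode(target: NextToken, draft: NextToken, prompt: List[str],
                       max_new_tokens: int, draft_len: int = 4
                       ) -> Tuple[List[str], int]:
    tokens = list(prompt)
    target_passes = 0
    while len(tokens) - len(prompt) < max_new_tokens:
        # 1) The draft model cheaply proposes several tokens in a row.
        proposal: List[str] = []
        for _ in range(draft_len):
            proposal.append(draft(tokens + proposal))
        # 2) The target model verifies the proposal (counted as one pass).
        target_passes += 1
        accepted: List[str] = []
        for guess in proposal:
            expected = target(tokens + accepted)
            accepted.append(expected)  # always keep the target's own token
            if expected != guess:
                break  # first mismatch: discard the rest of the draft
        tokens.extend(accepted)
    return tokens[:len(prompt) + max_new_tokens], target_passes


output, passes = speculative_decode(target_model, draft_model, ["the"], 8)
print(" ".join(output), f"({passes} target passes instead of 8)")
```

Because every kept token is the one the target model would have chosen, the output matches plain greedy decoding with the target alone. The saving comes from needing fewer passes of the large model.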

Developers interested in implementing ReDrafter can find detailed information on both Apple's website and NVIDIA's developer blog.

Top Rated Comments

attohs
6 hours ago at 03:25 am
NVidia? Did hell freeze over again?
vegetassj4
5 hours ago at 03:35 am
NVIDIA and Apple??!!? Working together again?
Delgibbons
5 hours ago at 03:52 am
Can't wait to put a 5090 in my Ma....

oh.
Little Endian
5 hours ago at 04:03 am
Apple is in triage mode over Siri!! Yes, everyone knows how bad Siri is!! All AI LLMs are far from perfect, but so far I would rather deal with any AI/LLM engine than Siri. I have an Android phone with Google’s Gemini, which is far from perfect, but I find myself using it 90% of the time over Siri. If my life depended on it I would avoid Siri at all costs. I would rather seek help from an alcoholic meth head with dementia than trust Siri. For heaven’s sake, she still can’t even dial a phone number or route me to the correct address with a greater than ~90% success rate.
redbeard331
5 hours ago at 03:32 am
Good we have to hurry this up.
lilkwarrior
4 hours ago at 05:15 am
What would be an even better collaboration would be Apple enabling Nvidia GPU options again—at least for the Mac Pro.

It would be AWESOME to be able to use Nvidia’s ray-tracing and tensor cores with my creative professional and AI problems with Titan-class/Prosumer/workstation GPUs (x90 and up) again without having to switch to my PC.

An Nvidia MPX GPU module as capable as a 5090, with no wires and Thunderbolt 5 support, would be a nirvana-like outcome, especially if Microsoft, Apple, and/or Valve enable a way to dual boot to Windows on ARM and SteamOS.

While I love building a liquid-cooled PC, I and various prosumers would finally have a choice to stop buying PCs altogether.