Free Trial

→ The Trump Dump is starting; Get out of stocks now? (From Paradigm Press) (Ad)

NASDAQ:NVDA

NVIDIA Q4 2024 Earnings Report

$159.28 +2.03 (+1.29%)

Closing price 07/3/2025 03:59 PM Eastern

Extended Trading

$159.22 -0.06 (-0.04%)

As of 07/3/2025 04:59 PM Eastern

Profile Earnings History Forecast

NVIDIA EPS Results

Actual EPS

$0.52

Consensus EPS

$0.42

Beat/Miss

Beat by +$0.10

One Year Ago EPS

$0.07

NVIDIA Revenue Results

Actual Revenue: $22.10 billion
Expected Revenue: $20.40 billion
Beat/Miss: Beat by +$1.71 billion
YoY Revenue Growth: +265.30%

NVIDIA Announcement Details

Quarter: Q4 2024
Date: 2/21/2024
Time: After Market Closes
Conference Call Date: Wednesday, February 21, 2024
Conference Call Time: 5:00PM ET

Upcoming Earnings

NVIDIA's Q2 2026 earnings is scheduled for Wednesday, August 27, 2025, with a conference call scheduled at 5:00 PM ET. Check back for transcripts, audio, and key financial metrics as they become available.

Conference Call Resources

Slide Deck

NVIDIA's Q4 2024 Slide Deck

Full Screen Slide Deck

NVIDIA Q4 2024 Earnings Call Transcript

Key Takeaways

Record Q4 and fiscal 2024 results: NVIDIA posted Q4 revenue of $22.1 billion (up 22% sequentially, 265% year-on-year) and full-year revenue of $60.9 billion (up 126% year-on-year), well above outlook.
Data center business triples: Fiscal 2024 data center revenue reached $47.5 billion (more than 3× last year), with Q4 up 27% sequentially and 409% year-on-year, driven by the Hopper GPU platform and generative AI workloads.
Software and services momentum: The division reached a $1 billion annualized run rate in Q4, and NVIDIA AI Enterprise and DGX Cloud expanded to AWS, Azure, Google Cloud and Oracle Cloud partners.
Strong Q1 outlook: Management guided Q1 revenue of $24 billion ±2% with GAAP gross margin around 76.3%, signaling continued data center and professional segment growth despite gaming seasonality.
China data center decline: Q4 data center revenue in China fell sharply following U.S. export controls; restricted products are paused pending approved alternatives, now ~mid-single-digit % of global data center revenue.

AI Generated. May Contain Errors.

Conference Call Audio

Earnings Conference Call

NVIDIA Q4 2024

00:00 / 00:00

There are 11 speakers on the call.

Operator

00:00:00

Good afternoon. My name is Rob, and I will be your conference operator today. At this time, I would like to welcome everyone to the NVIDIA's 4th Quarter Earnings Call. All lines have been placed on mute to prevent any background noise. After the speakers' remarks, there will be a question and answer Thank you.

Operator

00:00:25

Simona Jankowski, you may begin your conference.

Speaker 1

00:00:29

Thank you. Good afternoon, everyone, and welcome to NVIDIA's conference call for the 4th quarter fiscal 2024. With me today from NVIDIA are Jensun Huang, President and Chief Executive Officer and Colette Kress, Executive Vice President and Chief Financial Officer. I'd like to remind you that our call is being webcast live on NVIDIA's Investor Relations website. The webcast will be available for replay until the conference call to discuss our financial results for the Q1 of fiscal 2025.

Speaker 1

00:00:58

The content of today's call is NVIDIA's property. It can be reproduced or transcribed without our prior written consent. During this call, we may make forward looking statements based on current expectations. These are subject to a number of significant risks and uncertainties, and our actual results may differ materially. For a discussion of factors that could affect our future financial results and business, please refer to the disclosure in today's earnings release, our most recent Forms 10 ks and 10 Q and the reports that we may file on Form 8 ks with the Securities and Exchange Commission.

Speaker 1

00:01:32

All our statements are made as of today, February 21, 2024, based on information currently available to us. Except as required by law, we assume no obligation to update any such statements. During this call, we will discuss non GAAP financial measures. You can find a reconciliation of these non GAAP financial measures to GAAP financial measures in our CFO commentary, which is posted on our website. With that, let me turn the call over to Colette.

Speaker 2

00:02:00

Thanks, Simona. Q4 was another record quarter. Revenue of $22,100,000,000 was up 22% sequentially and up 2 65% year on year and well above our outlook of $20,000,000,000 For fiscal 2024, revenue was $60,900,000,000 and up 126 percent from the prior year. Starting with data center. Data center revenue for the fiscal 2024 year was $47,500,000,000 more than tripling from the prior year.

Speaker 2

00:02:38

The world has reached a tipping point of new computing era. The $1,000,000,000,000 installed base of data center infrastructure is rapidly transitioning from general purpose to accelerated computing. As Moore's law slows, while computing demand continues to skyrocket, companies may accelerate every workload possible to drive future improvement in performance, TCO and energy efficiency. At the same time, companies have started to build the next generation of modern data centers, what we refer to as AI factories, purpose built to refine raw data and produce valuable intelligence in the era of generative AI. In the Q4, data center revenue of $18,400,000,000 was a record, up 27% sequentially and up 409% year on year, driven by the NVIDIA Hopper GPU computing platform, along with InfiniBand end to end networking.

Speaker 2

00:03:43

Compute revenue grew more than 5x and networking revenue tripled from last year. We are delighted that supply of hopper architecture products is improving. Demand for hopper remains very strong. We expect our next generation products to be supply constrained as demand far exceeds supply. 4th quarter data center growth was driven by both training and inference of generative AI and large language models across a broad set of industries, use cases and regions.

Speaker 2

00:04:19

The versatility and leading performance of our data center platform enables a high return on investment for many use cases, including AI training and inference, data processing and a broad range of CUDA accelerated workloads. We estimate in the past year approximately 40% of data center revenue was for AI inference. Building and deploying AI solutions has reached virtually every industry. Many companies across industries are training and operating their AI models and services at scale. Enterprises across NVIDIA AI Infrastructure through cloud providers, including hyperscales, GPU specialized and private clouds or on premise.

Speaker 2

00:05:10

NVIDIA's computing stack extends seamlessly across cloud and on premise environments, allowing customers to deploy with a multi cloud or hybrid cloud strategy. In the 4th quarter, large cloud providers represented more than half of our data center revenue, supporting both internal workloads and external public cloud customers. Microsoft recently noted that more than 50,000 organizations use GitHub Copilot Business to supercharge the productivity of their developers contributing to GitHub revenue growth accelerating to 40% year on year. And CoPilot for Microsoft 365 adoption grew faster in its 1st 2 months than the 2 previous major Microsoft 365 enterprise suite that leases that. Consumer Internet Companies have been early adopters of AI and represent one of our largest customer categories.

Speaker 2

00:06:13

Companies from search to e commerce, social media, news and video services and entertainment are using AI for deep learning based recommendation systems. These AI investments are generating a strong return by improving customer engagement, ad conversation and click throughs rates. Meta, in its latest quarter, cited more accurate predictions and improved advertiser performance as contributing to the significant acceleration in its revenue. In addition, consumer Internet companies are investing in generative AI to support content creators, advertisers and customers through automation tools for content and ad creation, online product descriptions and AI shopping assistance. Enterprise software companies are applying generative AI to help customers realize productivity gains.

Speaker 2

00:07:13

Early customers we've partnered with for both training and inference of generative AI are already seeing notable commercial success. ServiceNow's generative AI products in their latest quarter drove their largest ever net new annual contract value contribution of any new product family release. We are working with many other leading AI and enterprise software platforms as well, including Adobe, Databricks, Getty Images, SAP and Snowflake. The field of foundation large language models is thriving. Anthropic, Google, Inflexion, Microsoft, OpenAI and XAI are leading with continued amazing breakthrough in generative AI.

Speaker 2

00:08:04

Exciting companies like Adept, AI21, Character AI, Cohere, Mistral, Perplexity and Runway are building platforms to serve enterprises and creators. New startups are creating LLMs to serve the specific languages, cultures and customs of the world's many regions. And others are creating foundation models to address entirely different industries like recursion, pharmaceuticals and generative biomedicines for biology. These companies are driving demand for NVIDIA AI infrastructure through hyperscale or GPU specialized cloud providers. Just this morning, we announced that we collaborated with Google to optimize its state of the art new JEMMA language models to accelerate their inference performance on NVIDIA GPUs in the cloud, data center and PC.

Speaker 2

00:09:05

One of the most notable trends over the past year is the significant adoption of AI by enterprises across the industry verticals such as automotive, healthcare and financial services. NVIDIA offers multiple application frameworks to help companies adopt AI in vertical domains such as autonomous driving, drug discovery, low latency machine learning for fraud detection or robotics, leveraging our full stack accelerated computing platform. We estimate that data center revenue contribution of the automotive vertical through the cloud or on prem exceeded $1,000,000,000 last year. NVIDIA DRIVE Infrastructure Solutions include systems and software for the development of autonomous driving, including data ingestion, curation, labeling and AI training, plus validation through simulation. Almost 80 vehicle manufacturers across global OEMs, new energy vehicles, trucking, robotaxi and Tier 1 suppliers are using NVIDIA's AI infrastructure to train LLMs and other AI models for automated driving and AI cockpit applications.

Speaker 2

00:10:20

In effect, nearly every automotive company working on AI is working with NVIDIA. As AV algorithms move to video transformers and more cars are equipped with cameras, we expect NVIDIA's automotive data center processing demand to grow significantly. In healthcare, digital biology and generative AI are helping to reinvent drug discovery, surgery, medical imaging and wearable devices. We have built deep domain expertise in healthcare over the past decade, creating the NVIDIA Clara Healthcare platform and NVIDIA Bionemo, a generative AI service to develop, customize and deploy AI foundation models for computer aided drug discovery. BioNemo features a growing collection of pre trained biomolecular AI models that can be applied to the end to end drug discovery processes.

Speaker 2

00:11:23

We announced Recursion is making available for their proprietary AI model through BioNeMo for the drug discovery ecosystem. In financial services, customers are using AI for a growing set of use cases from trading and risk management to customer service and fraud detection. For example, American Express improved fraud detection accuracy by 6% using NVIDIA AI. Shifting to our data center revenue by geography. Growth was strong across all regions except for China, where our data center revenue declined significantly following the U.

Speaker 2

00:12:02

S. Government export control regulations imposed in October. Although we have not received licenses from the U. S. Government to ship restricted products to China, we have started shipping alternatives that don't require a license for the China market.

Speaker 2

00:12:18

China represented a mid single digit percentage of our data center revenue in Q4, and we expect it to stay in a similar range in the Q1. In regions outside of the U. S. And China, Sovereign AI has become an additional demand driver. Countries around the world are investing in AI infrastructure to support the building of large language models in their own language, on domestic data and in support of their local research and enterprise ecosystems.

Speaker 2

00:12:51

From a product perspective, the vast majority of revenue was driven by our Hopper architecture, along with InfiniBand networking. Together they have emerged as the de facto standard for accelerated computing and AI infrastructure. We are on track to ramp H200 with initial shipments in the 2nd quarter. Demand is strong as H200 nearly doubles the inference performance of H100. Networking exceeded a $13,000,000,000 annualized revenue run rate.

Speaker 2

00:13:27

Our end to end networking solutions define modern AI data centers. Our quantum InfiniBand solutions grew more than 5x year on year. NVIDIA Quantum InfiniBand is the standard for the highest performance AI dedicated infrastructures. We are now entering the Ethernet networking space with the launch of our new SpectrumX end to end offering designed for an AI optimized networking for the data center. SpectrumX introduces new technologies over Ethernet that are purpose built for AI.

Speaker 2

00:14:05

Technologies incorporated in our spectrum switch, BlueField, CPU and software stack deliver 1.6x higher networking performance for AI processing compared with traditional Ethernet. Leading OEMs, including Dell, HPE, Lenovo and Supermicro with their global sales channels are partnering with us to expand our AI solution to enterprises worldwide. We are on track to ship SpectrumX this quarter. We also made great progress with our software and services offerings, which reached an annualized revenue run rate of $1,000,000,000 in Q4. We announced that NVIDIA DGX Cloud will expand its list of partners to include Amazon's AWS, joining Microsoft Azure, Google Cloud and Oracle Cloud.

Speaker 2

00:14:59

DGX Cloud is used for NVIDIA's own AI R and D and custom model development, as well as NVIDIA developers. It brings the CUDA ecosystem to NVIDIA CSP partners. Okay, moving to gaming. Gaming revenue was $2,870,000,000 was flat sequentially and up 56% year on year, better than our outlook on solid consumer demand for NVIDIA GeForce RTX GPUs during the holidays. Fiscal year revenue of $10,450,000,000 was up 15%.

Speaker 2

00:15:36

At CES, we announced our GeForce RTX 40 Super Series family of GPUs. Starting at $5.99 they deliver incredible gaming performance and generative AI capabilities. Sales are off to a great start. NVIDIA AI Tensor Cores in the GPUs deliver up to 836 AI TOPS, perfect for powering AI for gaming, creating an everyday productivity. The rich software stack we offer with our RTX GPUs further accelerates AI.

Speaker 2

00:16:13

With our DLSS technology, 7 out of 8 pixels can be AI generated, resulting up to 4x faster ray tracing and better image quality. And with the TensorRTLLM for Windows, our open source library that accelerates inference performance for the latest large language models, generative AI can run up to 5x faster on RTX AI PCs. At CES, we also announced a wave of new RTX 40 Series AI laptops from every major OEMs. These bring high performance gaming and AI capabilities to a wide range of form factors, including 14 inches and thin and light laptops. With up to 6 86 tops of AI performance, these next generations AI PCs increase generative AI performance by up to 60x, making them the best performing AI PC platforms.

Speaker 2

00:17:15

At CES, we announced NVIDIA Avatar Cloud Engine Microservices, which allowed developers to integrate state of the art generative AI models into digital avatars. ACE won several best of CES 2024 awards. NVIDIA has an end to end platform for building and deploying generative AI applications for RTX PCs and workstations. This includes libraries, SDKs, tools and services developers can incorporate into their generative AI workloads. NVIDIA is fueling the next wave of generative AI applications coming to the PC.

Speaker 2

00:17:57

With over 100,000,000 RTX PCs in the installed base and over 500 AI enabled PC applications and games, we are on our way. Moving to Pro Visualization. Revenue of $463,000,000 was up 11% sequentially and up 105% year on year. Fiscal year revenue of $1,550,000,000 was up 1%. Sequential growth in the quarter was driven by a rich mix of RTX, ADA architecture DPUs continuing to ramp.

Speaker 2

00:18:35

Enterprises are refreshing their workstations to support generative AI related workloads such as data preparation, LLM, fine tuning and retrieval augmented generation. These key industrial verticals driving demand include manufacturing, automotive and robotics. The automotive industry has also been an early adopter of NVIDIA Omniverse as it seeks to digitalize workflows from design to build, simulate, operate and experience their factories and cars. At CES, we announced that creative partners and developers, including Brickland, WPP and ZeroLight are building Omniverse powered car configurators. Leading automakers like Lotus are adopting the technology to bring new levels of personalization, realism and enter activity to the car buying experience.

Speaker 2

00:19:34

Moving to automotive. Revenue was $281,000,000 up 8% sequentially and down 4% year on year. Fiscal year revenue of $1,090,000,000 was up 21%, crossing the $1,000,000,000 mark for the first time on continued adoption of the NVIDIA DRIVE Orin is the AI car computer of choice for software defined AV fleets. Its successor NVIDIA Drive Thor designed for vision transformers, Ocwen offers more AI performance and integrates a wide range of intelligent capabilities into a single AI compute platform, including autonomous driving and parking, driver and passenger monitoring and AI cockpit functionality and will be available next year.

Speaker 3

00:20:30

There were

Speaker 2

00:20:30

several automotive customer announcements this quarter. Li Auto, Great Wall Motor, Zeekr, the premium EV subsidiary of Geely and Xiaomi ET all announced new vehicles built on NVIDIA. Moving to the rest of the P and L. GAAP gross margins expanded sequentially to 76% and non GAAP gross margins to 76.7% on strong data center growth and mix. Our gross margins in Q4 benefited from favorable component costs.

Speaker 2

00:21:09

Sequentially, GAAP operating expenses were up 6% and non GAAP operating expenses were up 9%, primarily reflecting higher compute and infrastructure investments and employee growth. In Q4, we returned $2,800,000,000 to shareholders in the form of share repurchases and cash dividends. During fiscal year 2024, we utilized cash of $9,900,000,000 towards shareholder returns, including $9,500,000,000 in share repurchases. Let me turn to the outlook for the Q1. Total revenue is expected to be $24,000,000,000 plus or minus 2%.

Speaker 2

00:21:55

We expect sequential growth in data center and pro business, partially offset by seasonal decline in gaming. GAAP and non GAAP gross margins are expected to be 76.3% and 77%, respectively, plus or minus 50 basis points. Similar to Q4, Q1 gross margins are benefiting from favorable component costs. Beyond Q1, for the remainder of the year, we expect gross margins to return to the mid-70s percent range. GAAP and non GAAP operating expenses are expected to be approximately $3,500,000,000 $2,500,000,000 respectively.

Speaker 2

00:22:45

Fiscal year 2025 GAAP and non GAAP operating expenses are expected to grow in the mid-thirty percent range as we continue to invest in the large opportunities ahead of us. GAAP and non GAAP other income and expenses are expected to be an income of approximately 250,000,000 dollars excluding gains and losses from non affiliated investments. GAAP and non GAAP tax rates are expected to be 17%, plus or minus 1% excluding any discrete items. Further financial details are included in the CFO commentary and other information available on our IR website. In closing, let me highlight some upcoming events for the financial community.

Speaker 2

00:23:35

We will attend the Morgan Stanley Technology and Media and Telecom Conference in San Francisco on March 4 and the TD Cohen's 44th Annual Healthcare Conference in Boston on March 5th. And of course, please join us for our annual GCC conference starting Monday, March 18th in San Jose, California To be held in person for the first time in 5 years, DTC will kick off with Janssen's keynote and we will host a Q and A session for financial analysts the next day, March 19. Time, we are now open the call for questions. Operator, would you please poll for questions?

Operator

00:24:36

Your first question comes from the line of Toshiya Hari from Goldman Sachs. Your line is open.

Speaker 4

00:24:43

Hi. Thank you so much for taking the question and congratulations on the really strong results. My question is for Jensen on the data center business. Clearly, you're doing extremely well in the business. I'm curious how your expectations for calendar 202425 have evolved over the past 90 days.

Speaker 4

00:25:05

And as you answer the question, I was hoping you can touch on some of the newer buckets within data center, things like software, Sovereign AI, I think you've been pretty vocal about how to think about that medium to long term. And recently there was an article about NVIDIA potentially participating in the ASIC market. Is there any credence to that? And if so, how should we think about you guys playing in that market over the next several years? Thank you.

Speaker 5

00:25:31

Thanks, Toshiya.

Speaker 3

00:25:35

Let's see.

Speaker 5

00:25:38

There were 3 questions. One more time. First question was

Speaker 4

00:25:44

I guess your expectations for data center, how they've evolved. Thank you.

Speaker 5

00:25:49

Okay. Yes. Well, we guide 1 quarter at a time, but fundamentally the conditions are excellent for continued growth calendar 'twenty four to calendar 'twenty five and beyond. And let me tell you why. We're at the beginning of 2 industry wide transitions and both of them are industry wide.

Speaker 5

00:26:16

The first one is a transition from general to accelerated computing. General purpose computing, as you know, is starting to run out of steam. And you could tell by the CSPs extending and many data centers, including our own for general purpose computing extending the depreciation from 4 to 6 years, there's just no reason to update with more CPUs when you can't fundamentally and dramatically enhance its throughput like you used to. And so you have to accelerate everything. This is what NVIDIA has been pioneering for some time.

Speaker 5

00:26:51

And with accelerated computing, you can dramatically improve your energy efficiency. You can dramatically improve your cost in data processing by 20:one, huge numbers. And of course, the speed. That speed is so incredible that we enabled a second industry wide transition called generative AI. Generative AI, we can I'm sure we're going to talk plenty about it during the call.

Speaker 5

00:27:21

But remember, generative AI is a new application. It is enabling a new way of doing software, new types of software being created. It is a new way of computing. You can't do generative AI on traditional general purpose computing. You have to accelerate it.

Speaker 5

00:27:39

And the third is it is enabling a whole new industry. And this is something worthwhile to take a step back and look at and it connects to your last question about Sovereign AI. A whole new industry in the sense that for the very first time a data center is not just about computing data and storing data and serving the employees of a company. We now have a new type of data center that is about AI generation, an AI generation factory. And you've heard me describe it as AI factories.

Speaker 5

00:28:18

But basically, it takes raw material, which is data, it transforms it with these AI supercomputers that NVIDIA builds and it turns them into incredibly valuable tokens. These tokens are what people experience on the amazing ChatGPT or midjourney or search these days are augmented by that. All of your recommender systems are now augmented by that. The hyper personalization that goes along with it. All of these incredible startups in digital biology, generating proteins and generating chemicals and the list goes on.

Speaker 5

00:29:03

And so, all of these tokens are generated in a very specialized type of data center. And this data center we call AI supercomputers and AI generation factories. But we're seeing diversity. One of the other reasons the so at the foundation is that the way it manifests into new markets is in all of the diversity that you're seeing us in. 1, the amount of inference that we do is just off the charts now.

Speaker 5

00:29:36

Almost every single time you interact with ChatGPT, you know that we're inferencing. Every time you use MidJourney, we're inferencing. Every time you see amazing, these SOAR videos that are being generated or runway, the videos that they're editing, Firefly, NVIDIA doing inferencing. The inference part of our business has grown tremendously. We estimate about 40%.

Speaker 5

00:29:59

The amount of training is continuing because these models are getting larger and larger, the amount of inference is increasing. But we're also diversifying into new industries. The large CSPs are still continuing to build out. You could see from their CapEx and their discussions. But there's a whole new category called GPU Specialized CSPs.

Speaker 5

00:30:24

They specialize NVIDIA AI Infrastructure, GPU Specialized CSPs. You're seeing enterprise software platforms deploying AI. ServiceNow is just a really, really great example. You see Adobe, they see others, SAP and others. You see consumer Internet services that are now augmenting all of their services at the past with generative AI.

Speaker 5

00:30:48

So they can have even more hyper personalized content to be created. You see us talking about industrial generative AI. Now our industries represent multibillion dollar businesses, auto, health, financial services in total are vertical industries are multibillion dollar businesses now. And of course Sovereign AI. The reason for Sovereign AI has to do with the fact that the language, the knowledge, the history, the culture of each region are different and they own their own data.

Speaker 5

00:31:29

They would like to use their data, train it with to create their own digital intelligence and provision it to harness that raw material themselves. It belongs to them. Each one of the regions around the world, the data belongs to them. The data is most useful to their society. And so they want to protect the data.

Speaker 5

00:31:50

They want to transform it themselves, value added transformation into AI and provision those services themselves. So, we're seeing sovereign AI infrastructures being built in Japan, in Canada, in France, so many other regions. And so my expectation is that what is being experienced here in the United States, in the West will surely be replicated around the world. And these AI generation factories are going to be in every industry, every company, every region. And so I think the last this last year, we've seen generative AI really becoming a whole new application space, a whole new way of doing computing, a whole new industry is being formed and that's driving our growth.

Operator

00:32:51

Your next question comes from the line of Joe Moore from Morgan Stanley. Your line is open.

Speaker 6

00:32:57

Great. Thank you. I wanted to follow-up on the 40% of revenues coming from inference. That's a bigger number than I expected. Can you give us some sense of where that number was maybe a year before?

Speaker 6

00:33:10

How much you're seeing growth around LLMs from inference? And how are you measuring that? Is that I assume it's in some cases the same GPUs you use

Speaker 2

00:33:18

for training and

Speaker 6

00:33:18

inference. How solid is that measurement? Thank you.

Speaker 5

00:33:24

I'll go backwards. The estimate is probably understated and but we estimated it. And let me tell you why. Whenever a year ago, the recommender systems that people are when you run the Internet, the news, the videos, the music, the products that are being recommended to you because as you know the Internet has trillions, I don't know how many trillions, but trillions of things out there and your phone is 3 inches squared. And so the ability for them to fit all of that information down to something such as small real estate is through a system, an amazing system called recommender systems.

Speaker 5

00:34:12

These recommender systems used to be all based on CPU approaches. But the recent migration to deep learning and now generative AI has really put these recommender systems now directly into the path of GPU acceleration. It needs GPU acceleration for the embeddings. It needs GPU acceleration for the nearest neighbor search, it needs GPU acceleration for re ranking and needs GPU acceleration to generate the augmented information for you. So, GPUs are in every single step of recommender system now.

Speaker 5

00:34:52

And as you know, recommender system is the single largest software engine on the planet. Almost every major company in the world has to run these large recommender systems. Whenever you use chat GPT as being inference, whenever you hear about Midjourney and just the number of things that they're generating for consumers. When you see Getty, the work that we do with Getty and Firefly from Adobe, these are all generative models. The list goes on.

Speaker 5

00:35:26

And none of these, as I mentioned, existed a year ago, 100% new.

Operator

00:35:33

Your next question comes from the line of Stacy Rasgon from Bernstein Research. Your line is open.

Speaker 7

00:35:40

Hi, guys. Thanks for taking my question. I wanted to Colette, I wanted to touch on your comments that you expected the next generation of products, I assume that meant Blackwell, to be supply constrained. Can you dig into that a little bit? What is the driver of that?

Speaker 7

00:35:56

Why does that get constrained as Hopper is easing up? And how long do you expect that to be constrained? Like do you expect the next generation to be constrained like all the way through calendar 2025 like when do

Speaker 3

00:36:07

those start to ease?

Speaker 5

00:36:10

Yes, the first thing is, overall, our supply is improving. Overall, our supply chain is just doing an incredible job for us. Everything from, of course, the wafers, the packaging, the memories, all of the power regulators to transceivers and networking and cables and you name it. The list of components that we ship, as you know, people think that NVIDIA GPU is like a chip, but the NVIDIA Hopper GPU has 35,000 parts. It weighs 70 pounds These things are really complicated things.

Speaker 5

00:36:55

We've built people call it an AI supercomputer for good reason. If you ever look in the back of the data center, the systems, the cabling system is mind boggling. It is the most dense complex cabling system for networking the world's ever seen. Our InfiniBand business grew 5x year over year. The supply chain is really doing fantastic supporting us.

Speaker 5

00:37:20

And so, overall, the supply is improving. We expect the demand will continue to be stronger than our supply provides and through the year and we'll do our best. The cycle times are improving and we're going to continue to do our best. However, whenever we have new products, as you know, it ramps from 0 to a very large number and you can't do that overnight. Everything is ramped up.

Speaker 5

00:37:54

It doesn't step up. And so, whenever we have a new generation of products and right now we are ramping H200s, there's no way we can reasonably keep up on demand in the short term as we ramp. We're ramping Spectrum X. We're doing incredibly well with Spectrum X. It's our brand new product into the world of Ethernet.

Speaker 5

00:38:22

InfiniBand is the standard for AI dedicated systems. Ethernet with SpectrumX, Ethernet is just not a very good scale out system. But with Spectrum X, we've augmented layered on top of Ethernet fundamental new capabilities like adaptive routing, congestion control, noise isolation or traffic isolation, so that we can optimize Ethernet for AI. And so, InfiniBand will be our AI dedicated infrastructure, Spectrum X will be our AI optimized networking and that is ramping. And so, we'll with all of new products, demand is greater than supply.

Speaker 5

00:39:11

And that's just kind of the nature of new products. And we work as fast as we can to capture with the demand. But overall, overall net net, overall, our supply is increasing very nicely.

Operator

00:39:26

Your next question comes from the line of Matt Ramsay from TD Cowen. Your line is open.

Speaker 8

00:39:32

Good afternoon, gents and Colette. Congrats on the results. I wanted to ask, I guess, a 2 part question and it comes at what Stacy was just getting at on your demand being significantly more than your supply, even though supply is improving. And I guess the two sides of the question are, I guess, first for Colette, like how are you guys thinking about allocation of product in terms of customer readiness to deploy and sort of monitoring if there's any kind of buildup of product that might not yet be turned on. And then I guess, Jensen, for you, I'd be really interested to hear you speak a bit about the thought that you and your company are putting into the allocation of your product across customers, many of which compete with each other, across industries to smaller start up companies, to things in the healthcare arena, to governments.

Speaker 8

00:40:34

It's a very, very unique technology that you're enabling. And I'd be really interested to hear you speak a bit about how you think about fairly allocating sort of for the good of your company, but also for the good of the industry. Thanks.

Speaker 2

00:40:51

Let me first start with your question, thanks, about how we are working with our customers as they look into how they are building out their GPU instances and our allocation process. The folks that we work with, our customers that we work with have been partners with us for many years, as we have been assisting them both in what they set up in the cloud as well as what they are setting up internally. Many of these providers have multiple products going at one time to serve so many different needs across their end customers, but also what they need internally. So they are working in advance, of course, thinking about those new clusters that they will need. And our discussions with them continue not only on our Hopper architecture, but helping them understand the next wave and getting their interest and getting their outlook for the demand that they want.

Speaker 2

00:41:55

So it's always a moving process in terms of what they will purchase, what is still being built and what is in use for end customers. But the relationships that we've built and their understanding of the sophistication of the build has really helped us with that allocation and both helped us with our communications with them.

Speaker 5

00:42:19

First, our CSPs have a very clear view of our product roadmap and transitions. And that transparency with our CSPs gives them the confidence of which products to place and where and when. And so they know the timing to the best of our ability and they know quantities and of course allocation. We allocate fairly. We allocate fairly.

Speaker 5

00:43:02

Do the best of our best we can to allocate fairly and to avoid allocating unnecessarily. As you mentioned earlier, why allocate something when a data center is not ready? Nothing is more difficult than to have anything sit around. And so allocate fairly and to avoid allocating unnecessarily. And where we do the question that you asked about the end markets, you know that we have an excellent ecosystem with OEMs, ODMs, CSPs, and very importantly, end markets.

Speaker 5

00:43:47

What NVIDIA is really unique about is that we bring our customers, we bring our partners, CSPs and OEMs, we bring them customers. The biology companies, the healthcare companies, financial services companies, AI developers, large language model developers, autonomous vehicle companies, robotics companies, there's just a giant suite of robotics companies that are emerging. There are warehouse robotics to surgical robotics to humanoid robotics, all kinds of really interesting robotics companies, agriculture robotics companies. All of these startups held large companies, healthcare, financial services and auto and such are working on NVIDIA's platform. We support them directly.

Speaker 5

00:44:36

And oftentimes, we can have a 2fer by allocating to a CSP and bringing the customer to the CSP at the same time. And so this ecosystem, you're absolutely right that it's vibrant, but at the core of it, we want to allocate fairly with avoiding waste and looking for opportunities to connect partners and end users. We're looking for those opportunities all the time.

Operator

00:45:12

Your next question comes from the line of Timothy Arcuri from UBS. Your line is open.

Speaker 9

00:45:18

Thanks a lot. I wanted to ask about how you're converting backlog into revenue. Obviously, lead times for your products have come down quite a bit. Colette, you didn't talk about the inventory purchase commitments. But if I sort of add up your inventory plus the purchase commits and your prepaid supply sort of the aggregate of your supply, it was actually down a touch.

Speaker 9

00:45:38

How should we read that? Is that just you saying that you don't need to make as much of a financial commitment to your suppliers because the lead times are lower? Or is that maybe you're reaching some sort of steady state where you're closer to filling your order book in your backlog? Thanks.

Speaker 2

00:45:54

Yes. So let me highlight on those three different areas of how we look at our suppliers. You're correct, our inventory on hand, given our allocation that we're on, we're trying to as things come into inventory, immediately work to ship them to our customers. Think our customer appreciates our ability to meet the schedules that we've looked for. The second piece of it is our purchase commitments.

Speaker 2

00:46:19

Our purchase commitments have many different components into it, components that we need for manufacturing, but also often we are procuring capacity that we may need. The lengths of that need for capacity or the length of the components are all different. Some of them may be for the next two quarters, some of them may be for multiple years. I can say the same regarding our prepaids. Our prepaids are predesigned to make sure that we have the reserve capacity that we need at several of our manufacturing suppliers as we look forward.

Speaker 2

00:46:54

So wouldn't read into anything regarding approximately about the same numbers as we are increasing our supply. All of them just have different lengths as we have sometimes had to buy things in long lead times or things that needed capacity to be built for us.

Operator

00:47:15

Your next question comes from the line of Ben Reitz from Melius Research. Your line is open.

Speaker 6

00:47:22

Yes, thanks.

Speaker 8

00:47:23

Congratulations on the results. Colette, I wanted to talk about your comment regarding gross margins and that they should go back to the mid-70s. If you don't mind unpacking that and also is that due to the HBM content in the new products? And what do you think are the drivers of that comment? Thanks so much.

Speaker 2

00:47:50

Yes, thanks for the question. We highlighted in our opening remarks really about our Q4 results and our outlook for Q1. Both of those quarters are unique. Those two quarters are unique in their gross margin as they include some benefit from favorable component costs in the supply chain kind of across both our compute and networking and also in several different stages of our manufacturing process. So looking forward, we have visibility into a mid-70s gross margin for the rest of the fiscal year, taking us back to where we were before this Q4 and Q1 peak that we've had here.

Speaker 2

00:48:38

So we're really looking at just a balance of our mix. Mix is always going to be our largest driver of what we will be shipping for the rest of the year. And those are really just the drivers.

Operator

00:48:52

Your next question comes from the line of C. J. Muse from Cantor Fitzgerald. Your line is open.

Speaker 3

00:48:58

Yes, good afternoon. Thank you for taking the question. Bigger picture question for you, Jensen. When you think about the 1,000,000 x improvement in GPU compute over the last decade and expectations for similar improvements to the next. How do your customers think about the long term usability of their NVIDIA investments that they're making today?

Speaker 3

00:49:16

Did today's training clusters become tomorrow's inference clusters? How do you see this playing out? Thank you.

Speaker 5

00:49:23

Hey, CJ. Thanks for the question. Yes, that's the really cool part. If you look at the reason why we're able to improve performance so much, it's because we have 2 characteristics about our platform. 1 is that it's accelerated and 2, it's programmable.

Speaker 5

00:49:45

It's not brittle. NVIDIA is the only architecture that has gone from the very, very beginning, literally the very beginning when CNN's and Alex Krzyzewski and Ilya Suskever and Jeff Hinton first revealed AlexNet all the way through RNNs to LSTMs to every RLs to deep learning RLs to transformers to every single version, every single version of every species that have come along vision transformers, multi modality transformers that every single and then now time sequence stuff and every single variation, every single species of AI that has come along, we've been able to support it, optimize our stack for it and deploy it into our installed base. This is really the great amazing part. On the one hand, we can invent new architectures and new technologies like our Tensor Cores, like our Transformer Engine for Tensor Cores, improve new numerical formats and structures of processing like we've done with the different generations of tensor cores, meanwhile supporting the installed base at the same time. And so, as a result, we take all of our new software algorithm invest inventions, all of the inventions, new inventions of models of the industry and it runs on our installed base on the one hand.

Speaker 5

00:51:12

On the other hand, whenever we see something revolutionary, we can like transformers, we can create something brand new like the Hopper transformer engine and implement it into future. And so, we simultaneously have this ability to bring software to the installed base and keep making it better and better and better, so our customers' installed base is enriched over time with our new software. On the other hand, for new technologies, create revolutionary capabilities. Don't be surprised if in our future generation, all of a sudden, amazing breakthroughs in large language models were made possible. And those breakthroughs, some of which will be in software because they run CUDA, will be made available to the installed base.

Speaker 5

00:52:01

And so, we carry everybody with us on the one hand, we make giant breakthroughs on the other hand.

Operator

00:52:09

Your next question comes from the line of Aaron Rakers from Wells Fargo. Your line is open.

Speaker 10

00:52:15

Yes. Thanks for taking the question. I wanted to ask about the China business. I know that in your prepared comments, you said that you started shipping some alternative solutions in China. You also pointed out that you expect that contribution to continue to be about a mid single digit percent of your total data center business.

Speaker 10

00:52:33

So I guess the question is, what is the extent of products that you're shipping today into the China market? And why should we not expect that maybe other alternative solutions come to the market and expand your breadth to participate in that opportunity again? Thank you.

Speaker 5

00:52:52

At the core, remember, the U. S. Government want to limit the latest capabilities of NVIDIA's accelerated computing and AI to the Chinese market. And the U. S.

Speaker 5

00:53:08

Government would like to see us be as successful in China as possible. Within those two constraints, within those two pillars, if you will, are the restrictions. And so, we had to pause when the new restrictions came out. We immediately paused. So, we understood what the restrictions are, reconfigured our products in a way that is not software hackable in any way.

Speaker 5

00:53:43

And that took some time. And so, we reset our product offering to China and now we're sampling to customers in China and we're going to do our best to compete in that marketplace and succeed in that marketplace within the specifications of the restriction. And so that's it. This last quarter, we our business significantly declined as we paused in the marketplace. We stopped shipping in the marketplace.

Speaker 5

00:54:16

We expect this quarter to be about the same. But after that, hopefully, we can go compete for our business and do our best and we'll see how it turns out.

Operator

00:54:28

Your next question comes from the line of Harsh Kumar from Piper Sandler. Your line is open.

Speaker 3

00:54:33

Yes. Hey, gents and Colette and NVIDIA team. First of all, congratulations on a stunning quarter and guide. I wanted to talk about a

Speaker 6

00:54:41

little bit about your software business and it's pleasing to hear that it's over $1,000,000,000 But I

Speaker 3

00:54:47

was hoping, Genshin or Colette, if you could just help us understand what the different parts and pieces are for the software business? In other words, just help us unpack it a little bit so we can get a better understanding of where that growth is coming from?

Speaker 5

00:55:01

Let me take a step back and explain the fundamental reason why NVIDIA will be very successful in software. So first, as you know, accelerated computing really grew in the cloud. In the cloud, the cloud service providers have really large engineering teams and we work with them in a way that allows them to operate and manage their own business. And whenever there are any issues, we have large teams assigned to them and their engineering teams are working directly with our engineering teams and we enhance, we fix, we maintain, we patch the complicated stack of software that's involved in accelerated computing. As you know, accelerated computing is very different than general purpose computing.

Speaker 5

00:55:49

You're not starting from a program like C plus plus you compile it and things run on all your CPUs. The stacks of software necessary for every domain from data processing, SQL versus SQL structured data versus all the images and text and PDF, which is unstructured to classical machine learning, to computer vision, to speech, to large language models, all recommender systems, all of these things require different software stacks. That's the reason why NVIDIA has hundreds of libraries. If you don't have software, you can't open new markets. If you don't have software, you can't open and enable new applications.

Speaker 5

00:56:37

Software is fundamentally necessary for accelerated computing. This is the fundamental difference between accelerated computing and general purpose computing that most people took a long time to understand. And now people understand that the software is really key. And the way that we work with CSPs, that's really easy. We have large teams that are working with their large teams.

Speaker 5

00:56:58

However, now that generative AI is enabling every enterprise and every enterprise software company to embrace accelerated computing And when it is now essential to embrace accelerated computing because it is no longer possible, no longer likely anyhow to sustain improved throughput through just general purpose computing. All of these enterprise software companies and enterprise companies don't have large engineering teams to be able to maintain and optimize their software stack to run across all of the world's clouds and private clouds and on prem. So we are going to do the management, the optimization, the patching, the tuning, the install base optimization for all of their software stacks and we containerize them into our stack, we call NVIDIA AI Enterprise. And the way we go to market with it is think of that NVIDIA AI Enterprise now as a runtime like an operating system. It's an operating system for artificial intelligence.

Speaker 5

00:58:11

And we charge $4,500 per GPU per year. And my guess is that every enterprise in the world, every software enterprise company that are deploying software in all the clouds and private clouds and on prem will run on NVIDIA AI Enterprise, especially obviously for our GPUs. And so this is going to likely be a very significant business over time. We're off to a great start. And Colette mentioned that it's already at a $1,000,000,000 run rate and we're really just getting started.

Operator

00:58:52

Thank you. I will now turn the call back over to Jensen Huang, CEO for closing remarks.

Speaker 5

00:58:59

The computer industry is making 2 simultaneous platform shifts at the same time. The $1,000,000,000,000 installed base of data centers is transitioning from general purpose to accelerated computing. Every data center will be accelerated, so the world can keep up with the computing demand with increasing throughput, while managing cost and energy. The incredible speed up of NVIDIA enabled that NVIDIA enabled a whole new computing paradigm, generative AI, where software can learn, understand and generate any information from human language to the structure of biology and the 3 d world. We are now at the beginning of a new industry where AI dedicated data centers process massive raw data to refine it into digital intelligence, Like AC power generation plants of the last industrial revolution, NVIDIA AI supercomputers are essentially AI generation factories of this industrial revolution.

Speaker 5

01:00:13

Every company and every industry is fundamentally built on their proprietary business intelligence and in the future their proprietary generative AI. Generative AI has kicked off a whole new investment cycle to build the next $1,000,000,000,000 of infrastructure of AI generation factories. We believe these two trends will drive a doubling of the world's data center infrastructure installed base in the next 5 years and will represent an annual market opportunity in the 100 of billions. This new AI infrastructure will open up a whole new world of applications not possible today. We started the AI journey with the hyperscale cloud providers and consumer Internet companies.

Speaker 5

01:01:03

And now every industry is on board from automotive to healthcare to financial services to industrial to telecom, media and entertainment. NVIDIA's full stack computing platform with industry specific applications frameworks and a huge developer and partner ecosystem gives us the speed, scale and reach to help every company to help companies in every industry become an AI company. We have so much to share with you at next month's GTC in San Jose. So be sure to join us. We look forward to updating you on our progress next quarter.

Earnings Documents

NVIDIA Earnings Headlines

Quarterly Results stock ticker

What to Expect From the Q2 Earnings Reporting Cycle (NVDA)

The S&P 500 is likely to outperform its lowered bar for earnings growth in Q2. The risk to the market is the impact of tariffs and trade relations on guidance.

June 23, 2025 | marketbeat.com

The Smartest Growth Stocks to Buy Right Now

2 hours ago | fool.com

A grave, grave error.

I thought what happened 25 years ago was a once- in-a-lifetime event… but how wrong I was. Because here we are, a quarter of a century later, almost to the exact day, and it’s happening again.

| Porter & Company (Ad)

Vertiv Unveils Energy-Efficient Cooling, Power Reference Architecture for NVIDIA GB300 NVL72 Platform

2 hours ago | insidermonkey.com

NVIDIA (NasdaqGS:NVDA) Makes Strides In AI Cloud With New Hardware Deployments

July 5 at 10:08 PM | finance.yahoo.com

If I Could Buy and Hold Just 1 Stock Forever, This Would Be It

July 5 at 6:15 PM | fool.com

See More NVIDIA Headlines

Get Earnings Announcements in your inbox

Want to stay updated on the latest earnings announcements and upcoming reports for companies like NVIDIA? Sign up for Earnings360's daily newsletter to receive timely earnings updates on NVIDIA and other key companies, straight to your email.

About NVIDIA

NVIDIA (NASDAQ:NVDA) is a leading technology company specializing in the design and development of graphics processing units (GPUs) and system-on-a-chip (SoC) products. Its offerings power a wide range of applications, from high-performance gaming and professional graphics to artificial intelligence (AI) and data center compute. The company’s expertise in parallel computing and GPU architecture has positioned it at the forefront of accelerating workloads across industries such as automotive, healthcare, finance, and scientific research.

The company’s flagship GeForce GPUs serve the gaming market, delivering advanced real-time ray tracing and AI-enhanced graphics. In the professional segment, NVIDIA’s RTX and Quadro series provide visualization and simulation capabilities for designers, engineers, and content creators. For data centers, its A100 and H100 GPUs, along with the CUDA software platform and DGX systems, enable AI model training, inference, and high-performance computing workloads. NVIDIA’s Tegra SoCs and DRIVE platforms are widely adopted in automotive applications, supporting infotainment, advanced driver-assistance systems, and autonomous vehicle development.

Founded in 1993 by Jensen Huang, Chris Malachowsky, and Curtis Priem, NVIDIA introduced the world’s first GPU in 1999 and has since driven industry innovation through key milestones such as the launch of the CUDA parallel computing architecture in 2006. The company expanded its reach through strategic acquisitions, including Mellanox Technologies in 2020, bolstering its networking capabilities and complementing its data center offerings with high-speed interconnect solutions.

Headquartered in Santa Clara, California, NVIDIA operates globally with research, development, and sales offices across North America, Europe, Asia-Pacific, and Latin America. Under the leadership of co-founder and CEO Jensen Huang, along with a senior management team that includes CFO Colette Kress, the company continues to advance GPU technology and software platforms to meet the evolving demands of graphics, compute, and AI-driven markets.

Written by Jeffrey Neal Johnson

View NVIDIA Profile

More Earnings Resources from MarketBeat

Earnings Tools

Earnings By Country

Latest Articles

Upcoming Earnings

Get 30 Days of MarketBeat All Access for Free

Sign up for MarketBeat All Access to gain access to MarketBeat's full suite of research tools.

Start Your 30-Day Trial

Sign in to your free account to enjoy these benefits

In-depth profiles and analysis for 20,000 public companies.
Real-time analyst ratings, insider transactions, earnings data, and more.
Our daily ratings and market update email newsletter.

Sign in to your free account to enjoy all that MarketBeat has to offer.

Sign In
Create Account

Your Email Address:

Your Password:

Log In

or

Sign in with Google

Forgot your password?

Your Email Address:

Choose a Password:

or

Sign in with Google

By creating a free account, you agree to our terms of service. This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.