NVIDIA Corp
NVIDIA is the world leader in accelerated computing.
Profit margin of 55.6% — that's well above average.
Current Price
$177.39
+0.93%GoodMoat Value
$221.97
25.1% undervaluedNVIDIA Corp (NVDA) — Q3 2025 Earnings Call Transcript
Operator
Good afternoon. My name is Joel, and I'll be your conference operator today. At this time, I would like to welcome everyone to NVIDIA's Third Quarter Earnings Call. All lines have been placed on mute to prevent any background noise. After the speakers' remarks, there will be a question-and-answer session. Thank you. Stewart Stecker, you may begin your conference.
Thank you. Good afternoon, everyone, and welcome to NVIDIA's conference call for the third quarter of fiscal 2025. With me today from NVIDIA are Jensen Huang, President and Chief Executive Officer; and Colette Kress, Executive Vice President and Chief Financial Officer. I'd like to remind you that our call is being webcast live on NVIDIA's Investor Relations website. The webcast will be available for replay until the conference call to discuss our financial results for the fourth quarter of fiscal 2025. The content of today's call is NVIDIA's property. It can't be reproduced or transcribed without our prior written consent. During this call, we may make forward-looking statements based on current expectations. These are subject to a number of significant risks and uncertainties and our actual results may differ materially. For a discussion of factors that could affect our future financial results and business, please refer to the disclosure in today's earnings release, our most recent Forms 10-K and 10-Q, and the reports that we may file on Form 8-K with the Securities and Exchange Commission. All our statements are made as of today, November 20, 2024, based on information currently available to us. Except as required by law, we assume no obligation to update any such statements. During this call, we will discuss non-GAAP financial measures. You can find a reconciliation of these non-GAAP financial measures to GAAP financial measures in our CFO commentary, which is posted on our website. With that, let me turn the call over to Colette.
Thank you, Stewart. Q3 was another record quarter. We continued to deliver incredible growth. Revenue of $35.1 billion was up 17% sequentially and up 94% year-on-year and, well above our outlook of $32.5 billion. All market platforms posted strong sequential and year-over-year growth, fueled by the adoption of NVIDIA accelerated computing and AI. Starting with Data Center, another record was achieved in Data Center. Revenue of $30.8 billion, up 17% sequential and up 112% year-on-year. NVIDIA Hopper demand is exceptional and sequentially, NVIDIA H200 sales increased significantly to double-digit billions, the fastest product ramp in our company's history. The H200 delivers up to 2 times faster inference performance and up to 50% improved TCO. Cloud service providers were approximately half of our data center sales with revenue increasing more than 2 times year-on-year. CSPs deployed NVIDIA H200 infrastructure and high-speed networking with installations scaling to tens of thousands of GPUs to grow their business and serve rapidly rising demand for AI training and inference workloads. NVIDIA H200-powered cloud instances are now available from AWS, CoreWeave, and Microsoft Azure with Google Cloud and OCI coming soon. Alongside significant growth from our large CSPs, NVIDIA GPU regional cloud revenue jumped 2 times year-on-year as North America, EMEA, and Asia Pacific regions ramped NVIDIA cloud instances and sovereign cloud buildout. Consumer Internet revenue more than doubled year-on-year as companies scaled their NVIDIA Hopper infrastructure to support next-generation AI models, training, multimodal and agentic AI, deep learning recommender engines, and generative AI inference and content creation workloads. NVIDIA's Ampere and Hopper infrastructures are fueling inference revenue growth for customers. NVIDIA is the largest inference platform in the world. Our large installed base and rich software ecosystem encourage developers to optimize for NVIDIA and deliver continued performance and TCL improvements. Rapid advancements in NVIDIA's software algorithms boosted Hopper inference throughput by an incredible 5 times in one year and cut the time to first token by 5 times. Our upcoming release of NVIDIA NIM will boost Hopper Inference performance by an additional 2.4 times. Continuous performance optimizations are a hallmark of NVIDIA and drive increasingly economic returns for the entire NVIDIA installed base. Blackwell is in full production after a successfully executed mass change. We shipped 13,000 GPU samples to customers in the third quarter, including one of the first Blackwell DGX engineering samples to OpenAI. Blackwell is a full stack, full infrastructure, AI data center scale system with customizable configurations needed to address a diverse and growing AI market from x86 to ARM, training to inferencing GPUs, InfiniBand to Ethernet switches, and NVLINK. Every customer is racing to be the first to market. Blackwell is now in the hands of all of our major partners and they are working to bring up their Data Centers. We are integrating Blackwell systems into the diverse Data Center configurations of our customers. Blackwell demand is staggering and we are racing to scale supply to meet the incredible demand customers are placing on us. Customers are gearing up to deploy Blackwell at scale. Oracle announced the world's first Zettascale AI Cloud computing clusters that can scale to over 131,000 Blackwell GPUs to help enterprises train and deploy some of the most demanding next-generation AI models. Yesterday, Microsoft announced they will be the first CSP to offer in private preview Blackwell-based cloud instances powered by NVIDIA GB200, and Quantum InfiniBand. Last week, Blackwell made its debut on the most recent round of MLPerf Training results, sweeping the per GPU benchmarks and delivering a 2.2 times leap in performance over Hopper. The results also demonstrate our relentless pursuit to drive down the cost of compute. Just 64 Blackwell GPUs are required to run the GPT-3 benchmark compared to 256 H100s or a 4 times reduction in cost. NVIDIA Blackwell architecture with NVLINK Switch enables up to 30 times faster inference performance and a new level of inference scaling throughput and response time that is excellent for running new reasoning inference applications like OpenAI's o1 model. With every new platform shift, a wave of start-ups is created. Hundreds of AI native companies are already delivering AI services with great success. Through Google, Meta, Microsoft, and OpenAI are headliners and Anthropic, Perplexity, Mistral, Adobe Firefly, Runway, Midjourney, Lightricks, Harvey, Codeium, Cursor, and Bridge are seeing great success, while thousands of AI-native startups are building new services. The next wave of AI are Enterprise AI and Industrial AI. Enterprise AI is in full throttle. NVIDIA AI Enterprise, which includes NVIDIA NeMo and NIM microservices, is an operating platform of agentic AI. Industry leaders are using NVIDIA AI to build Co-Pilots and agents. Working with NVIDIA, Cadence, Cloudera, Cohesity, NetApp, Nutanix, Salesforce, SAP, and ServiceNow are racing to accelerate development of these applications with the potential for billions of agents to be deployed in the coming years. Consulting leaders like Accenture and Deloitte are taking NVIDIA AI to the world's enterprises. Accenture launched a new business group with 30,000 professionals trained on NVIDIA AI technology to help facilitate this global build-out. Additionally, Accenture, with over 770,000 employees, is leveraging NVIDIA-powered Agentic AI applications internally, including in one case that cuts manual steps in marketing campaigns by 25% to 35%. Nearly 1,000 companies are using NVIDIA NIM, and the speed of its uptake is evident in NVIDIA AI Enterprise monetization. We expect NVIDIA AI Enterprise full-year revenue to increase over 2 times from last year, and our pipeline continues to build. Overall, our software, service, and support revenue is annualizing at $1.5 billion, and we expect to exit this year annualizing at over $2 billion. Industrial AI and robotics are accelerating. This is triggered by breakthroughs in physical AI, foundation models that understand the physical world. Like NVIDIA NeMo for enterprise AI agents, we built NVIDIA Omniverse for developers to build, train, and operate industrial AI and robotics. Some of the largest industrial manufacturers in the world are adopting NVIDIA Omniverse to accelerate their businesses, automate their workflows, and to achieve new levels of operating efficiency. Foxconn, the world's largest electronics manufacturer, is using digital twins and industrial AI built on NVIDIA Omniverse to speed the bring-up of its Blackwells factories and drive new levels of efficiency. In its Mexico facility alone, Foxconn expects to reduce over 30% in annual kilowatt-hour usage. From a geographic perspective, our Data Center revenue in China grew sequentially due to shipments of export-compliant copper products to industries. As a percentage of total Data Center revenue, it remains well below levels prior to the onset of export controls. We expect the market in China to remain very competitive going forward. We will continue to comply with export controls while serving our customers. Our sovereign AI initiatives continue to gather momentum as countries embrace NVIDIA accelerated computing for a new industrial revolution powered by AI. India's leading CSPs, including Tata Communications and Yotta Data Services, are building AI factories for tens of thousands of NVIDIA GPUs. By year-end, they will have boosted NVIDIA GPU deployments in the country by nearly 10 times. Infosys, TSE, and Wipro are adopting NVIDIA AI Enterprise and upskilling nearly 0.5 million developers and consultants to help clients build and run AI agents on our platform. In Japan, SoftBank is building the nation's most powerful AI supercomputer with NVIDIA DGX Blackwell and Quantum InfiniBand. SoftBank is also partnering with NVIDIA to transform the telecommunications network into a distributed AI network with NVIDIA AI Aerial and AI-RAN platform that can process both 5G RAN on AI on CUDA. We are launching the same in the US with T-Mobile. Leaders across Japan, including Fujitsu, NEC, and NTT, are adopting NVIDIA AI Enterprise, and major consulting companies, including EY, Strategy, and Consulting, will help bring NVIDIA AI technology to Japan's industries. Networking revenue increased 20% year-on-year. Areas of sequential revenue growth include InfiniBand and Ethernet switches, SmartNICs, and BlueField DPUs. The networking revenue was sequentially down; networking demand is strong and growing, and we anticipate sequential growth in Q4. CSPs and supercomputing centers are using and adopting the NVIDIA InfiniBand platform to power new H200 clusters. NVIDIA Spectrum-X Ethernet for AI revenue increased over 3 times year-on-year, and our pipeline continues to build with multiple CSPs and consumer Internet companies planning large cluster deployments. Traditional Ethernet was not designed for AI. NVIDIA Spectrum-X uniquely leverages technology previously exclusive to InfiniBand to enable customers to achieve massive scale of their GPU compute. Utilizing Spectrum-X, xAI's Colossus, 100,000 Hopper Supercomputer will experience zero application latency degradation and maintain 95% data throughput versus 60% for traditional Ethernet. Now moving to gaming and AI PCs. Gaming revenue of $3.3 billion increased 14% sequentially and 15% year-on-year. Q3 was a great quarter for gaming, with notebook, console, and desktop revenue all growing sequentially and year-on-year. RTX end-demand was fueled by strong back-to-school sales as consumers continued to choose GeForce RTX GPUs and devices to power gaming, creative, and AI applications. Channel inventory remains healthy, and we are gearing up for the holiday season. We began shipping new GeForce RTX AI PCs with up to 321 AI tops from ASUS and MSI, with Microsoft's Copilot+ capabilities anticipated in Q4. These machines harness the power of RTX ray tracing and AI technologies to supercharge gaming, photo and video editing, image generation, and coding. This past quarter, we celebrated the 25th anniversary of the GeForce 256, the world's first GPU. The transforming computing graphics to igniting the AI revolution, NVIDIA's GPUs have been the driving force behind some of the most consequential technologies of our time. Moving to ProViz. Revenue of $486 million was up 7% sequentially and 17% year-on-year. NVIDIA RTX workstations continue to be the preferred choice to power professional graphics, design, and engineering-related workloads. Additionally, AI is emerging as a powerful demand driver, including autonomous vehicle simulation, generative AI model prototyping for productivity-related use cases, and generative AI content creation in media and entertainment. Moving to Automotive. Revenue was a record $449 million, up 30% sequentially and up 72% year-on-year. Strong growth was driven by self-driving ramps of NVIDIA Orin and robust end-market demand for NAVs. Volvo Cars has rolled out its fully electric SUV built on NVIDIA Orin and DriveOS. Okay. Moving to the rest of the P&L. GAAP gross margin was 74.6%, and non-GAAP gross margin was 75%, down sequentially, primarily driven by a mix shift of the H100 systems to more complex and higher cost systems within Data Center. Sequentially, GAAP operating expenses and non-GAAP operating expenses were up 9% due to higher compute, infrastructure, and engineering development costs for new product introductions. In Q3, we returned $11.2 billion to shareholders in the form of share repurchases and cash dividends. Let me turn to the outlook for the fourth quarter. Total revenue is expected to be $37.5 billion, plus or minus 2%, which incorporates continued demand for Hopper architecture and the initial ramp of our Blackwell products. While demand greatly exceeds supply, we are on track to exceed our previous Blackwell revenue estimate of several billion dollars as our visibility into supply continues to increase. On gaming, although sell-through was strong in Q3, we expect fourth quarter revenue to decline sequentially due to supply constraints. GAAP and non-GAAP gross margins are expected to be 73% and 73.5%, respectively, plus or minus 50 basis points. Blackwell is a customizable AI infrastructure with several different types of NVIDIA-build chips, multiple networking options, and for air and liquid-cooled Data Centers. Our current focus is on ramping to strong demand, increasing system availability, and providing the optimal mix of configurations to our customers. As Blackwell ramps, we expect gross margins to moderate to the low-70s. When fully ramped, we expect Blackwell margins to be in the mid-70s. GAAP and non-GAAP operating expenses are expected to be approximately $4.8 billion and $3.4 billion, respectively. We are a data center scale AI infrastructure company. Our investments include building data centers for the development of our hardware and software stacks and to support new introductions. GAAP and non-GAAP other income and expenses are expected to have an income of approximately $400 million, excluding gains and losses from non-affiliated investments. GAAP and non-GAAP tax rates are expected to be 16.5% plus or minus 1%, excluding any discrete items. Further financial details are included in the CFO commentary and other information available on our IR websites. In closing, let me highlight upcoming events for the financial community. We will be attending the UBS Global Technology and AI Conference on December 3rd in Scottsdale. Please join us at CES in Las Vegas, where Jensen will deliver a keynote on January 6th, and we will host a Q&A session for financial analysts the next day on January 7th. Our earnings call to discuss results for the fourth quarter of fiscal 2025 is scheduled for February 26th, 2025.
Operator
We will now open the call for questions. Operator, can you poll for questions, please?
Yes, good afternoon. Thank you for taking the question. I guess just a question for you on the debate around whether scaling for large language models has stalled. Obviously, we're very early here but would love to hear your thoughts on this front. How are you helping your customers as they work through these issues? And then obviously, part of the context here as we're discussing clusters that have yet to benefit from Blackwell. So is this driving even greater demand for Blackwell? Thank you.
A foundation model pre-training scaling is intact and it's continuing. As you know, this is an empirical law, not a fundamental physical law, but the evidence is that it continues to scale. What we're learning, however, is that it's not enough that we've now discovered two other ways to scale. One is post-training scaling. Of course, the first generation of post-training was reinforcement learning human feedback, but now we have reinforcement learning AI feedback and all forms of synthetic data generated data that assists in post-training scaling. And one of the biggest events and one of the most exciting developments is Strawberry, ChatGPT o1, OpenAI's o1, which does inference time scaling, what's called test time scaling. The longer it thinks, the better and higher-quality answer it produces, and it considers approaches like chain of thought and multi-path planning and all kinds of techniques necessary to reflect and so on and so forth. And it's intuitively, it's a little bit like us doing thinking in our head before we answer a question. And so we now have three ways of scaling and we're seeing all three ways of scaling. And as a result of that, the demand for our infrastructure is really great. You see now that at the tail-end of the last generation of foundation models were at about 100,000 Hoppers. The next generation starts at 100,000 Blackwells. And so that kind of gives you a sense of where the industry is moving with respect to pre-training scaling, post-training scaling, and then now very importantly inference time scaling. And so the demand is really great for all of those reasons. But remember, simultaneously, we're seeing inference really starting to scale out for our company. We are the largest inference platform in the world today because our installed base is so large and everything that was trained on Amperes and Hoppers inference incredibly on Amperes and Hoppers. And as we move to Blackwells for training foundation models, it leads behind it a large installed base of extraordinary infrastructure for inference. And so we're seeing inference demand go up. We're seeing inference time scaling go up. We see the number of AI-native companies continue to grow. And of course, we're starting to see enterprise adoption of agentic AI really is the latest rage. And so we're seeing a lot of demand coming from a lot of different places.
Hi, good afternoon. Thank you so much for taking the question. Jensen, you executed the mass change earlier this year. There were some reports over the weekend about some heating issues. On the back of this, we've had investors ask about your ability to execute to the roadmap you presented at GTC this year with Ultra coming out next year and the transition to Ruben in 2026. Can you sort of speak to that? And some investors are questioning that. So if you can sort of speak to your ability to execute on time, that would be super helpful. And then a quick part B, on supply constraints, is it a multitude of componentry that's causing this? Or is it specifically HBM? Is it supply constraints? Are the supply constraints getting better? Are they worsening? Any sort of color on that would be super helpful as well. Thank you.
Yes, thank you. Blackwell production is operating at full capacity. In fact, as Colette mentioned earlier, we will deliver more Blackwells this quarter than we had initially estimated. Our supply chain team is doing an excellent job collaborating with our supply partners to boost Blackwell production, and we will continue to strive to increase it through next year. Demand is outpacing our supply, which is expected as we are at the onset of the generative AI revolution. We're also at the beginning of a new generation of foundation models capable of reasoning and long-form thinking, with physical AI being one of the most exciting areas, as it understands the physical world's structure. Therefore, Blackwell demand remains very strong, and our execution is on target. There's substantial engineering work underway globally. You might have noticed systems being established by Dell, CoreWeave, Oracle, Microsoft, and Google, all of which are racing to be first. The integration process is complex because, while we build comprehensive infrastructure, we disaggregate all of the AI supercomputers and integrate them into customized data center architectures worldwide. Although we have considerable expertise in this integration process through several generations, it still requires a significant amount of engineering effort. However, given the systems being deployed, Blackwell is in great shape. As mentioned earlier, our supply and planned shipments for this quarter are higher than previous estimates. Regarding the supply chain, we manufacture seven different chips needed for the Blackwell systems, which come in air-cooled or liquid-cooled configurations with various NVLink options. The integration of these systems into the world's data centers is remarkable. The component supply chain needed to scale has significantly improved since we shipped zero Blackwell last quarter. This quarter, we will ship billions worth of Blackwell systems, marking an extraordinary ramp-up. Nearly every company globally contributes to our supply chain, including TSMC, Amphenol, Vertiv, SK Hynix, Micron, Amkor, KYEC, Foxconn, Quanta, Wiwynn, Dell, HP, Super Micro, Lenovo, and many others, all of whom we greatly appreciate. Overall, we are in excellent shape regarding the Blackwell ramp at this time. Lastly, regarding our roadmap execution, we are on track with our annual plan, which aims to enhance our platform's performance. It's essential to recognize that by improving performance while reducing costs, we make AI training and inferencing more accessible. Another crucial point is that data centers are limited in power capacity. Regardless of size, the performance per watt directly impacts revenue for our partners. Our annual roadmap not only optimizes costs but also ensures our per-watt performance is unmatched, leading to maximum revenue generation for our customers. Maintaining this annual rhythm is vital for us, and we intend to continue along this path. Everything is progressing as planned.
Thanks a lot. I'm wondering if you can talk about the trajectory of how Blackwell is going to ramp this year. I know, Jensen, you did just talk about Blackwell being better than I think you had said several billion dollars in January. It sounds like you're going to do more than that. But I think in recent months also, you said that Blackwell crosses over Hopper in the April quarter. So I guess I had two questions. First of all, is that still the right way to think about it that Blackwell will cross over Hopper in April? And then Colette, you kind of talked about Blackwell bringing down gross margin to the low-70s as it ramps. So I guess if April is the crossover, is that the worst of the pressure on gross margin? So you're going to be kind of in the low-70s as soon as April. I'm just wondering if you can sort of shape that for us. Thanks.
Sure. Let me first start with your question, Tim. Thank you regarding our gross margins. We discussed our gross margins as we are ramping Blackwell in the very beginning and the many different configurations, the many different chips that we are bringing to market, we are going to focus on making sure we have the best experience for our customers as they stand that up. We will start growing into our gross margins, but we do believe those will be in the low 70s in that first part of the ramp. So you're correct, as you look at the quarters following after that, we will start increasing our gross margins and we hope to get to the mid-70s quite quickly as part of that ramp.
Hopper demand will continue through next year, surely the first several quarters of the next year. And meanwhile, we will ship more Blackwells next quarter than this, and we'll ship more Blackwells the quarter after that than our first quarter. And so that kind of puts it in perspective. We are really at the beginning of two fundamental shifts in computing that are really quite significant. The first is moving from coding that runs on CPUs to machine learning that creates neural networks that run on GPUs. That fundamental shift from coding to machine learning is widespread at this point. There are no companies who are not going to do machine learning. And so machine learning is also what enables generative AI. And so on the one hand, the first thing that's happening is a trillion dollars’ worth of computing systems and data centers around the world are now being modernized for machine learning. On the other hand, secondary, I guess, is that, on top of these systems, we're going to be creating a new type of capability called AI. And when we say generative AI, we're essentially saying that these data centers are really AI factories. They're generating something. Just like we generate electricity, we're now going to be generating AI. And if the number of customers is large, just as the number of consumers of electricity is large, these generators are going to be running 24/7. And today, many AI services are running 24/7, just like an AI factory. And so we're going to see this new type of system come online, and I call it an AI factory because that's really as close to what it is. It's unlike a data center of the past. And so these two fundamental trends are really just beginning. And so we expect this growth – this modernization and the creation of a new industry to go on for several years.
Thanks for taking my question. Colette, just to clarify, do you think it's a fair assumption to think NVIDIA could recover to kind of mid-70s gross margin in the back half of calendar 2025? Just wanted to clarify that. And then, Jensen, my main question historically, when we have seen hardware deployment cycles, they have inevitably included some digestion along the way. When do you think we get to that phase, or is it just too premature to discuss that because you're just the start of Blackwell? So how many quarters of shipments do you think are required to kind of satisfy this first wave? Can you continue to grow this into calendar 2026? Just how should we be prepared to see what we have seen historically, right, the periods of digestion along the way of a long-term kind of secular hardware deployment?
Okay. Vivek, thank you for the question. Let me clarify your question regarding gross margins. Could we reach the mid-70s in the second half of next year? And yes, I think it is a reasonable assumption or a goal for us to do, but we'll just have to see how that mix of ramp goes. But yes, it is definitely possible.
The way to think through that, Vivek, is I believe that there will be no digestion until we modernize a trillion dollars with the data centers. If you look at the world's data centers, the vast majority of them are built for a time when we wrote applications by hand and we ran them on CPUs. It's just not a sensible thing to do anymore. If you have – if every company's CapEx is ready to build a data center tomorrow, they ought to build it for a future of machine-learning and generative AI because they have plenty of old data centers. And so what's going to happen over the course of next X number of years, and let's assume that over the course of four years, the world's data centers could be modernized as we grow into IT, as you know, IT continues to grow about 20%, 30% a year, let's say. And let's say by 2030, the world's data centers for computing is a couple of trillion dollars. And we have to grow into that. We have to modernize the data center from coding to machine learning. That's number one. The second part of it is generative AI, and we're now producing a new type of capability that the world has never known, a new market segment that the world has never had. If you look at OpenAI, it didn't replace anything. It's something that's completely brand new. It's in a lot of ways as when the iPhone came, it was completely brand new. It wasn't really replacing anything. And so we're going to see more and more companies like that. And they're going to create and generate out of their services, essentially intelligence. Some of it would be digital artist intelligence like Runway. Some of it would be basic intelligence like OpenAI. Some of it would be legal intelligence like Harvey. Digital marketing intelligence like Reuters, so on and so forth. And the number of these companies, what are they called AI-native companies, is just in the hundreds. And almost every platform shift there was – there were Internet companies as you recall, there were cloud-first companies. They were mobile-first companies and now they're AI natives. And so these companies are being created because people see that there's a platform shift and there's a brand new opportunity to do something completely new. And so my sense is that we're going to continue to build out to modernize IT, modernize computing, number one, and then number two, create these AI factories that are going to be for a new industry for the production of artificial intelligence.
Hi, guys. Thanks for taking my questions. Colette, I had a clarification and a question for you. The clarification, just when you say low-70s gross margins, does 73.5 count as low-70s, or do you have something else in mind? And for my question, you're guiding total revenues, and total Data Center revenues in the next quarter must be up quote-unquote several billion dollars, but it sounds like Blackwell now should be up more than that. But you also said Hopper was still strong. So is Hopper down sequentially next quarter? And if it is, why? Is it because of the supply constraints? China has been pretty strong. Is China kind of rolling off a bit into Q4? So any color you can give us on sort of the Blackwell ramp and the Blackwell versus Hopper behavior into Q4 would be really helpful. Thank you.
So first starting on your first question there, Stacy, regarding our gross margin and defined low. Low, of course, is below the mid, and let's say we might be at 71%, maybe about 72%, 72.5% we're going to be in that range. We could be higher than that as well. We're just going to have to see how it comes through. We do want to make sure that we are ramping and continuing that improvement, the improvement in terms of our yields, the improvement in terms of the product as we go through the rest of the year. So we'll get up to the mid-70s by that point. The second statement was a question regarding our Hopper and what is our Hopper doing. We have seen substantial growth for our H200, not only in terms of orders but the quickness in terms of those that are standing that up. It is an amazing product, and it's the fastest-growing and ramping that we've seen. We will continue to be selling Hopper in this quarter, in Q4 for sure, that is across-the-board in terms of all of our different configurations, and our configurations include what we may do in China. But keep that in mind, folks are also at the same time looking to build out their Blackwell. So we've got a little bit of both happening in Q4. But yes, is it possible for Hopper to grow between Q3 and Q4? It's possible, but we'll just have to see.
Great. Thank you. I wonder if you could talk a little bit about what you're seeing in the inference market. You've talked about Strawberry and some of the ramifications of longer scaling inference projects. But you've also talked about the possibility that as some of these Hopper clusters age that you could use some of the Hopper latent chips for inference. So I guess, do you expect inference to outgrow training in the next kind of 12-month time frame, and just generally your thoughts there?
Our hopes and dreams is that someday, the world does a ton of inference. And that's when AI has really succeeded, right. It's when every single company is doing inference inside their companies for the marketing department and forecasting department and supply chain group and their legal department and engineering, of course, and coding, of course. And so we hope that every company is doing inference 24/7. And that there will be a whole bunch of AI-native startups, thousands of AI-native startups that are generating tokens and generating AI in every aspect of your computer experience from using Outlook to PowerPointing or when you're sitting there with Excel, you're constantly generating tokens. One of my favorite applications is NotebookLM, this Google application that came out. I use the living daylights out of it just because it's fun. And I put every PDF, every archive paper into it just to listen to it as well as scanning through it. And so I think that's the goal is to train these models so that people use it. And there's now a whole new era of AI if you will, a whole new genre of AI called physical AI, just those large language models understand the human language and how we the thinking process, if you will. Physical AI understands the physical world and it understands the meaning of the structure and understands what's sensible and what's not and what could happen and what won't and not only does it understand but it can predict and roll out a short future. That capability is incredibly valuable for industrial AI and robotics. And so that's fired up so many AI-native companies and robotics companies and physical AI companies that you're probably hearing about. And it's really the reason why we built Omniverse. Omniverse is so that we can enable these AIs to be created and learn in Omniverse and learn from synthetic data generation and reinforcement learning physics feedback instead of human feedback is now physics feedback. To have these capabilities, Omniverse was created so that we can enable physical AI. And so the goal is to generate tokens. The goal is to inference, and we're starting to see that growth happening. So I'm super excited about that. Now let me just say one more thing. Inference is super hard. And the reason why inference is super hard is because you need the accuracy to be high on the one hand. You need the throughput to be high so that the cost could be as low as possible, but you also need the latency to be low. And computers that are high throughput as well as low latency are incredibly hard to build. And these applications have long context lengths because they want to understand, they want to be able to inference within understanding the context of what's being asked to do. And so the context length is growing larger and larger. On the other hand, the models are getting larger, and they're multimodality. Just the number of dimensions that inference is innovating is incredible. And this innovation rate is what makes NVIDIA's architecture so great because our ecosystem is fantastic. Everybody knows that if they innovate on top of CUDA on top of NVIDIA's architecture, they can innovate more quickly and they know that everything should work. And if something were to happen, it's probably likely their code and not ours. And so that ability to innovate in every single direction at the same time, having a large installed base so that whatever you create could land on an NVIDIA computer and be deployed broadly all around the world in every single data center all the way out to the edge into robotic systems, that capability is really quite phenomenal.
Yes, thanks for taking the question. I wanted to ask you as we kind of focus on the Blackwell cycle and think about the data center business. When I look at the results this last quarter, Colette, you mentioned that obviously, the networking business was down about 15% sequentially, but then your comments were that you were seeing very strong demand. You mentioned also that you had multiple cloud CSP design wins for these large-scale clusters. So I'm curious if you could unpack what's going on in the networking business and where maybe you've seen some constraints and just your confidence in the pace of Spectrum-X progressing to that multiple billions of dollars that you previously had talked about. Thank you.
Let's first start with the networking. The growth year-over-year is tremendous and our focus since the beginning of our acquisition of Mellanox has really been about building together the work that we do in terms of the Data Center. The networking is such a critical part of that. Our ability to sell our networking with many of our systems that we are doing in the data center is continuing to grow and do quite well. So this quarter is just a slight dip down, and we're going to be right back up in terms of growing. They're getting ready for Blackwell and more and more systems that will be using not only our existing networking but also the networking that is going to be incorporated in a lot of these large systems that we are providing them to.
Thank you for taking my question. I have two quick ones for Colette. Colette, on the last earnings call, you mentioned that sovereign demand is in low double-digit billions. Can you provide an update on that? And then can you explain the supply-constrained situation in gaming? Is that because you're shifting your supply towards data center?
So first starting in terms of sovereign AI, such an important part of growth, something that is really surfaced with the onset of generative AI and building models in the individual countries around the world. And we see a lot of them, and we talked about a lot of them in the call today and the work that they are doing. So our sovereign AI and our pipeline going forward is still absolutely intact as those are working to build these foundational models in their own language, in their own culture, and working in terms of the enterprises within those countries. And I think you'll continue to see this be growth opportunities that you may see with our regional clouds that are being stored up and/or those that are focusing in terms of AI factories for many parts of the sovereign AI. This is areas where this is growing not only in terms of in Europe, but you're also seeing this in terms of growth in terms of in the Asia-Pac as well. Let me flip to your second question that you asked regarding gaming. So our gaming right now from a supply, we're busy trying to make sure that we can ramp all of our different products. And in this case, our gaming supply, given what we saw selling through was moving quite fast. Now the challenge that we have is how fast could we get that supply getting ready into the market for this quarter. Not to worry, I think we'll be back on track with more suppliers we turn the corner into the new calendar year. We're just going to be tight for this quarter.
Yes. Hi. Thanks a lot for the question. I wanted to ask Colette and Jensen with regard to sequential growth. So very strong sequential growth this quarter, and you're guiding to about 7%. Do your comments on Blackwell imply that we reaccelerate from there as you get more supply? Just in the first half, it would seem that there would be some catch-ups. So I was wondering how prescriptive you could be there. And then, Jensen, just overall, with the change in administration that's going to take place here in the US and the China situation, have you gotten any sense, or any conversations about tariffs, or anything with regard to your China business? Any sense of what may or may not go on? It's probably too early, but wondering if you had any thoughts there. Thanks so much.
We guide one quarter at a time.
We are working right now on the quarter that we're in and building what we need to ship in terms of Blackwell. We have every supplier on the planet working seamlessly with us to do that. And once we get to the next quarter, we'll help you understand in terms of that ramp that we'll see to the next quarter and after that.
Whatever the new administration decides, we will, of course, support the administration. And that's our highest mandate. And then after that, do the best we can. And just as we always do. And so we have to simultaneously and we will comply with any regulation that comes along fully and support our customers to the best of our abilities and compete in the marketplace. We'll do all of these three things simultaneously.
Hey, thanks for taking my question. Jensen, you mentioned in your comments you have the pre-trainings, the actual language models, and you have reinforcement learning that becomes more and more important in training and in inference as well. And then you have inference itself. And I was wondering if you have a sense like a high-level typical sense of out of an overall AI ecosystem like maybe one of your clients or one of the large models that are out there. Today, how much of the compute goes into each of these buckets? How much for the pre-training, how much for the reinforcement, and how much into inference today? Do you have any sense for how it's splitting and where the growth is the most important as well?
Well, today it's vastly in pre-training a foundation model because, as you know, post-training, the new technologies are just coming online, and whatever you could do in pre-training and post-training, you would try to do so that the inference cost could be as low as possible for everyone. However, there are only so many things that you could do priority. And so you'll always have to do on-the-spot thinking and in-context thinking and a reflection. And so I think that the fact that all three are scaling is actually very sensible based on what we are. And in the area of foundation model, now we have multimodality foundation models and the amount of petabytes of video that these foundation models are going to be trained on is incredible. And so my expectation is that for the foreseeable future, we're going to be scaling pre-training, post-training as well as inference time scaling, which is the reason why I think we're going to need more and more compute, and we're going to have to drive as hard as we can to keep increasing the performance by X factors at a time so that we can continue to drive down the cost and continue to increase their revenues and get the AI revolution going. Thank you.
Operator
Thank you. I'll now turn the call back over to Jensen Huang for closing remarks.
Thank you. The tremendous growth in our business is being fueled by two fundamental trends that are driving global adoption of NVIDIA computing. First, the computing stack is undergoing a reinvention, a platform shift from coding to machine learning. From executing code on CPUs to processing neural networks on GPUs. The trillion-dollar installed base of traditional Data center infrastructure is being rebuilt for Software 2.0, which applies machine learning to produce AI. Second, the age of AI is in full steam. Generative AI is not just a new software capability, but a new industry with AI factories manufacturing digital intelligence, a new industrial revolution that can create a multi-trillion dollar AI industry. Demand for Hopper and anticipation for Blackwell, which is now in full production, are incredible for several reasons. There are more foundation model makers now than there were a year ago. The computing scale of pre-training and post-training continues to grow exponentially. There are more AI-native start-ups than ever and the number of successful inference services is rising. And with the introduction of ChatGPT o1, OpenAI o1, a new scaling law called test time scaling has emerged. All of these consume a great deal of computing. AI is transforming every industry, company, and country. Enterprises are adopting agentic AI to revolutionize workflows. Over time, AI coworkers will assist employees in performing their jobs faster and better. Investments in industrial robotics are surging due to breakthroughs in physical AI. Driving new training infrastructure demand as researchers train world foundation models on petabytes of video and Omniverse synthetically generated data. The age of robotics is coming. Countries across the world recognize the fundamental AI trends we are seeing and have awakened to the importance of developing their national AI infrastructure. The age of AI is upon us and it's large and diverse. NVIDIA's expertise, scale, and ability to deliver full stack and full infrastructure let us serve the entire multi-trillion dollar AI and robotics opportunities ahead. From every hyperscale cloud, enterprise private cloud to sovereign regional AI clouds, on-prem to industrial edge and robotics. Thanks for joining us today and catch up next time.
Operator
This concludes today's conference call. You may now disconnect.