Nvidia CEO Jensen Huang talks blowout quarter, AI, inferencing, ongoing demand, and more

JULIE HYMAN: I'm Julie Hyman, host of Yahoo Finance's Market Domination, here with our tech editor Dan Howley. Nvidia has done it again. The chip giant blew past analyst expectations in its strong fiscal first quarter, with data center revenue alone soaring 427% year over year. The company also gave another bullish sales forecast, which shows that AI spending momentum continues apace. On top of all that, the company announced a ten-for-one forward stock split and raised its dividend. Joining us now, fresh off the conference call, is Nvidia founder and CEO Jensen Huang. Jensen, welcome. Thank you so much for being with us.

JENSEN HUANG: I'm happy to be here. Nice to see you guys.

JULIE HYMAN: You too. I want to start with Blackwell, your next-generation chip. You said on the call that it's shipping this year, and that we will see a lot of Blackwell revenue this year. So if we're looking at about $28 billion in revenue in the current quarter, and Blackwell is a more expensive product than Hopper, the chip series out now, what does that imply about revenue in the fourth quarter and for the full year?

JENSEN HUANG: Well, it should be significant. As you know, we guide one quarter at a time, but what I can tell you about Blackwell is this: it is a

giant leap in AI. It was designed for trillion-parameter AI models, and as you know, we're already at two trillion parameters. Model sizes are roughly doubling every six months, and the amount of processing, between the size of the model and the amount of data, is growing four times. So the ability of these data centers to keep up with these large models really depends on the technology that we bring to them. Blackwell is also designed for incredibly fast inferencing. Inference used to be about recognition of things, but now, as you know, inferencing is about generation of information: generative AI. Whenever you're talking to ChatGPT and it's generating information for you, or drawing a picture for you, or recognizing something and then drawing something for you, that generation is a brand-new inferencing technology. It's really complicated and requires a lot of performance. So Blackwell is designed for large models, for generative AI, and we designed it to fit into any data center: air-cooled or liquid-cooled, x86 or this revolutionary new processor we designed called Grace, the Grace Blackwell superchip. It supports InfiniBand data centers like we used to, but we also now support a brand-new

type of data center: Ethernet. We're going to bring AI to Ethernet data centers. So the number of ways you can deploy Blackwell is way higher than with the Hopper generation. I'm excited about that.

DAN HOWLEY: I want to talk about the inferencing, Jensen. Some analysts have brought up the idea that as we move over toward inferencing from training, there may be some in-house processors from companies like Microsoft, Google, and Amazon that are better suited for the actual inferencing. How does that impact Nvidia?

JENSEN HUANG: Well, inferencing used to be easy. When people started talking about inference, generative AI didn't exist. Generative AI is, of course, about prediction, but it's prediction of the next token, or prediction of the next pixel, or prediction of the next frame, and all of that is complicated. Generative AI is also used for understanding the context: in order to generate the content properly, you have to understand the context, in what is called memory. So now the memory size is incredibly large, you have to have context memory, and you have to be able to generate the next token really, really fast. It takes a whole lot of tokens to make an image, and a ton of tokens to make a

video, and it takes a lot of tokens to be able to reason about a particular task so that it can make a plan. So the generative AI era really made inference a million times more complicated, and as you know, a number of the chips that were intended for inference kind of fell by the wayside. Now people are talking about building new chips. The versatility of Nvidia's architecture makes it possible for people to continue to innovate and create these amazing new AIs, and now Blackwell's coming.

JULIE HYMAN: So in other words, you think you still have a competitive advantage even as the market shifts to inferencing?

JENSEN HUANG: We have a great position in inference because inference is just a really complicated problem. The software stack is complicated, the types of models that people use are complicated, and there are so many different types. It's just going to be a giant market opportunity for us. The vast majority of the world's inferencing today, as people are experiencing it in their data centers and on the web, is done on Nvidia, and I expect that to continue.

JULIE HYMAN: You said on the call a couple of times that you'll be supply constrained for both Hopper and then

Blackwell chips, well into next year, because of the vast demand that's out there. What can you do about that? Are there any levers you can pull to help increase supply?

JENSEN HUANG: Hopper demand grew throughout this quarter, after we announced Blackwell, and that kind of tells you how much demand there is out there. People want to deploy these data centers right now. They want to put our GPUs to work right now and start making money and start saving money, so that demand is just so strong. It's really important to take a step back and realize that what we build is not a GPU chip. We call it Blackwell and we call it a GPU, but we're really building AI factories. These AI factories have CPUs and GPUs and really complicated memory. The systems are really complicated: they're connected by NVLink, there's an NVLink Switch, there are InfiniBand switches and InfiniBand NICs, and now we have Ethernet switches and Ethernet NICs, all of it connected together with this incredibly complicated spine, NVLink. And then the amount of software it takes to build all of this and run all of this is incredible. So these AI factories are essentially what we build. We build it as a holistic unit, as a holistic

architecture and platform, but then we disaggregate it so that our partners can take it and put it into data centers of any kind. Every single cloud has slightly different architectures and different stacks, and our stacks and our architecture can now be deeply integrated into theirs, but everybody's a little different. So we build it as an AI factory, and then we disaggregate it so that everybody can have AI factories. This is just an incredible thing, and we do it at very high volume. It's just very, very hard to do. Every part of our data center is the most complex computer the world's ever made, so it's sensible that almost everything is constrained.

DAN HOWLEY: Jensen, I want to ask about the cloud providers versus the other industries that you said are getting into the generative AI game, or getting Nvidia chips. In the comments in the actual release, we heard from CFO Colette Kress that mid-40s percent of data center revenue comes from those cloud providers. As we start to see these other industries open up, what does that mean for Nvidia? Will the cloud providers kind of shrink their share, and will these other industries pick up

Where those Cloud providers Were I expect I expect them both to grow Uh a couple of different areas of course Uh the consumer internet service Providers this last quarter of course a Big stories from meta the uh the Incredible scale that that um Mark is Investing in uh llama 2 was a Breakthrough llama 3 was even more Amazing they're creating models that That are that are activating uh large Language model and generative AI work All over the world and so so the work That meta is doing is really really Important uh you also saw uh uh Elon Talking about uh the incredible Infrastructure that he's building and And um one of the things that's that's Really revolutionary about about the the Version 12 of of uh Tesla's uh full Self-driving is that it's an endtoend Generative model and it learns from Watching video surround video and it it Learns about how to drive uh end to end And Jed using generative AI uh uh Predict the next the path and and the How distur the uh how to understand and How to steer the car and so the the Technology is really revolutionary and The work that they're doing is Incredible so I gave you two examples a A startup company that we work with Called recursion has built a Supercomputer for generating molecules

understanding proteins, and generating molecules for drug discovery. The list goes on. I mean, we could go on all afternoon, there are just so many different areas of people who are now recognizing that we have software and AI models that can understand and learn almost any language: the language of English, of course, but also the language of images and video and chemicals and proteins and even physics, and that can generate almost anything. It's basically like machine translation, and that capability is now being deployed at scale in so many different industries.

JULIE HYMAN: Jensen, just one more quick last question. I'm glad you talked about the auto business and what you're seeing there. You mentioned that automotive is now the largest enterprise vertical within data center. You talked about the Tesla business, but what is that all about? Is it self-driving, among other automakers too? Are there other functions that automakers are using within data center? Help us understand that a little bit better.

JENSEN HUANG: Well, Tesla is far ahead in self-driving cars, but every single car someday will have to have autonomous capability. It's safer, it's more convenient, it's more fun to drive. And in order to do that,

it is now very well known, very well understood, that learning from video directly is the most effective way to train these models. We used to train based on images that were labeled: we would say, this is a car, this is a sign, this is a road, and we would label all of that manually. It's incredible. Now we just put video right in and let the car figure it out by itself. This technology is very similar to the technology of large language models, but it requires an enormous training facility, and the reason is that the data rate of video, the amount of data in video, is so high. The same approach that's used for learning physics, the physical world, from video for self-driving cars is essentially the same AI technology used for grounding large language models so they understand the world of physics. So technologies like Sora, which is just incredible, and other technologies like Veo from Google, also incredible, the ability to generate video that makes sense and that is conditioned on a human prompt, need to learn from video. The next generation of AIs needs to be grounded in physical AI; it needs to understand the physical world, and the best way to teach

these AIs how the physical world behaves is through video: just watching tons and tons and tons of videos. So the combination of this multimodal training capability is going to require a lot of computing demand in the years to come.

JULIE HYMAN: Jensen, as always, super cool stuff, and great to be able to talk to you. Dan and I really appreciate it. Jensen Huang, everybody, founder and CEO of Nvidia.

JENSEN HUANG: Great to see you guys. Thank you.
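Editor's note: the autoregressive inference loop Huang describes, generating one token at a time conditioned on all of the context produced so far, can be sketched minimally in code. This is purely an illustrative toy, not anything from Nvidia's software stack: the bigram "model", the tiny corpus, and the function names `train_bigrams` and `generate` are all invented here, with a frequency table standing in for a real neural network.

```python
# Toy autoregressive generation: each step predicts the next token from
# the context, appends it, and feeds the grown context back in. A real
# generative model replaces the bigram table with a neural network.
from collections import Counter, defaultdict


def train_bigrams(corpus):
    """Count next-token frequencies per token (a stand-in 'model')."""
    table = defaultdict(Counter)
    for prev, nxt in zip(corpus, corpus[1:]):
        table[prev][nxt] += 1
    return table


def generate(table, prompt, max_tokens):
    """Greedy decoding: repeatedly pick the most likely next token."""
    out = [prompt]
    for _ in range(max_tokens):
        candidates = table.get(out[-1])
        if not candidates:  # no known continuation: stop early
            break
        out.append(candidates.most_common(1)[0][0])
    return out


corpus = "the car sees the road and the car steers".split()
model = train_bigrams(corpus)
print(generate(model, "the", 4))
```

Even in this toy, the cost Huang points to is visible: every output token requires another full pass over the model with a longer context, which is why fast, large-memory inference hardware matters for generation.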