Panic over DeepSeek Exposes AI's Weak Foundation On Hype
The drama around DeepSeek develops on an incorrect property: Large language models are the Holy Grail. This ... [+] misdirected belief has actually driven much of the AI investment frenzy.
The story about DeepSeek has actually disrupted the dominating AI narrative, affected the markets and stimulated a media storm: A big language model from China competes with the leading LLMs from the U.S. - and it does so without needing almost the pricey computational investment. Maybe the U.S. does not have the technological lead we thought. Maybe stacks of GPUs aren't needed for AI's unique sauce.
But the heightened drama of this story rests on an incorrect facility: LLMs are the Holy Grail. Here's why the stakes aren't almost as high as they're constructed out to be and the AI financial investment craze has actually been misguided.
Amazement At Large Language Models
Don't get me wrong - LLMs represent unmatched development. I've been in artificial intelligence because 1992 - the very first six of those years working in natural language processing research study - and I never believed I 'd see anything like LLMs during my life time. I am and will constantly remain slackjawed and gobsmacked.
LLMs' astonishing fluency with human language validates the ambitious hope that has fueled much device finding out research: Given enough examples from which to discover, computer systems can establish abilities so advanced, they defy human understanding.
Just as the brain's performance is beyond its own grasp, so are LLMs. We understand how to program computer systems to perform an exhaustive, automatic learning process, however we can hardly unpack the outcome, the thing that's been discovered (constructed) by the procedure: an enormous neural network. It can only be observed, not dissected.
The drama around DeepSeek develops on an incorrect property: Large language models are the Holy Grail. This ... [+] misdirected belief has actually driven much of the AI investment frenzy.
The story about DeepSeek has actually disrupted the dominating AI narrative, affected the markets and stimulated a media storm: A big language model from China competes with the leading LLMs from the U.S. - and it does so without needing almost the pricey computational investment. Maybe the U.S. does not have the technological lead we thought. Maybe stacks of GPUs aren't needed for AI's unique sauce.
But the heightened drama of this story rests on an incorrect facility: LLMs are the Holy Grail. Here's why the stakes aren't almost as high as they're constructed out to be and the AI financial investment craze has actually been misguided.
Amazement At Large Language Models
Don't get me wrong - LLMs represent unmatched development. I've been in artificial intelligence because 1992 - the very first six of those years working in natural language processing research study - and I never believed I 'd see anything like LLMs during my life time. I am and will constantly remain slackjawed and gobsmacked.
LLMs' astonishing fluency with human language validates the ambitious hope that has fueled much device finding out research: Given enough examples from which to discover, computer systems can establish abilities so advanced, they defy human understanding.
Just as the brain's performance is beyond its own grasp, so are LLMs. We understand how to program computer systems to perform an exhaustive, automatic learning process, however we can hardly unpack the outcome, the thing that's been discovered (constructed) by the procedure: an enormous neural network. It can only be observed, not dissected.