• elucubra@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    27
    arrow-down
    1
    ·
    edit-2
    16 hours ago

    Things are probably going to get worse in the short term, but the AI bubble is going to burst. Magnitude? We’ll see, but when investors realize that these companies cannot make a profit, and open source frontier models that allow you to run AI in house are removing vendor lock in, things are going to change. Also, LLMs are a dead end, and have little room to improve.

    Newer paradigms are appearing, such as Yann LeCun’s JASP, which actually learns, and other approaches, which will make LLMs obsolete, and are way less hardware intensive.

    Another factor is the Chinese closing in in consumer grade RAM. If it can be proven that no backdoor or other shenanigans are there, they will balance things somewhat.

    While current reality is what it is, there may be a massive social and traditional media manipulation by the big three and other interested parties to fuel fear of rising prices forever, to push people to buy as much as they can at these prices. I have no proof of this, but I don’t think it’s far fetched.

    And let’s not forget that for media outlets, fear and tragedy sells. (I think Hearst or some other news mogul said that last century.)

    • Prathas@lemmy.zip
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      1
      ·
      15 hours ago

      Yann LeCun’s JASP

      Wait, this is my first time reading about this. Got an ELI5 or TL;DR?

      • SabinStargem@lemmy.today
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        4
        ·
        12 hours ago

        Courtesy of Kagi’s search AI:

        000000000

        It appears there is a slight misunderstanding of the acronym: Yann LeCun’s architecture is called JEPA (Joint Embedding Predictive Architecture), not JASP.

        JEPA differs from Large Language Models (LLMs) primarily in how they learn, what they predict, and how they represent the world. While LLMs are “word models” designed to generate sequences of tokens, JEPA is intended to create “world models” that understand the underlying physics and logic of reality 3 4 .

        The key differences are summarized below:

        1. Generative vs. Predictive (Non-Generative)

          LLMs are Generative: They operate by predicting the next token in a sequence (generative AI) 2 . This approach often leads to hallucinations because the model focuses on statistical probability rather than factual ground truth 6 . JEPA is Predictive: Instead of generating every single pixel or word, JEPA predicts latent representations (embeddings) in a hidden space 5 . It tries to learn what is “plausible” rather than attempting to reconstruct every single detail of the input.

        2. Word Models vs. World Models

          LLMs are “Word Models”: They learn from text and treat intelligence as a language manipulation task 4 . LeCun argues that language captures only a small subset of human thinking and cannot represent high-dimensional physical spaces 2 7 . JEPA aims for “World Models”: It is designed to understand cause and effect, physics, and the physical environment 1 . This allows the system to reason from first principles and plan sequences of actions, which is a prerequisite for autonomous AI 1 .

        3. System 1 vs. System 2 Thinking

          LLMs (System 1): LeCun describes LLMs as “System 1” processes—they are reactive and perform a fixed amount of computation to produce each token 2 . JEPA (Path to System 2): By incorporating world models, JEPA is intended to enable “System 2” thinking—the ability to plan, reason, and deliberate before acting 1 .

        Summary Comparison Table Feature Large Language Models (LLMs) JEPA / World Models Core Goal Predict next token (Text/Code) Predict latent state (Reality/Physics) Method Generative (Pixel/Token by pixel/token) Joint Embedding (Non-generative) Domain Linguistic/Statistical patterns Physical/Causal understanding Weakness Hallucinations, lacks physical grounding Limited fluency in natural language Cognition Reactive (System 1) Planning/Reasoning (System 2)