About 39,200 results
Open links in new tab
  1. Grokking (machine learning) - Wikipedia

    In ML research, "grokking" is not used as a synonym for "generalization"; rather, it names a sometimes-observed delayed‑generalization training phenomenon in which training and …

  2. Grokking Explained: A Statistical Phenomenon - arXiv.org

    Feb 3, 2025 · Grokking, or delayed generalization, is an intriguing learning phenomenon where test set loss decreases sharply only after a model’s training set loss has converged. This …

  3. GROKKING Definition & Meaning - Merriam-Webster

    Grok may be the only English word that derives from Martian. Yes, we do mean the language of the planet Mars. No, we're not getting spacey; we've just ventured into the realm of science …

  4. Do Machine Learning Models Memorize or Generalize?

    When Does Grokking Happen? It’s important to note that grokking is a contingent phenomenon — it goes away if model size, weight decay, data size and other hyper parameters aren’t just …

  5. What is Grokking? From Rote to Revelation, overfitting represents …

    May 15, 2025 · Grokking forces us to reconsider established practices in training neural networks. It challenges the validity of early stopping criteria and suggests that a model appearing to …

  6. Grokking in Neural Networks: A Review | SN Computer Science

    Jul 11, 2025 · One such phenomenon is grokking. According to the Oxford English Dictionary, “to grok something” means “to understand something completely using your feelings rather than …

  7. Grokking - GitHub Pages

    Grokking, or delayed generalization, is a phenomenon where generalization in a deep neural network (DNN) occurs long after achieving near zero training error. Previous studies have …

  8. Carlisia Campos - Grokking

    Nov 26, 2025 · Grokking implies experiential, embodied learning, something beyond surface-level exposure. It hints of an orientation towards fluid intuition, rather than rigid knowing or …

  9. Grokking: A Deep Dive into Delayed Generalization in Neural

    Jun 14, 2024 · One of the most intriguing is the phenomenon of grokking, where neural networks exhibit surprisingly delayed generalization, achieving high performance on unseen data long …

  10. An Analysis of Grokking - Eric J. Michaud

    Last year, some researchers at OpenAI released a short paper called Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets. In this paper, they documented a curious …