-
Archives
- March 2026
- February 2026
- January 2026
- December 2025
- August 2025
- July 2025
- April 2025
- March 2025
- February 2025
- January 2025
- December 2024
- October 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- August 2022
- June 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- August 2021
- July 2021
- June 2021
- April 2021
- December 2019
-
Meta
Category Archives: Uncategorized
Protected: dl
There is no excerpt because this is a protected post.
Posted in Uncategorized
Enter your password to view comments.
Protected: DL
There is no excerpt because this is a protected post.
Posted in Uncategorized
Enter your password to view comments.
Transformers are challenging the CNN
For decades, Convolutional Neural Networks (CNNs) have been the undisputed kings of computer vision. If a machine was “seeing,” it was likely using a CNN. But the landscape is shifting. Vision Transformers (ViTs) are moving from the world of Natural … Continue reading
Posted in Uncategorized
Leave a comment
Architectural Parallels and Divergences in Neural Memory
Modern generative AI is hitting a familiar wall: every time we try to make models “smarter” by stuffing in more knowledge, we also make them more expensive to run. In classic dense Transformers, memory and compute are tightly coupled, more … Continue reading
Posted in Uncategorized
Leave a comment
Language Model Introduce Operating System Mechanism To Sustain
The rapid shift of language models from research artifacts to production-critical systems has forced a deep re-evaluation of how inference workloads are architected. Early optimization efforts focused almost exclusively on training, where performance is dominated by dense matrix multiplications and … Continue reading
Posted in Uncategorized
Leave a comment
LLM – we need to decouple facts from logic
Modern AI models are inherently inefficient because they work on the same task with the same level of intensity: they engage the same amount of compute for every question instead pulling static facts in. DeepSeek Engram (https://arxiv.org/abs/2601.07372) helps in solving … Continue reading
Posted in Uncategorized
Leave a comment
Manifold AI Model-Architecture
A language model has to understand many things at the same time when it reads a sentence: word meanings, grammar, context, and world knowledge. Consider the sentence “The bank is by the river”. The model must simultaneously consider: The model … Continue reading
Posted in Uncategorized
Leave a comment
The hidden AI bottleneck
If you’ve ever splurged on a processor or raved about a supercomputer, you’ve probably been talking about “Gigahertz,” “MIPS” or “TeraFLOPS.” We often imagine these numbers in terms of horsepower in a car, a given reading showing us how fast … Continue reading
Posted in Uncategorized
Leave a comment
First Think Than Talk
NVIDIA TiDAR, which stands for “Think in Diffusion, Talk in Autoregression”, is a hybrid architecture designed to make Large Language Models (LLMs) significantly faster without losing quality. Traditional models like GPT work like a person writing a letter one word … Continue reading
Posted in Uncategorized
Leave a comment
Rethinking AI Infrastructure
A fundamental transformation is reshaping the hardware landscape, driven not by the familiar cadence of Moore’s Law, but by the physical realities of data movement. For the past decade, the dominant narrative in AI acceleration has been the relentless pursuit … Continue reading
Posted in Uncategorized
Leave a comment