3 "LLMs" Posts

How Large Language Models (LLMs) Learn: Calculus and the Search for Understanding

Oct 8, 2025 • Categories: AI & The Mathematics of Language, • Tags: #AI, #calculus, #gradient descent, #LLMs, #optimization

When you interact with a large language model (LLM) such as ChatGPT or Claude , the model seems to respond instantly relative to the question’s degree of difficulty. What’s easy to forget is that every word it predicts comes from a long history of learning where billions of gradient steps have slowly sculpted its understanding of language.

Large language models don’t memorize text. They optimize it. Behind that optimization lies calculus. I’m not referring to the calculus you did with pencil and paper. I’m talking about a sprawling, automated version that computes millions of derivatives per second.

At its heart, every LLM is a feedback system. It starts with random guesses, measures how wrong it was, and then adjusts itself to be slightly less wrong. The word “slightly” in this context is the essence of calculus.

“Each gradient step represents a measurable reduction in error, guiding the model toward a more stable understanding of language.”

How Large Language Models (LLMs) Think: Turning Meaning into Math

Oct 7, 2025 • Categories: AI & The Mathematics of Language, • Tags: #AI, #embeddings, #geometry, #linear algebra, #LLMs

When you enter a sentence into a Large Language Model (LLM) such as ChatGPT or Claude , the model does not process words as language. It represents them as numbers.

Each word, phrase, and code token becomes a vector — a list of real-valued coordinates within a high-dimensional space. Relationships between meanings are captured not by grammar or logic but by geometry. The closer two vectors lie, the more similar their semantic roles appear to the model.

This is the mathematical foundation of large language models: linear algebra. Matrix multiplication, vector projection, cosine similarity, and normalization define how the model navigates this vast space of meaning. What feels like understanding is actually the alignment of high-dimensional vectors governed by probability and geometry.

“Linear algebra and geometry do more than support AI; they create its language of meaning.”

How Large Language Models (LLMs) Read Code: Seeing Patterns Instead of Logic

Oct 6, 2025 • Categories: AI & The Mathematics of Language, • Tags: #AI, #LLMs, #probability, #tokenization

Developers are accustomed to thinking about code in terms of syntax and semantics, the how and the why. Syntax defines what is legal; semantics defines what it means. A compiler enforces syntax with ruthless precision and interprets semantics through symbol tables and execution logic. But a Large Language Model (LLM), reads code the way a seasoned engineer reads poetry, recognizing rhythm, pattern, and context more than explicit rules.

“When an AI system ‘understands’ code, it is not executing logic; it is modeling probability.”