4 "Probability" Posts

The Birthday Paradox in Production: When Random IDs Collide

You generate a UUID. It’s 128 bits total, with 122 bits of randomness. That’s 340 undecillion possible values. Collision-proof, right? Your system generates a million IDs per second. Still safe? What about a billion?

As I like to say, common sense and intuition are the enemies of science. Common sense tells you that with 340,000,000,000,000,000,000,000,000,000,000,000,000 possible values, you’d need to generate at least trillions before worrying about duplicates. Maybe fill 1% of the space? 10%?

Math shows us the uncomfortable truth: you’ll hit a 50% collision probability after generating just \(2.7 \times 10^{18}\) IDs. That’s 0.0000000000000000008% of your total space. At a billion IDs per second, you’ve got about 86 years. Comfortable, but not infinite. Drop to 64-bit IDs? Even at a million IDs per second, you’ve got about 1.4 hours. Just enough time to duck out for a long lunch and return to a disaster. And 32-bit at a billion per second? 77 microseconds. Faster than you can blink.
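A few lines of Python make those numbers easy to check. This is my own sketch, not code from the post; it simply applies the standard birthday bound to each ID width at the rates quoted above.

```python
import math

# For an ID space of size N = 2**bits, the birthday bound puts the
# 50% collision point at roughly 1.1774 * sqrt(N) random draws.
def ids_until_even_odds(bits: int) -> float:
    return 1.1774 * math.sqrt(2 ** bits)

# Rates mirror the paragraph above: 122 random bits at a billion IDs per
# second, 64 bits at a million, 32 bits back at a billion.
for bits, rate in [(122, 1e9), (64, 1e6), (32, 1e9)]:
    n = ids_until_even_odds(bits)
    print(f"{bits}-bit space at {rate:,.0f} IDs/s: "
          f"{n:.2e} IDs, ~{n / rate:.3g} seconds to even odds")
```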

You might know the birthday paradox: among just 23 people, there is a better-than-50% chance that two of them share a birthday. What you may not know is that this isn’t just a party trick; it’s the same mathematics that determines when your “guaranteed unique” database IDs collide, why hash tables need careful sizing, and when your distributed system’s assumptions break.
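The 23-person claim itself takes only a few lines to verify (again my own sketch, assuming 365 equally likely birthdays):

```python
# Probability that at least two of n people share a birthday,
# assuming 365 equally likely days and ignoring leap years.
def p_shared_birthday(n: int) -> float:
    p_all_distinct = 1.0
    for k in range(n):
        p_all_distinct *= (365 - k) / 365
    return 1.0 - p_all_distinct

print(p_shared_birthday(23))  # ≈ 0.507, just past even odds
```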


“In a room of 23 people, there’s a greater than 50% chance two share a birthday. In your database, collisions arrive far sooner than intuition suggests.”


Read more →

How Large Language Models (LLMs) Read Code: Seeing Patterns Instead of Logic

Developers are accustomed to thinking about code in terms of syntax and semantics, the how and the why. Syntax defines what is legal; semantics defines what it means. A compiler enforces syntax with ruthless precision and interprets semantics through symbol tables and execution logic. But a Large Language Model (LLM) reads code the way a seasoned engineer reads poetry, recognizing rhythm, pattern, and context more than explicit rules.
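To make “modeling probability” concrete, here is a minimal sketch. The choice of the Hugging Face transformers library and the small gpt2 checkpoint is mine, purely for illustration; the point is that the model never executes the snippet below, it only scores which token is likely to come next.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# A half-finished line of code: the model sees a token sequence, not an AST.
prompt = "for i in range(10):\n    print("
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits[0, -1]  # scores for the next token only
probs = torch.softmax(logits, dim=-1)

# The five most probable continuations: the "understanding" is a distribution.
top = torch.topk(probs, k=5)
for p, idx in zip(top.values, top.indices):
    print(f"{tokenizer.decode(idx)!r}: {p.item():.3f}")
```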


“When an AI system ‘understands’ code, it is not executing logic; it is modeling probability.”


Read more →

The Five-Second Rule Explored with Math & Python

You know the story: drop a cookie on the kitchen floor, swoop in before five seconds are up, and declare it safe. It is comforting. It is also wrong.


“Germs don’t wait five seconds. They start the party the instant your food hits the floor.”


The truth is much more interesting than the myth. Germs do transfer gradually, but they are especially fast at the beginning. That means if you want to know whether your floor-cookie is still edible, you need to think in curves, not in timers. And curves are something we can model.
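As a taste of what that modeling looks like, here is a toy sketch with made-up constants (illustrative only, not the post’s actual model): assume transfer follows first-order kinetics, \(N(t) = N_{\max}(1 - e^{-kt})\), which rises steepest in the very first moments of contact.

```python
import numpy as np

# Toy model: first-order transfer kinetics with hypothetical constants.
N_MAX = 1_000_000   # bacteria available on the contact patch (made up)
K = 0.8             # transfer-rate constant, per second (made up)

def transferred(t_seconds: float) -> float:
    """Bacteria picked up after t seconds of floor contact."""
    return N_MAX * (1.0 - np.exp(-K * t_seconds))

for t in (0.5, 1, 3, 5, 10):
    print(f"{t:>4} s: {transferred(t):>9,.0f} bacteria")
```

With these made-up numbers, more than half of the eventual transfer has already happened by the one-second mark; the clock was never the point.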

Read more →

Should You Walk or Run in the Rain? The Puzzle That Sparked a Passion

To walk or to run. That is the question. Early in my programming career, I came across a coding challenge that stuck with me for many years: “If it’s raining, will you stay drier by walking or running through it?” At the time, I didn’t have the skillset or tools to simulate the problem properly. It became one of the first exercises that nudged me toward a lifelong fascination with modeling the real world through code.
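For a taste of the modeling, here is the classic back-of-the-envelope version of the puzzle (a toy sketch with made-up numbers, not the post’s simulation): in vertical rain, what hits your front depends only on the distance you cover, while what lands on your head grows with the time you spend exposed.

```python
# Toy model: vertical rain, a box-shaped pedestrian, hypothetical numbers throughout.
DROPS_PER_M3 = 100.0   # raindrop density
RAIN_SPEED = 9.0       # raindrop fall speed, m/s
TOP_AREA = 0.1         # horizontal cross-section (head and shoulders), m^2
FRONT_AREA = 0.7       # vertical cross-section (chest and legs), m^2
DISTANCE = 100.0       # metres to cover

def drops_collected(speed_m_s: float) -> float:
    time_exposed = DISTANCE / speed_m_s
    from_above = DROPS_PER_M3 * RAIN_SPEED * TOP_AREA * time_exposed
    from_front = DROPS_PER_M3 * FRONT_AREA * DISTANCE  # independent of speed
    return from_above + from_front

for label, speed in (("walking", 1.4), ("running", 5.0)):
    print(f"{label:>7} at {speed} m/s: {drops_collected(speed):,.0f} drops")
```

In this simplified picture running always comes out ahead, because only the from-above term depends on how long you stay out in the rain.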

Read more →