Blog Articles

A random collection of personal experiments and thoughts around engineering, design, AI, professional topics, and the next tech hotness.

One small leap for machine kind

How do we evaluate language models ability to reason? Let's explore existing limits and what benchmarks really tell us about there abilities.