Blog Articles

A random collection of personal experiments and thoughts around engineering, design, AI, professional topics, and the next tech hotness.

From Hype to Reality: Testing OpenAI’s o1 in a Competitive Setting

October 3, 2024

Exploring the challenges and limitations of OpenAI’s o1 model during the HackerCup competition, where it struggled with speed and accuracy.

September 18, 2024

How do we evaluate language models' ability to reason? Let's explore existing limits and what benchmarks really tell us about their abilities.

February 4, 2020

Strategies for a successful conference submission process
