Sitemap - 2025 - Collinear AI’s Blog

Collinear Newsletter #9 – Notes on Frontier AI

RL Infrastructure for AI Agents: Why Environment-as-a-Service is the Missing Piece

Announcing Spider: a lightweight tool to craft post-training data recipes

Collinear Newsletter #8 – Notes on Improving AI

The case for simulations

Through the Valley of Reasoning: What Small Models Teach Us About Learning

Introducing Collinear Simulations: Steerable Personas for AI Agent Testing

Collinear Newsletter #7 | Notes on Improving AI

Introducing Curator Evals: A Benchmark for High-quality Post-training Data Curation

Collinear AI Now Available on Google Cloud Marketplace

You Can’t Hire Your Way to Model Alignment

Leveling the Playing Field: Livecodebench’s Big Bug Fix

OpenAI's gpt-oss on LiveCodeBench: A Competitive Programming Deep Dive

Newsletter #6 | Notes on Improving AI Performance

Cats confuse LRMs: Exposing blind spots in SOTA Models

Data Curation: The secret sauce for enterprise AI excellence

Newsletter #5 | Notes on Improving AI Performance

Gaming the System: Goodhart’s Law Exemplified in AI Leaderboard Controversy

Judges as Data curators cut Post-training Time to Half

Newsletter #4 | Notes on Improving AI Performance

From worlds best pros to AI personas: The MasterClass journey

Issue #3 | Notes on Improving AI Performance

Issue #2 | Notes on Improving AI Performance

The Limitations of AI Evaluations

The AI Safety Gap

Issue #1 | Notes on Improving AI Performance

Taming AI Agents: Why Your Butler Needs a Babysitter