Collinear AI’s Blog
Subscribe
Sign in
Home
Archive
About
Latest
Top
Discussions
Is your RL environment fair to your agent?
or ensuring that you hillclimbing budget is spent right :)
May 14
•
Adit Jain
7
Whose Taste?
More data won't fix the AI verification problem. Different taste might.
May 7
•
Sachin
13
April 2026
Collinear Newsletter #11 - Notes on Frontier AI
Hi AI innovators,
Apr 30
•
Soumyadeep Bakshi
11
AI's U-235 Problem
Nuclear physics solved for k_eff. What's the AGI equivalent?
Apr 23
•
Jed Gresham
10
SimLab: The self-serve staging playground for real-world agents
Agents fail on real tool calls, long workflows, and messy data. SimLab lets you find those failures in simulation, not in production.
Apr 2
•
Sachin
11
March 2026
Collinear Newsletter #10 - Notes on Frontier AI
Happy March from the Collinear team!
Mar 24
•
Soumyadeep Bakshi
and
Nazneen Rajani
10
We gave Claude, Gemini and GPT, $250k, and it didn't go as you’d expect...
Introducing YC Bench: The first open-source, long-horizon benchmark with a simulation clock
Mar 5
•
Muyu
and
Anand Kumar
7
1
December 2025
Collinear Newsletter #9 – Notes on Frontier AI
Hi AI innovators,
Dec 19, 2025
•
Soumyadeep Bakshi
and
Nazneen Rajani
5
November 2025
RL Infrastructure for AI Agents: Why Environment-as-a-Service is the Missing Piece
Reinforcement learning for large language models is more of a systems problem than ML.
Nov 18, 2025
•
Nazneen Rajani
3
1
Announcing Spider: a lightweight tool to craft post-training data recipes
TL;DR Spider is a single client interface that turns messy distillation and ablation experiments into a simple, configurable workflow.
Nov 6, 2025
•
Soumyadeep Bakshi
6
Collinear Newsletter #8 – Notes on Improving AI
Hi AI innovators,
Nov 4, 2025
•
Soumyadeep Bakshi
8
October 2025
The case for simulations
Unlocking model uplift through better evaluations
Oct 23, 2025
•
Soumyadeep Bakshi
and
Grant Griffith
6
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts