Newsletter #4 | Notes on Improving AI Performance

A deep dive into how leading companies are deploying AI agents reliably in production

jahnavi jambholkar, Nazneen Rajani, and Soumyadeep Bakshi

May 07, 2025

Hi AI innovators,

Welcome to Collinear’s latest update on AI safety and improvement. We're thrilled to share some exciting customer and product updates that will help you deploy AI with confidence and control.

🚀 Collaboration Spotlight: A Leading National AI Research Lab

Collinear AI recently partnered with a leading national research AI lab to revolutionize multilingual model safety for Arabic AI applications. As the first foundation models optimized for Arabic, they outperformed global alternatives—unlocking new possibilities across government, education, and business while ensuring safety remained paramount. Through our customized safety framework, we identified over 10,000 potential failure modes before deployment, securing the lab's competitive advantage in Arabic AI.

The results speak for themselves:

The lab's models achieved less than 10% failure rate in adversarial testing, while comparable open-source Llama models showed a concerning 72.6% failure rate
We co-created the first-of-its-kind Arabic safety vulnerability mapping, providing a proprietary security advantage for Arabic AI development

Our partnership has positioned this frontier lab as a leader in responsible multilingual AI innovation. Work is already underway to extend this framework to more powerful and agentic model variants.

⚙️ Product Spotlight: Conversation Builder

In today's competitive AI landscape, a pattern has emerged: Poor conversation quality is silently killing your AI adoption rates and user trust.

When these flawed conversations become your training foundation, even SOTA models will perpetuate the same frustrating user experience in production. That's why we're excited to introduce Collinear's Conversation Builder - a powerful tool designed to transform how you generate and evaluate conversational training data.

The Conversation Builder allows you to:

Engage directly with your LLM in natural dialogue
Provide immediate quality ratings and edits that serve as ground truth
Extend conversations with follow-up questions that mimic real user behavior
Export in flexible formats - unrolled for individual assistant messages or standard format for complete conversations

When every customer interaction matters, the quality of your conversational training data isn't just a technical detail - it's your competitive edge.

⭐ Quanta Magazine Feature

Our CEO Nazneen Rajani was featured in Quanta Magazine's Oral History on how ChatGPT transformed NLP alongside Sam Bowman, Julian Michael, Yejin Choi, Christiane Fellbaum to name a few from Anthropic, Scale AI, Stanford, Princeton and more!

From discussing the paper that changed everything, "Attention Is All You Need" as a PhD student to helping create open-source alternatives to ChatGPT at Hugging Face, Nazneen talked about the evolution of AI and the role she played in it.

🌱 Join Our Growing Team

We're expanding our team! Current openings:

Machine Learning Engineers
Software Engineers (Machine Learning)
Marketing

View all open positions on our Careers page.

🤝 What's Next?

Ready to improve your AI's performance? Let's talk about how Collinear can help your team take your AI solution to production.

Schedule a Demo

Collinear AI’s Blog

Discussion about this post

Ready for more?