Collinear AI’s Blog
Subscribe
Sign in
Home
Archive
About
Taming AI Agents: Why Your Butler Needs a Babysitter
They’re everywhere!!!
Jan 27
•
Meghana A Rajeev
,
Nazneen Rajani
, and
Soumyadeep Bakshi
5
Share this post
Collinear AI’s Blog
Taming AI Agents: Why Your Butler Needs a Babysitter
Copy link
Facebook
Email
Notes
More
December 2024
Collinear-Guard: Where Customization Meets Precision for Fine-Grained Evaluation and Feedback
Overview
Dec 9, 2024
•
Prapti Trivedi
,
Aditya Gulati
, and
Nazneen Rajani
6
Share this post
Collinear AI’s Blog
Collinear-Guard: Where Customization Meets Precision for Fine-Grained Evaluation and Feedback
Copy link
Facebook
Email
Notes
More
2
CollinearGuard Nano: A High-Performance, Holistically-Evaluated, Lightning-Fast Moderation Judge
Revolutionizing Safety: Ultra-Low Latency, High-Throughput Violation and False Refusal Detection Like Never Before!
Dec 3, 2024
•
Aditya Gulati
,
Prapti Trivedi
, and
Nazneen Rajani
21
Share this post
Collinear AI’s Blog
CollinearGuard Nano: A High-Performance, Holistically-Evaluated, Lightning-Fast Moderation Judge
Copy link
Facebook
Email
Notes
More
6
November 2024
Veritas Reliability Judge: A Cookbook to Benchmark AI Judges on Financial Data
When you're working with high-stakes data, like financial information, ensuring the accuracy of AI-generated responses is crucial.
Nov 18, 2024
•
Meghana A Rajeev
3
Share this post
Collinear AI’s Blog
Veritas Reliability Judge: A Cookbook to Benchmark AI Judges on Financial Data
Copy link
Facebook
Email
Notes
More
October 2024
Why AI Safety is existentially important, not optional
From virtual assistants that streamline our tasks to intelligent systems that drive innovation, AI's potential seems boundless.
Oct 24, 2024
•
Jahnavi Jambholkar
,
Nazneen Rajani
,
Meghana A Rajeev
,
Tanveesh Chaudhery
, and
Rajkumar Ramamurthy
1
Share this post
Collinear AI’s Blog
Why AI Safety is existentially important, not optional
Copy link
Facebook
Email
Notes
More
Introducing VERITAS: A Unified Approach to Reliability Evaluation
Veritas is a suite of Reliability Judges for Batch and Real-time use cases
Oct 24, 2024
•
Rajkumar Ramamurthy
,
Meghana A Rajeev
,
Nazneen Rajani
, and
Oliver Molenschot
1
Share this post
Collinear AI’s Blog
Introducing VERITAS: A Unified Approach to Reliability Evaluation
Copy link
Facebook
Email
Notes
More
A Guide to Creating Seed Conversational Data with Collinear
High-quality seed data can overcome many of the post-training challenges
Oct 19, 2024
•
Jahnavi Jambholkar
,
Nazneen Rajani
, and
Tanveesh Chaudhery
3
Share this post
Collinear AI’s Blog
A Guide to Creating Seed Conversational Data with Collinear
Copy link
Facebook
Email
Notes
More
Think Before You Score: Self-Rationalizing Evaluators are State-of-the-Art for Fine-grained Evaluation
Finetuning on Rationales improves Judge Rationale and Score
Oct 10, 2024
•
Prapti Trivedi
,
Aditya Gulati
,
Oliver Molenschot
,
Meghana A Rajeev
, and
Jahnavi Jambholkar
9
Share this post
Collinear AI’s Blog
Think Before You Score: Self-Rationalizing Evaluators are State-of-the-Art for Fine-grained Evaluation
Copy link
Facebook
Email
Notes
More
Collinear Flex Judge is Better Aligned than Few-shot Prompted GPT-4o
Accelerate Time to Production with Bespoke Quality Judge
Oct 5, 2024
•
Nazneen Rajani
,
Jahnavi Jambholkar
, and
Tanveesh Chaudhery
5
Share this post
Collinear AI’s Blog
Collinear Flex Judge is Better Aligned than Few-shot Prompted GPT-4o
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts