CodeRabbit blog

Your AI agent has amnesia

Fifty years of SDLC evolution pushed engineering toward shared understanding. Coding agents reversed the trend in 18 months.

Featured

What Claude Opus 4.7 means for AI code review

Claude Opus 4.7 outperformed in 100 evaluations across real open-source pull requests—finding more real bugs, delivering more actionable feedback, and reasoning across files better than anything we’ve tested.

Misalignment: The hidden cost of AI coding agents isn't from AI at all

TL;DR: The real cost of AI agents isn’t tokens or tools; it’s misalignment that shows up as rework, slop, and slowed teams. The conversation everyone is having (and why it misses the point) Most conversations about AI coding agents sound like a fant...

An (actually useful) framework for evaluating AI code review tools

Benchmarks promise clarity. They’re supposed to reduce a complex system to a score, compare competitors side by side, and let the numbers speak for themselves. But, in practice, they rarely do. Benchmarks don’t measure “quality” in the abstract. They...

All articles

What the Vercel breach means for enterprise code security

A stolen OAuth token brought down Vercel's internal systems. Learn the three security lessons every enterprise should take from this developer supply chain attack.

What changed in OpenAI GPT-5.5: Better judgment, stronger coding, better signal

GPT-5.5 benchmark results from CodeRabbit show improved code review precision, higher signal, and better performance in real workflows.

The IDE is no longer the center of software development

The IDE is no longer the center of software development. Learn how AI powered operational interfaces like CodeRabbit’s Agent for Slack are transforming engineering workflows, reducing context switching, and redefining developer productivity.

Your AI agent has amnesia

Fifty years of SDLC evolution pushed engineering toward shared understanding. Coding agents reversed the trend in 18 months.

Measure twice, cut once: How CodeRabbit built a planning layer on Claude

CodeRabbit's planning layer is built on Claude to catch costly assumptions before coding begins, helping teams ship better, more reliable AI-generated code, faster.

What Claude Opus 4.7 means for AI code review

Introducing the CodeRabbit plugin for Codex

Get AI-powered code reviews without leaving Codex. The CodeRabbit plugin runs reviews inside your session, catches bugs before PRs, and requires zero workflow changes to set up.

Why agentic code review beats RAG for multi-repository analysis

Traditional RAG-based code review misses cross-repo breaking changes. Learn why agentic code review delivers precise, real-time multi-repository impact analysis.