blog
product10 min read

What 'AI Code Review' Actually Looks Like Inside GitChat

AI code review is one of the most-promised and least-delivered features in modern developer tools. Here is the version where the AI reads the diff for you, surfaces the concerns, and lets you make every decision.

Almost every AI developer tool, at this point, claims to do code review. The claim is reasonable. The reality is mixed.

The version of AI code review that everyone has tried, and that most people have been disappointed by, looks like this. You hand the AI a pull request. The AI replies with a wall of generic feedback. Half of it is style issues that your linter already catches. A quarter of it is nitpicks about variable names. A few of the comments are actually substantive, but you have to read past the noise to find them. The signal-to-noise ratio is bad enough that, after a few attempts, the team stops using the feature.

This is a real problem. The promise of AI code review is significant. The execution, on most platforms, is poor. The reason is not that the underlying models cannot do it. It is that the way the feature is designed asks the wrong question.

This is a deep look at how GitChat does code review differently and why the difference matters.

What you are actually asking for in a review

When a senior engineer reviews a pull request, they do several distinct things almost without thinking. They check whether the change does what the description says. They check whether the tests cover the new behavior. They look for edge cases the author may have missed. They look for bugs, especially the kinds that pass tests but fail in production. They flag anything that violates the team's conventions. They look at the surrounding code that the change touches to confirm nothing else broke.

Most AI code review tools try to do all of these at once, generically. The result is a comment thread that reads like a checklist run against the diff, with everything either flagged or not flagged, with little prioritization, and with no understanding of which concerns matter for this particular change.

What good human reviewers do, and what good AI review needs to do, is focus the question. You do not ask "review this." You ask "look at this for potential bugs." Or "check whether the tests cover the new branch logic." Or "see whether this change introduces any backward-compatibility issues." The focused question gets a focused answer.

How GitChat structures the review

When you ask GitChat to review a pull request, it does something specific. It fetches the pull request metadata. It pulls the full diff, including the surrounding context where relevant. It identifies which files changed and what kind of changes they are. Then it reads the diff with the question you asked in mind.

The output you get is not a checklist. It is a structured analysis with explicit categories: potential bugs, missing test coverage, edge cases, stylistic concerns, and any other observations the AI thinks are worth surfacing. Each concern comes with the relevant file and line numbers. Each one is explained with reasoning. The AI does not just say "this might be a bug." It says "the function returns null when the input array is empty, but the caller assumes a number. Look at line 142."

You read the analysis. You agree with some items. You disagree with others. You decide what to do.

Critically, this is where the trust model kicks in. The AI does not leave comments on the pull request without your permission. It does not approve the pull request. It does not request changes. It does not request reviewers. It reads, it analyzes, it surfaces. The actions, every one of them, require you to confirm.

You can say "leave the three bugs as inline comments and request changes." You can say "ignore the style issues, leave the edge case as a comment, and approve." You can say "request the platform team as a reviewer because this touches their domain." Each action gets a plan. You approve. The action executes.

What this changes about review quality

The version of AI review most teams have tried produces lower-quality reviews because the AI is doing too much, too autonomously, and on too generic a question. The reviewer ends up sifting through noise instead of acting on signal.

The version GitChat ships does the opposite. The AI reads carefully. It surfaces what it found. You decide what to do with each item. The AI does not optimize for "leaving a lot of comments." It optimizes for "telling you what matters."

The downstream effect is that the review feels useful. You go through the AI's analysis, you make the decisions that require human judgment, and you act through the conversational interface. The pull request gets reviewed. The reviewer was you, with the AI as your analyst. The author gets a review that reads like it came from a thoughtful senior engineer, because the substantive content was yours and the legwork was the model's.

This is the model that scales. A team where every reviewer has an AI analyst at their side can review more pull requests, catch more issues, and turn around feedback faster. Not because the AI replaces the reviewer. Because the AI makes the reviewer faster at the part of the work that takes the most time: reading the diff and identifying what is worth flagging.

Stop clicking. Start typing.

GitChat's review tool reads the full diff and surfaces concerns by category. You decide which to act on. Every action — comment, approve, request changes — is gated by your approval.

The bottleneck nobody talks about

Code review is the single largest bottleneck in most teams' shipping pipelines. Industry data on engineering metrics consistently shows that pull request cycle time is dominated not by writing the code or by running the CI, but by waiting for review. Top-quartile organizations open pull requests in under three hours and complete reviews with an average of 1.1 review cycles. The industry average is slower. The bottom is much slower.

The cost of slow review is not just time-to-merge. It is the developer who has moved on to the next thing, lost the context, and has to reload the change when feedback finally arrives. It is the colleague whose work is blocked because the dependency they need is sitting in someone's review queue. It is the manager looking at metrics that say the team is busy without being productive.

Speeding up the review cycle has the highest leverage of any change you can make to a team's shipping cadence. And the way to speed it up is not to skip review or to lower the standard. It is to compress the part of review that takes the most time, which is reading the diff carefully enough to find the things that matter.

This is what AI review does well when it is designed correctly. It does not replace the reviewer. It does the careful-reading part faster, so the reviewer can spend their time on the parts only they can do.

What it feels like in practice

The first few times you use GitChat's review feature on real pull requests, the experience is slightly disorienting. You ask the AI to read a pull request. You get back a structured analysis in twenty seconds. You read it. You catch yourself looking for things to disagree with. You disagree with some. You agree with most. You make the decisions. The review is done in ten minutes.

The version of you that used to spend forty minutes on the same pull request is suspicious of this. It feels too fast. You ask yourself if you missed something. You check. You did not. The AI surfaced the same things you would have, in roughly the same order of priority, with the same reasoning. It just did the reading.

This is the moment the feature stops feeling like a productivity hack and starts feeling like infrastructure. Once your reviews go from forty minutes to ten without the quality dropping, you do not go back. You start reviewing more pull requests, more carefully, with more focused attention on the parts that need it.

The compounded effect, across a team of ten reviewers each doing this on every pull request, is not small. It is the difference between a team that has a review backlog and a team that does not. Between a team whose pull requests sit for days and a team whose pull requests turn around in hours. Between a team that ships once a week and a team that ships every day.

Finding stale pull requests is part of the same workflow

The review-the-diff feature is the most visible part of the review workflow, but it is not the only piece. The other part is knowing which pull requests are waiting for you in the first place.

You can ask GitChat to surface every pull request that has been open for longer than a threshold without action. You can ask it which pull requests are blocked on you specifically. You can ask which ones are blocking other people's work. The same tool that does the careful reading also does the queue management. The whole review workflow lives in one place.

For team leads and engineering managers, this is the part that quietly changes the team's velocity. The pull request that has been sitting unreviewed for nine days is the leak in the pipeline. Finding it and unblocking it is more valuable, in throughput terms, than any individual review. The tool that helps you find it is the same one that helps you finish it.

A note on what AI review does not do

It is worth being clear about what this version of AI review explicitly does not do.

It does not approve pull requests autonomously. The model can be wrong. Approving without you is the kind of decision that should never be automated.

It does not catch every issue. The AI is a careful reader. It is not a substitute for an expert who knows the codebase, the team, the customers, and the constraints. Use it as a first-pass analyst, not as a final judge.

It does not understand context the way your teammates do. If a change has an unusual purpose, or relies on tribal knowledge, the AI may miss the point. Bring that context yourself.

What AI review does, and what it does extraordinarily well, is the boring careful reading that humans are bad at doing under time pressure. That alone is enough to transform the bottleneck. The judgment stays with you. The legwork goes to the model. The pull request gets reviewed faster, and the team ships faster, and the code stays as good as the reviewer wants it to be.

The review that does the reading for you

Sign in with GitHub, bring your own LLM key, and try the review tool on a pull request you have been putting off. The first one is free.

That is the version of AI code review that actually works. That is the one we built.