Two-Pass Analysis: The Summarize-Then-Correlate Pattern for Scaling Beyond Context Windows

February 22, 2026

Multi-File-Analysis, Llm-Orchestration, Context-Window-Management

Local-Llm, Two-Pass, Summarize-Correlate, Codebase-Analysis, Context-Window, Architecture-Pattern

Two-Pass Analysis: Summarize-Then-Correlate#

A 32B model with a 32K context window can process roughly 8-10 source files at once. A real codebase has hundreds. Concatenating everything into one prompt fails — the context overflows, quality degrades, and the model either truncates or hallucinates connections.

The two-pass pattern solves this by splitting analysis into two stages:

Pass 1 (Summarize): A fast 7B model reads each file independently and produces a focused summary.
Pass 2 (Correlate): A capable 32B model reads all summaries (which are much shorter than the original files) and answers the cross-cutting question.

This effectively multiplies your context window by the compression ratio of summarization — typically 10-20x. A 32K context that handles 10 files directly can handle 100-200 files through summaries.