Skip to main content

Search and Retrieval

Search in Paddles is part of the recursive action-selection contract, not a separate utility. The model repeatedly chooses search and refine actions when it needs more grounded evidence before final rendering.

The retrieval provider is the only path that turns a query into ranked workspace evidence bundles for downstream final rendering.

What Triggers Retrieval

The workspace retrieval actions route here:

  • search — issue an initial or targeted workspace query
  • refine — tighten query wording and rerun search from existing loop context

Other workspace actions (read, list_files, shell, diff, edit, etc.) run in their own tool paths and do not run through the retrieval contract.

Retrieval Providers and Capabilities

The configured retrieval provider determines which backend executes search/refine:

ProviderRoleNotes
sift-directWorkspace retrieval backend (default)Direct workspace retrieval that returns ranked evidence bundles
context-1Experimental specialized retrieval providerExposed in capability checks; may be unavailable depending on environment

Each provider reports capability before execution:

  • Available — query is accepted
  • Unsupported — provider present but request not usable in this runtime
  • HarnessRequired — provider needs harness wiring before use

When unsupported, the turn loop continues without search results but does not claim false evidence.

Search Contracts and Budgets

Every search/refine invocation maps to a ContextGatherRequest with:

  • query string and intent signal (planner-search / planner-refine)
  • retrieval plan (mode, strategy, action-selection strategy)
  • budget caps (max_items, max_summary_chars, max_snippet_chars)
  • local prior context (interpretation, recent turns, and loop-derived evidence)

Defaults are:

  • max_items: 8
  • max_summary_chars: 1200
  • max_snippet_chars: 600
  • retrieval mode: linear unless set by action selection
  • retrieval strategy: lexical unless set by action selection
  • action-selection strategy: heuristic unless set by action selection
  • step limit: 1 for each search/refine turn
  • retained artifact limit: 5

Retrieval Modes and Strategies

Modes

  • Linear — single-threaded retrieval flow
  • Graph — action graph exploration with frontier/frontier-edge metadata and branch state

Retrieval Strategy

  • Lexical — BM25-centric ranking (default)
  • Hybrid — BM25 + vector search fused with RRF (Hybrid in docs and config)

Action-Selection Strategy

  • Heuristic — deterministic expansion policy
  • Model-driven — policy guided by a model profile when configured

Action Metadata and Graph Visibility

Sift-backed graph-mode retrieval returns explicit trace structure in the evidence bundle:

  • branches and branch status
  • nodes and edges
  • frontier queue
  • active branch + completed status
  • stop reasons (goal-satisfied, step-limit-reached, no-further-queries, no-additional-evidence)

This metadata is included with retrieval summaries and available to final rendering.

Progress, ETA, and Visibility

Search progress is emitted as retrieval progress events:

  • an immediate initial event with status: initializing retrieval
  • periodic progress updates from the backend (indexing, embedding, action steps, retrieving, ranking)
  • heartbeat updates at 2 seconds, then every 60 seconds while work is ongoing

eta is optional. It can be absent (unknown) when the backend cannot provide a remaining-time estimate for the current phase.

Evidence Shape and Failure Signals

The evidence bundle always includes:

  • summary — one human-readable summary sentence
  • items — ranked evidence with source, snippet, rationale, and rank
  • optional warnings — examples include empty retained evidence or fallback extraction conditions
  • optional action trace metadata for downstream observability

If no retained evidence is available from graph-mode action state, retrieval may fall back to the latest action-view emission and records that warning explicitly.

Constraints to Know

  • Search is scoped to the current workspace root configured for the session.
  • Action-selection decisions set retrieval strategy and mode per turn; there is no separate manual search policy in the TUI.
  • SIFT locator resolution is defined in architecture as a typed ContextLocator tier but still remains unsupported at resolver time in current runtime.
  • Context Tiers — where retrieval evidence sits in the four-tier context model
  • Recursive Planning — how action-selection decisions choose search/refine
  • Turn Loop — where search fits in the overall cycle
  • Steering Signals — how truncation and recursive guidance shape what gets surfaced inline