Relace Search coding
The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a codebase and return relevant files to the user request.
In contrast to RAG, relace-search performs agentic multi-step reasoning to produce highly precise results 4x faster than any frontier model. It's designed to serve as a subagent that passes its findings to an "oracle" coding agent, who orchestrates/performs the rest of the coding task.
To use relace-search you need to build an appropriate agent harness, and parse the response for relevant information to hand off to the oracle. Read more about it in the Relace documentation.
Capabilities
Context Window 256k tokens
Max Output 128k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $1.00
Output $3.00
Cache Read -
Cache Write -
Supported Parameters
max_tokensseedstoptemperaturetool_choicetoolstop_p