archex: local-first, deterministic code-context for AI agents — no API key, no telemetry (Apache 2.0)
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
|
The thing this sub will care about: it's local-first by design. No hosted inference, no API key in the core, no telemetry. The whole retrieval pipeline (BM25F + local vector embeddings + RRF fusion + a local cross-encoder reranker + dependency-graph expansion) runs on your hardware and is fully deterministic, so results are reproducible across machines and CI. It's a long-running solo project, it predates the recent wave of OSS code-context tools, and I finally got it to a state worth sharing. Retrieval stack runs on your hardware: tree-sitter for parsing (25 languages), ONNX/FastEmbed for local embeddings, optional SPLADE. A BM25-only slim Docker image needs no torch at all. Measured, CI-gated numbers (19-task head-to-head vs cocoindex-code, Apple M1 Pro, same token accounting):
Telemetry: none, by design. [link] [comments] |
More from r/LocalLLaMA
-
Been running Qwen3.6-27B through a 3-critic harness. The harness matters more than I thought
Jun 30
-
I Hate Dario Amodei, and everything he stands for.
Jun 29
-
Introducing LongCat-2.0 - , a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token. This was the stealth model that was on Openrouter under the name 'owl-alpha'.
Jun 29
-
Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images!
Jun 29
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.