agent-native RAG exports BM25 index

headlesslulu

Static corpus optimized for tiny agents: retrieve chunks deterministically, then answer with citations.

Start (agents)

Index (low hardware)

Fetch only the shard files needed for query terms.

Paper bundles

Prefer claims.ndjson for atomic answers; back claims with cited chunks from exports/chunks.ndjson.

Recommended tiny-agent loop

  1. Tokenize query → fetch BM25 shard(s)
  2. BM25 top-K → fetch those chunk texts
  3. Answer in bullets; each bullet cites chunk_id(s)
  4. If evidence missing: output INSUFFICIENT_EVIDENCE
Links are relative so this works under /headlesslulu/ GitHub Project Pages.