Research Frame
CoreTex reflects several recurring findings from retrieval and agent-memory research:
| Theme | Design response |
|---|---|
| Dense embeddings miss memory-specific relevance | Qwen reranking and graded qrels check answer-bearing evidence beyond surface similarity |
| Long-horizon memory changes over time | Temporal validity, supersession, and stale-memory rejection are first-class substrate surfaces |
| Multi-hop agent memory needs structure | Relation and category-routing regions make useful paths explicit |
| Retrieval quality depends on hard negatives | Corpus generation includes plausible wrong documents and near-collision cases |
| Memory systems need compression | The 32 KB substrate forces miners to encode useful routing behavior under a fixed state budget |
| Public benchmarks invite overfitting | Hidden packs, canary records, seed reveal, and post-reveal replay separate search from verification |
Representative references:
| Reference | Relevance to CoreTex |
|---|---|
| MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval | Memory retrieval needs calibrated relevance, temporal reasoning, causal reasoning, and coreference handling. CoreTex reflects those families in qrels, hidden packs, and policy atoms. |
| Qwen3-Reranker-0.6B | Public launch reranker used for query/document ranking, pinned by model id, revision, prompt hash, and runtime. |
| On the Theoretical Limitations of Embedding-Based Retrieval | Single-vector retrieval has structural limits under realistic relevance sets. CoreTex uses compact routing state, relation surfaces, temporal atoms, and reranking. |
| WARP: An Efficient Engine for Multi-Vector Retrieval | Multi-vector and late-interaction retrieval motivate richer routing signals than one document embedding. |
| LoCoMo: Evaluating Very Long-Term Conversational Memory of LLM Agents | Long-term dialogue evaluation motivates current/stale facts, temporal grounding, and long-range consistency checks. |
| MemoryAgentBench | Incremental multi-turn memory benchmarks emphasize accurate retrieval, update behavior, long-range understanding, and selective forgetting. |
| MemoryArena | Multi-session agent tasks connect memory quality to later decisions, which supports evaluating memory as useful routing state. |
| Experience Compression Spectrum | Agent memory can be treated as compressed experience. CoreTex makes compression explicit through a fixed 32 KB substrate. |
| BEIR | Retrieval evaluation context for heterogeneous tasks and metrics such as nDCG, recall, and MRR. |
| MTEB | Broader embedding and retrieval benchmark context for model comparison and reproducible evaluation. |
CoreTex turns these concerns into protocol mechanics: compact state, public roots, hidden evaluation, pinned scoring, epoch reveal, and independent replay.