Skip to content

DefendableLedger · Royal Jelly Tiers

The five tiers

SwarmJelly assigns one of five Royal Jelly tiers to every extracted training pair, based on SwarmCurator’s 4-dimension grade.

tierscore bandwhat it isuse
Apex4.5–5.0Operator-grade ground truth · principal-level signalPrimary fine-tune corpus for Atlas + Curator + specialist Hacks
Honey3.5–4.5Production-ready · strongProduction fine-tune blends · validator chain training
Jelly2.5–3.5Solid corpus material · workhorseBreadth coverage · DPO pair candidates
Pollen1.5–2.5Broad coverage · breadth signalDiversity sampling · low-weight inclusion
Propolis< 1.5Edge cases · failure modes · repair liftSwarmFixer corpus · repair-pair training

How the tier is assigned

score = mean([accuracy, cre_judgment, format, score_overall])
if score >= 4.5: tier = "apex"
elif score >= 3.5: tier = "honey"
elif score >= 2.5: tier = "jelly"
elif score >= 1.5: tier = "pollen"
else: tier = "propolis"

The 4-dim rubric is held by SwarmCurator-9B (Qwen 3.5 base · in-house cook). No external grader. No hosted-LLM tax.

Why tiering matters

A flat corpus is a wasted corpus. Mixing apex signal with propolis edge cases at equal weight dilutes both ends — your fine-tune learns muddy ground truth AND fails to focus on the actual repair lift. Tiering preserves signal density.

Apex is what you train the first pass on. Propolis is what you train SwarmFixer on. Pollen + Jelly are the breadth corpus. Honey is the production-ready blend.

This is how the corpus compounds toward operator-grade Hacks without drifting.

What lives in each tier directory

data/swarmjelly/
├── apex/
│ ├── SJP-20260524-01KSE...AKVHNB.json
│ └── SJP-20260524-01KSE...XYZ.json
├── honey/
├── jelly/
├── pollen/
├── propolis/
└── corpus_index.jsonl # streaming index of all pairs (one line each)

Each pair file:

{
"pair_id": "SJP-YYYYMMDD-ULID",
"receipt_id": "DRR-YYYYMMDD-ULID",
"verdict_id": "TRIB-YYYYMMDD-ULID",
"tier": "apex|honey|jelly|pollen|propolis",
"input": "<raw client language>",
"output": { "assignment_type": "...", "expected_outputs": [...] },
"rubric_scores": { "accuracy": 4.5, "cre_judgment": 4.7, "format": 4.8, "score": 4.6 },
"created_at": "2026-05-24T22:33:26Z",
"pair_sha256": "<sha256>"
}

The 88K-pair compounding math

At one receipt every ~6 minutes (a steady principal cadence), the eco system mints ~88,000 training pairs per year — entirely sovereign · zero marginal cost · automatically tiered. Five years compounds to ~440K pairs / ~242K in Apex+Honey ready for fine-tune blends.

Compare to outsourced labeling at $2/pair market rate: $96K/yr of equivalent value · captured for $0 marginal · with full provenance · without sending operator-grade data to a third-party labeler.


🐝 The corpus compounds · the cook gets sharper · to the shed.