Demo Showcase Workflows

scripts/demo_showcases.py is the checkout-safe demo evidence runner for the current showcase roadmap stream. It composes existing WorldForge examples, diagnostics, preserved-run workspaces, issue bundles, replay fixtures, and host examples without installing optional model runtimes, calling paid providers, opening GUIs, or controlling robots.

uv run python scripts/demo_showcases.py list
uv run python scripts/demo_showcases.py run all --workspace-dir .worldforge/demo-showcases
uv run python scripts/demo_showcases.py run first-run --format json --overwrite

Expected success signal: the command exits 0, reports status: passed, and writes one run_manifest.json plus results/summary.json and reports/summary.md for every selected workflow. First triage step: open the failed workflow's workflow-result.json, then inspect the referenced preserved run manifest.
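The success signal above can be checked mechanically. The following is a minimal sketch, not part of the runner. It assumes that `--format json` prints a single JSON object with a top-level `status` field, which this page does not confirm, and `run_succeeded` is a hypothetical helper name.

```python
import json


def run_succeeded(returncode: int, stdout_text: str) -> bool:
    """Return True when the exit code is 0 and the JSON payload reports passed.

    Assumption: the runner's JSON output is one object with a top-level
    "status" key. Adjust the lookup if the real payload is shaped differently.
    """
    if returncode != 0:
        return False
    try:
        payload = json.loads(stdout_text)
    except json.JSONDecodeError:
        # Non-JSON output (for example the default text format) is not a pass here.
        return False
    return isinstance(payload, dict) and payload.get("status") == "passed"
```

A host script could pass it the `returncode` and `stdout` of a `subprocess.run(..., capture_output=True, text=True)` call that wraps one of the commands above.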

Artifact Layout

The default workspace is .worldforge/demo-showcases/.

| Path | Purpose | Safe attachment note |
| --- | --- | --- |
| <workflow>/workflow-result.json | short machine-readable workflow result | safe when safe_to_attach is true |
| <workflow>/runs/<run-id>/run_manifest.json | preserved command, provider, operation, status, artifacts | safe by construction; no raw secrets or signed URLs |
| <workflow>/runs/<run-id>/results/summary.json | full workflow summary | safe unless the workflow marks otherwise |
| <workflow>/runs/<run-id>/reports/summary.md | human summary with claim boundary and first triage step | safe unless the workflow marks otherwise |
| <workflow>/issue-bundle/ | issue-ready evidence bundle for the diagnostics workflow | attach only when evidence_manifest.json says safe_to_attach: true |
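The layout above lends itself to a small pre-attachment check. This is a sketch under stated assumptions rather than a WorldForge API: `missing_run_artifacts` and `safe_to_attach` are hypothetical helper names, and `safe_to_attach` is assumed to be a top-level boolean in `workflow-result.json`.

```python
from __future__ import annotations

import json
from pathlib import Path

# Per-run files named in the layout above, relative to a run directory.
RUN_ARTIFACTS = ("run_manifest.json", "results/summary.json", "reports/summary.md")


def missing_run_artifacts(workflow_dir: Path) -> list[str]:
    """List expected per-run files that are absent under <workflow>/runs/<run-id>/."""
    runs_dir = workflow_dir / "runs"
    run_dirs = sorted(p for p in runs_dir.iterdir() if p.is_dir()) if runs_dir.is_dir() else []
    if not run_dirs:
        # No preserved run at all: report the run directory itself as missing.
        return ["runs/<run-id>"]
    missing = []
    for run_dir in run_dirs:
        for rel in RUN_ARTIFACTS:
            if not (run_dir / rel).is_file():
                missing.append(f"runs/{run_dir.name}/{rel}")
    return missing


def safe_to_attach(workflow_dir: Path) -> bool:
    """Return True only when workflow-result.json explicitly sets safe_to_attach to true.

    Assumption: safe_to_attach is a top-level boolean in that file.
    """
    result_path = workflow_dir / "workflow-result.json"
    if not result_path.is_file():
        return False
    try:
        result = json.loads(result_path.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        return False
    return isinstance(result, dict) and result.get("safe_to_attach") is True
```

Both helpers take the directory that directly contains `runs/` or `workflow-result.json`, and both treat a missing or unreadable file as "not safe" so that the default is to withhold the attachment.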

Workflow Matrix

| Workflow | Issue | Command | Expected output | Primary artifact | First triage step |
| --- | --- | --- | --- | --- | --- |
| first-run | #189 | uv run python scripts/demo_showcases.py run first-run | mock world created, object added, prediction recorded, export and preflight written | first-run/exported-world.json and first-run/preflight.json | run uv run worldforge world preflight --state-dir <demo>/worlds |
| diagnostics-issue-bundle | #190 | uv run python scripts/demo_showcases.py run diagnostics-issue-bundle | skipped provider diagnostic preserved and bundled | diagnostics-issue-bundle/issue-bundle/issue.md | inspect evidence_manifest.json before attaching |
| robotics-replay | #191 | uv run python scripts/demo_showcases.py run robotics-replay | deterministic policy-plus-score replay summary | robotics-replay/robotics-replay-manifest.json | run uv run worldforge-demo-lerobot before prepared-host commands |
| remote-media-dry-run | #192 | uv run python scripts/demo_showcases.py run remote-media-dry-run | sanitized Cosmos success and Runway expired-artifact fixture events | remote-media-dry-run/remote-media-events.json | inspect redacted provider event targets before any live smoke |
| adapter-author | #193 | uv run python scripts/demo_showcases.py run adapter-author | provider scaffold generated under demo output and promotion blockers reported | adapter-author/generated-provider/ | replace placeholder fixtures, then run the generated provider test |
| batch-eval | #194 | uv run python scripts/demo_showcases.py run batch-eval | eval success and controlled benchmark budget failure preserved | batch-eval/batch-host/runs/<run-id>/run_manifest.json | inspect the failed benchmark manifest before changing budgets |
| service-host | #195 | uv run python scripts/demo_showcases.py run service-host | stdlib service host readiness and one mock request summary | service-host/runs/<run-id>/results/summary.json | run uv run python examples/hosts/service/app.py --help and inspect /readyz |
| rerun-gallery | #196 | uv run python scripts/demo_showcases.py run rerun-gallery | manifest-only Rerun gallery with missing-extra status | rerun-gallery/rerun-gallery-manifest.json | install the rerun extra before opening .rrd files |
| failure-lab | #197 | uv run python scripts/demo_showcases.py run failure-lab | isolated failure drills, preflight, and recovery commands | failure-lab/failure-lab-report.json | read recovery_commands before touching real .worldforge state |
| use-case-cookbook | #198 | uv run python scripts/demo_showcases.py run use-case-cookbook | cookbook recipe count and docs artifact reference | docs/src/use-case-cookbook.md | open the recipe matching the failed command and artifact |
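For hosts that drive the workflows one at a time, the matrix can be mirrored as data. The sketch below only builds argument lists and does not execute anything. The workflow names, issue numbers, and the `--workspace-dir` flag come from this page, while `WORKFLOW_ISSUES` and `workflow_command` are hypothetical names introduced for illustration.

```python
from __future__ import annotations

# Workflow names and issue numbers copied from the matrix above.
WORKFLOW_ISSUES = {
    "first-run": 189,
    "diagnostics-issue-bundle": 190,
    "robotics-replay": 191,
    "remote-media-dry-run": 192,
    "adapter-author": 193,
    "batch-eval": 194,
    "service-host": 195,
    "rerun-gallery": 196,
    "failure-lab": 197,
    "use-case-cookbook": 198,
}

RUNNER = ["uv", "run", "python", "scripts/demo_showcases.py"]


def workflow_command(name: str, workspace_dir: str | None = None) -> list[str]:
    """Build the argv for one workflow, rejecting names that are not in the matrix."""
    if name not in WORKFLOW_ISSUES:
        raise ValueError(f"unknown workflow: {name!r}")
    argv = [*RUNNER, "run", name]
    if workspace_dir is not None:
        argv += ["--workspace-dir", workspace_dir]
    return argv
```

Returning a list instead of a shell string keeps the command safe to hand to `subprocess.run` without shell quoting.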

Runtime Boundaries

These workflows prove the WorldForge integration layer and artifact contracts, not upstream model quality or physical execution. Optional runtimes remain host-owned:

  • LeWorldModel, LeRobot, GR00T, torch, checkpoints, simulators, and robot controllers are not installed by this runner.
  • Cosmos and Runway paths use fixture-backed events and do not make paid API calls.
  • Rerun is represented by a manifest in the checkout path; .rrd generation still requires the rerun extra or a prepared-host robotics run.
  • Provider scaffolds generated by the adapter-author workflow are intentionally incomplete and must not be registered or promoted until real fixtures, runtime manifests, docs, and tests pass.
  • Benchmark failures in the batch workflow are controlled budget failures so the issue and release evidence path can be tested without changing production thresholds.
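Because the optional runtimes stay host-owned, a host may want to see which ones are present before moving from the checkout path to prepared-host commands. The sketch below is one way to do that with the standard library. It is not how the runner determines its missing-extra status, and the import names `torch`, `lerobot`, and `rerun` are assumptions about how those packages are installed on the host.

```python
from __future__ import annotations

from importlib.util import find_spec

# Assumed top-level import names for some of the optional runtimes listed above.
OPTIONAL_MODULES = ("torch", "lerobot", "rerun")


def available_modules(names: tuple[str, ...] = OPTIONAL_MODULES) -> dict[str, bool]:
    """Map each top-level module name to whether it can be found, without importing it."""
    return {name: find_spec(name) is not None for name in names}
```

`find_spec` only locates a module, so the check does not load heavy runtimes such as torch as a side effect.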

For task-oriented commands, see the Use Case Cookbook.