2.3 KiB
Troubleshooting
Docker or Compose Is Unavailable
Run:
docker version
docker compose version
If missing, run:
./install.sh
ARM64 Image Cannot Run
Running ARM64 images on x86_64 requires binfmt/qemu:
docker run --rm --privileged tonistiigi/binfmt --install arm64
docker run --rm --platform linux/arm64 debian:bookworm-slim uname -m
Expected output: aarch64.
First Snapshot Times Out
All-five snapshot can be slow, especially under QEMU. Increase timeout:
./start.sh --timeout-secs 14400
Output Counts Are Too Low
Check:
grep LIVE_TA_REFRESH_BEFORE_SNAPSHOT .env
ls -l /var/lib/ours-rp-arm64/state/live-ta
tail -100 /var/lib/ours-rp-arm64/logs/live-ta-refresh-*.log
In file-live-ta mode, snapshot should wait until live TA refresh succeeds.
Grafana Login Fails
Check .env:
GRAFANA_ADMIN_USER=admin
GRAFANA_ADMIN_PASSWORD=admin
If Grafana has already started, changing .env may not reset the existing Grafana database. Stop services and back up/clean ${HOST_DATA_DIR}/grafana if needed.
A Finite Acceptance Test Starts an Extra Run
If .env sets a finite MAX_RUNS=3 while SOAK_RESTART_POLICY=unless-stopped, Docker Compose restarts the soak container after it exits successfully.
For finite tests, set:
SOAK_RESTART_POLICY=no
How to Confirm a Periodic Forced Snapshot
Check the latest run metadata:
latest="$(find ${HOST_DATA_DIR}/runs -maxdepth 1 -type d -name 'run_*' | sort | tail -1)"
jq '{run_id,sync_mode,snapshot_reason,periodic_snapshot_delta_count,periodic_snapshot_forced,reset_db_cleanup_status}' "$latest/run-meta.json"
For a threshold-triggered reset you should see:
sync_mode: "snapshot"snapshot_reason: "periodic_snapshot_delta_limit"periodic_snapshot_forced: true
To confirm delta counting still advances after retained runs are pruned, also inspect:
jq '{last_success_snapshot,successful_deltas_since_snapshot,recent_runs_count:(.recent_runs|length)}' \
"${HOST_DATA_DIR}/state/run-lifecycle-state.json"
Lifecycle State File Is Corrupt
The script backs up the corrupt file before best-effort bootstrap from retained runs:
ls -1 "${HOST_DATA_DIR}/state"/run-lifecycle-state.json.corrupt.*
jq . "${HOST_DATA_DIR}/state/run-lifecycle-state.json"