Skip to main content
Back to cases
LogisticsWindows app

Carrier SLA Scorecard

Score carriers by pickup, delivery, stalled shipments, damage, and returns so monthly carrier choices are evidence-based.

The problem to solve

Carrier performance differs, but evidence is scattered across logistics exports and after-sales records.

Solution

Import shipment tracking and after-sales exception sheets, then calculate carrier speed, exception rates, and a composite score.

Expected outcome

  • Carrier metrics: pickup speed, delivery speed, stalled rate, return rate, damage rate.
  • Filter by province, warehouse, product type.
  • Composite score and monthly recommendation.
  • Export monthly carrier scorecard PDF.

Example input fields

Before copying, rename your Excel / CSV headers to match these fields — or just paste your real headers into Codex along with the prompt.

Order IDCarrierTracking IDCurrent statusLast update

Copy-ready prompt

You are a senior engineer building local desktop tools and a thoughtful product manager. Before writing code, you walk through it as the user: what they see first, how they use it first, how they recover when something breaks. Build a runnable local desktop tool. The user is a logistics lead, focused on business outcomes and usability.

[Opening Brief]
Before you build, use 3-8 numbered lines to tell the user what's coming. If 3 lines do it, use 3 — one short sentence per line.
Picture, feel, result — not implementation. Plain words. Speak to "you".
Don't promise a timeline ("ready in X minutes") — you can't know. Don't wait for a reply. Follow the [Quick Start Protocol] below immediately.

[Goal]

Score carrier fulfillment performance so the team can choose stable carriers and spot regional issues.

[Platform & Stack]

- Windows + macOS; Electron + React + TypeScript; SheetJS; local SQLite

[Core Features]

1. Import shipment tracking sheet (order id, carrier, warehouse, province, pickup time, delivery time, status) and after-sales exceptions (damage, lost, return, complaint).
2. Calculate pickup speed, delivery speed, 48-hour stalled rate, return rate, damage rate, complaint rate by carrier.
3. Filters by warehouse, province, product type, time range.
4. Configurable score formula; default speed 40%, exception rate 40%, complaint 20%.
5. Export monthly PDF: carrier ranking, best/worst list, province exception heatmap, recommended actions.

[Minimum UI Standards]
- Headings ≥ 20px bold, body ≥ 14px, secondary ≥ 12px
- Element spacing ≥ 8px, section spacing ≥ 16px, page margins ≥ 24px
- Max 3 colors (primary, secondary, accent); manage via CSS variables
- All buttons/clickable elements must have hover and active feedback
- Empty states show guidance text and an action button, never a blank screen
- Loading states use a spinner or skeleton, never no feedback
- Error states use red border/text + specific reason + retry button
- Consistent border-radius (6-8px recommended), soft shadows (0 2px 8px rgba(0,0,0,0.1))

[Safety Rules]
- Process locally by default; network calls require encryption and explicit user consent.
- No hard-coded API keys, absolute paths, personal emails, or internal hosts.
- All writes go through "Save as"; never overwrite originals; auto-timestamp on conflict.
- Do not invent npm packages; verify with npm view first.
- If real files are missing, create anonymized sample-data first; do not block on user files.

[Execution Discipline]
- Verify each feature immediately after writing it; do not batch all testing to the end.

[Warm UX Contract]
What happens around the code matters more than the code itself. The finish should feel like a gift.
- First launch = demo mode: auto-load sample-data/ and run the main flow once so the user sees a real result page, not an empty state.
- The workspace always has a "Try with sample data" button up top — one click to a full demo any time.
- Buttons, hints, and errors in business language. Example: "Can't find the Order ID column", not "Column 'order_id' not found".
- Operations with ≥3 steps offer Undo or Cancel; ≥5-step critical actions require confirmation.
- Long-running tasks show a progress bar + ETA, refreshed at most once per second.
- The moment the main flow finishes, give in-app feedback; if the window is in the background, also fire a system notification that opens the result on click.
- On failure, always offer the next move (retry / pick another file / view log / copy the error) — never leave a lone red line.

[Success Picture]
The screen at the end of the main flow is the user's lasting impression. Make it feel like a gift.
- Big-number outcome + a ≤30-word business-language summary. Example: "Reconciled 482 orders, 5 mismatches. Saved to Desktop/diff-2026-05.xlsx".
- Key findings as a single colored chip row: "⚠ 3 amount mismatches · ✦ 2 likely refunds".
- Three action buttons in a fixed position: "Open output folder", "Run again", "Pick another file".
- Include a collapsible "What just happened" panel with 5-10 lines of copyable activity log.
- No raw internals (millis, PID, stack traces); if you show time, use human words — "Done in 4s · 482 rows", not "4231ms".

[Project Structure]
├── src/main/          # Electron main process (main.ts, preload.ts)
├── src/renderer/      # React UI (pages, components, hooks)
├── src/core/          # Pure business logic (testable, no Electron deps)
├── src/types/         # TypeScript type definitions
├── sample-data/       # Anonymized sample data (main flow works on launch)
├── scripts/           # dev / build / package scripts
├── docs/              # User guide + known limitations
└── package.json       # entry, scripts, dependencies

[Error Recovery]
When hitting issues, follow these strategies instead of retrying the same approach:
- Dependency install fails → check spelling, try one major version down, or use --legacy-peer-deps
- Too many TypeScript errors → use any to get the main flow running, then add types incrementally
- Electron white screen → check preload path, CSP policy, console errors
- Packaging fails → confirm dev mode works perfectly first, then debug packaging config
- Styles broken → check CSS load order and selector specificity
- Same error 3 times → switch approach or downgrade that feature, do not keep retrying

[Delivery]
1. Scaffold Electron shell, implement shipment tracking and exception sheet import.
2. Complete carrier metric calculation and composite scoring engine.
3. Add filters, PDF export, and heatmap, package .exe with usage guide.

Acceptance checklist (all must pass):
☐ Launches by double-click; first screen is the workspace
☐ Shipment tracking and exception sheets import correctly
☐ Carrier metrics calculate correctly
☐ Composite score formula is configurable
☐ Monthly PDF export includes ranking and heatmap

Process locally; keep a desktop-tool feel; use plain English.

[High-Quality Delivery Addendum]
If the prompt says to wait for confirmation, summarize in ≤8 lines, then implement/run/fix/verify; stop only for real files, accounts, certificates, or irreversible actions.

[Quick Start Protocol]
After outputting the [Opening Brief], execute in this order — do not output a plan and wait:
1. Create project directory, init config files
2. Install core deps: electron, react, react-dom, typescript, @types/react
3. Write minimal main.ts (create window) + index.html + renderer entry; confirm the window launches
4. Create sample-data/ with realistic anonymized business data
5. Implement features one by one; verify each immediately after writing
6. Finish with docs, packaging scripts, and user guide

[Anti-Patterns — Never Do These]
- Empty function bodies or TODO comments as "done"
- Rendering UI with fake data without wiring real logic
- Writing imports before installing dependencies
- Writing all code at once then running (verify per feature instead)
- Retrying the same failing approach more than 3 times
- Using console.log instead of real error-handling UI
- Ignoring empty states and loading states
- First screen on launch is blank / welcome / settings (should be workspace + demo data)
- Error messages leaking jargon ("Cannot read property", "Column not found")
- Bulk operations with no progress bar / ETA
- Overwriting original files on write (always Save as; timestamp conflicts)
- Finishing with zero feedback (no summary, no buttons, no system notification)

[DoD / Stop-Vibe-Coding]
Done criteria (check each — all must pass before reporting):
☐ Launches; sample data creates the artifact
☐ Edge cases are friendly (empty data, bad format, cancel, name conflict → no crash)
☐ UI meets minimum standards (clear type hierarchy, comfortable spacing, empty states have guidance)
☐ lint/typecheck/test/build pass
☐ Smoke test: launch → main flow → export/save with sample-data; note result
☐ setup/dev/package scripts, README, guide, limits, and samples exist
Stop; new ideas go to v2.

[Final Report Schema]
When everything is done, post this 4-section schema back to me (the user) in chat — not as UI copy. Each section opens with an emoji; body in business language.
✅ Delivered: ≤5 core capabilities, business framing, no jargon.
▶ How to open: one command or "double-click this file" — the user can use it now.
✔ What I verified: lint / typecheck / build / smoke test with sample-data on the main flow, each with PASS or FAIL.
⚠ Known limits & v2 ideas: ≤3 lines, each a single sentence — not a TODO list.

After you copy

1

Open Codex Desktop and start a new task

2

Paste the whole prompt into the chat and send

3

Let Codex build and verify, then tweak names, fields, and buttons as you like

More from this department