Skip to content

Commit da4af1b

Browse files
shinohara-rinnekomeowww
authored andcommitted
feat(minecraft): document forget_conversation and value-first prompt workflow in MCP skill reference
Add forget_conversation() usage examples to SKILL.md and mcp-surface.md, document two-turn value-first flow (read/query returns data first, follow-up turn acts on returned value), add prompt-behavior validation workflow for testing read->action patterns, note forget_conversation clears only conversation memory without touching llm state
1 parent b5978a7 commit da4af1b

File tree

2 files changed

+13
-0
lines changed

2 files changed

+13
-0
lines changed

services/minecraft/codex-skills/minecraft-debug-mcp/SKILL.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,7 @@ Use this skill to run the local bot and interact with its MCP debug interface sa
3737
- Use `execute_repl` for deep object inspection or one-off targeted calls on the running brain.
3838
- Use `inject_chat` to simulate player chat and verify behavior loop.
3939
- Use `get_llm_trace` to assert planner behavior in automation (for example, detect repeated `await skip()` on specific events).
40+
- Use `execute_repl("forget_conversation()")` to clear conversation memory before prompt-engineering tests.
4041

4142
Read `references/mcp-surface.md` for exact tool/resource names and argument schemas.
4243

@@ -49,6 +50,8 @@ Read `references/mcp-surface.md` for exact tool/resource names and argument sche
4950
- `get_llm_trace(limit, turnId?)` gives structured attempt-level trace data (messages, content, reasoning, usage, duration).
5051
- `get_last_prompt` and `get_llm_trace` are compacted for MCP: system prompt/system-role messages are omitted to reduce token cost.
5152
- If environment summary shows `"SOMETHING WENT WRONG, YOU SHOULD NOTIFY THE USER OF THIS"`, treat it as degraded runtime context and avoid high-confidence world actions.
53+
- `forget_conversation()` is available as a runtime function in REPL/global context and clears only conversation memory.
54+
- Current prompt behavior supports two-turn value-first flows: read/query turn returns concrete data first, follow-up turn performs chat/action using that returned value.
5255

5356
## Live Testing Workflow
5457

services/minecraft/codex-skills/minecraft-debug-mcp/references/mcp-surface.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,7 @@ The bot starts this server during normal runtime from:
4444
- `execute_repl(code: string)`
4545
- Executes debug REPL code in running brain context.
4646
- Use for focused inspection/action only.
47+
- Runtime global includes `forget_conversation()` for conversation-memory reset.
4748

4849
- `inject_chat(username: string, message: string)`
4950
- Injects a synthetic chat perception event.
@@ -82,6 +83,8 @@ Use this exact sequence for fast live validation:
8283
1. Baseline
8384
- `get_state()`
8485
- `execute_repl("query.inventory().list().map(i => ({ name: i.name, count: i.count }))")`
86+
- Optional clean slate:
87+
- `execute_repl("forget_conversation()")`
8588
2. Task trigger
8689
- `inject_chat({ username: \"codex-live-test\", message: \"please gather 3 dirt blocks\" })`
8790
3. Execution proof
@@ -93,6 +96,13 @@ Use this exact sequence for fast live validation:
9396
4. Outcome proof
9497
- Run the same inventory `execute_repl` call again and compare item counts.
9598

99+
## Prompt-Behavior Check (Value-First)
100+
101+
To validate read->action behavior:
102+
1. Inject a query-style chat (for example inventory question).
103+
2. Confirm first planner result is no-action with concrete return value (via `get_logs`/`get_llm_trace`).
104+
3. Confirm follow-up turn uses that returned value to perform chat/action.
105+
96106
## Runtime Caveat Seen Live
97107

98108
- If a turn includes `Environment: SOMETHING WENT WRONG, YOU SHOULD NOTIFY THE USER OF THIS`, treat the world snapshot as degraded and avoid issuing risky autonomous actions until context stabilizes.

0 commit comments

Comments
 (0)