add v5 substrate training data (40 examples)
Manual hand-crafted examples for generic substrate LoRA: - 01: identity/self-model - 02-18: tool categories (159 tools grouped by function) - 19-26: tricky tool disambiguation - 27-32: framing (substrate vs persona, HMT, psychoacoustic trust) - 33-40: EEMS deep-dive (architecture, recall, store, graph, legacy) Pretty-printed JSON, compiled to JSONL via build pipeline. System prompt uses generic assistant, not BT persona.
This commit is contained in:
@@ -0,0 +1,95 @@
|
||||
{
|
||||
"messages": [
|
||||
{
|
||||
"role": "system",
|
||||
"content": "You are a helpful AI assistant with access to tools. Use tools when they help answer the user's question. Be concise and accurate."
|
||||
},
|
||||
{
|
||||
"role": "user",
|
||||
"content": "Ask sin to check disk usage and while it's doing that, check our local git status"
|
||||
},
|
||||
{
|
||||
"role": "assistant",
|
||||
"content": "<think>\nTwo independent tasks: send a prompt to sin (async, don't wait) and check local git status. Use messages_prompt_async for sin so we don't block, and bash for local git status. Run both in parallel.\n</think>",
|
||||
"tool_calls": [
|
||||
{
|
||||
"type": "function",
|
||||
"function": {
|
||||
"name": "messages_prompt_async",
|
||||
"arguments": {
|
||||
"host": "sin",
|
||||
"id": "ses_active_sin",
|
||||
"text": "Check disk usage on all mounted filesystems and report any that are above 80% capacity."
|
||||
}
|
||||
}
|
||||
},
|
||||
{
|
||||
"type": "function",
|
||||
"function": {
|
||||
"name": "bash",
|
||||
"arguments": {
|
||||
"command": "git status --short && echo '---' && git log --oneline -5",
|
||||
"description": "Local git status and recent commits"
|
||||
}
|
||||
}
|
||||
}
|
||||
]
|
||||
},
|
||||
{
|
||||
"role": "tool",
|
||||
"content": "{\"ok\": true, \"messageId\": \"msg_xyz789\"}",
|
||||
"name": "messages_prompt_async"
|
||||
},
|
||||
{
|
||||
"role": "tool",
|
||||
"content": " M src/server.ts\n M src/config.ts\n---\nabc1234 fix: correct timeout handling\ndef5678 feat: add retry logic\n789abcd refactor: extract connection pool\nccc1111 docs: update API reference\nddd2222 chore: bump dependencies",
|
||||
"name": "bash"
|
||||
},
|
||||
{
|
||||
"role": "assistant",
|
||||
"content": "<think>\nLocal git status done. Sin is still processing. Check if it's finished.\n</think>",
|
||||
"tool_calls": [
|
||||
{
|
||||
"type": "function",
|
||||
"function": {
|
||||
"name": "sessions_status",
|
||||
"arguments": {
|
||||
"host": "sin"
|
||||
}
|
||||
}
|
||||
}
|
||||
]
|
||||
},
|
||||
{
|
||||
"role": "tool",
|
||||
"content": "[{\"id\": \"ses_active_sin\", \"status\": \"idle\", \"title\": \"disk check\"}]",
|
||||
"name": "sessions_status"
|
||||
},
|
||||
{
|
||||
"role": "assistant",
|
||||
"content": "<think>\nSin session is idle — it finished. Read the response.\n</think>",
|
||||
"tool_calls": [
|
||||
{
|
||||
"type": "function",
|
||||
"function": {
|
||||
"name": "messages_read",
|
||||
"arguments": {
|
||||
"host": "sin",
|
||||
"id": "ses_active_sin",
|
||||
"limit": 2
|
||||
}
|
||||
}
|
||||
}
|
||||
]
|
||||
},
|
||||
{
|
||||
"role": "tool",
|
||||
"content": "[{\"role\": \"assistant\", \"content\": \"Disk usage check:\\n- / : 67% (180G/270G) — OK\\n- /home : 82% (410G/500G) — ABOVE THRESHOLD\\n\\n/home is above 80%. Largest consumers: ~/models (290G), ~/.cache (45G).\"}]",
|
||||
"name": "messages_read"
|
||||
},
|
||||
{
|
||||
"role": "assistant",
|
||||
"content": "<think>\nBoth tasks complete. Summarize.\n</think>\n\n**Local git status:** 2 modified files (`src/server.ts`, `src/config.ts`), 5 recent commits.\n\n**Sin disk usage:** `/home` is at 82% (410G/500G) — above threshold. Main consumers are `~/models` (290G) and `~/.cache` (45G). Root filesystem is fine at 67%."
|
||||
}
|
||||
]
|
||||
}
|
||||
Reference in New Issue
Block a user