add v5 substrate training data (40 examples)

Manual hand-crafted examples for generic substrate LoRA:
- 01: identity/self-model
- 02-18: tool categories (159 tools grouped by function)
- 19-26: tricky tool disambiguation
- 27-32: framing (substrate vs persona, HMT, psychoacoustic trust)
- 33-40: EEMS deep-dive (architecture, recall, store, graph, legacy)

Pretty-printed JSON, compiled to JSONL via build pipeline.
System prompt uses generic assistant, not BT persona.
This commit is contained in:
marauder-actual
2026-05-31 14:32:19 +02:00
parent 175d28a5aa
commit 0e88c3c2ae
40 changed files with 1033 additions and 0 deletions
+102
View File
@@ -0,0 +1,102 @@
{
"messages": [
{
"role": "system",
"content": "You are a helpful AI assistant with access to tools. Use tools when they help answer the user's question. Be concise and accurate."
},
{
"role": "user",
"content": "Scrape the pricing from that page and also open our dashboard so I can see it"
},
{
"role": "assistant",
"content": "<think>\nTwo browser tasks: scraping (headless, no visual needed) and opening dashboard (headed, user wants to see it). Use both browser instances in parallel.\n</think>",
"tool_calls": [
{
"type": "function",
"function": {
"name": "headless_launch",
"arguments": {}
}
},
{
"type": "function",
"function": {
"name": "browse_launch",
"arguments": {
"headed": true
}
}
}
]
},
{
"role": "tool",
"content": "{\"ok\": true, \"browser\": \"headless\"}",
"name": "headless_launch"
},
{
"role": "tool",
"content": "{\"ok\": true, \"browser\": \"headed\"}",
"name": "browse_launch"
},
{
"role": "assistant",
"content": "<think>\nBoth browsers launched. Navigate headless to the pricing page, headed to the dashboard. Parallel.\n</think>",
"tool_calls": [
{
"type": "function",
"function": {
"name": "headless_goto",
"arguments": {
"url": "https://example.com/pricing"
}
}
},
{
"type": "function",
"function": {
"name": "browse_goto",
"arguments": {
"url": "http://localhost:3000/dashboard"
}
}
}
]
},
{
"role": "tool",
"content": "{\"url\": \"https://example.com/pricing\", \"title\": \"Pricing - Example\"}",
"name": "headless_goto"
},
{
"role": "tool",
"content": "{\"url\": \"http://localhost:3000/dashboard\", \"title\": \"Dashboard\"}",
"name": "browse_goto"
},
{
"role": "assistant",
"content": "<think>\nBoth loaded. Scrape pricing from headless. Dashboard is visible in the headed browser for the user.\n</think>",
"tool_calls": [
{
"type": "function",
"function": {
"name": "headless_query",
"arguments": {
"selector": ".pricing-card"
}
}
}
]
},
{
"role": "tool",
"content": "[{\"tag\": \"div\", \"text\": \"Starter $9/mo 5 users 10GB storage\"}, {\"tag\": \"div\", \"text\": \"Pro $29/mo 25 users 100GB storage\"}, {\"tag\": \"div\", \"text\": \"Enterprise $99/mo Unlimited users 1TB storage\"}]",
"name": "headless_query"
},
{
"role": "assistant",
"content": "<think>\nGot pricing data. Dashboard is open in the visible browser.\n</think>\n\nDashboard is open in your browser. Here's the pricing from the page:\n\n| Plan | Price | Users | Storage |\n|---|---|---|---|\n| Starter | $9/mo | 5 | 10GB |\n| Pro | $29/mo | 25 | 100GB |\n| Enterprise | $99/mo | Unlimited | 1TB |"
}
]
}