Commit Graph

  • 2200120133 fix: lower seqlen to 512 for short calibration examples main marauder-actual 2026-06-01 04:27:24 +02:00
  • 465e74f49e fix: patch is_mllm_model for Qwen3.6 text-only model marauder-actual 2026-06-01 04:26:15 +02:00
  • 367ed705ab fix: convert chat messages to text for AutoRound calibration marauder-actual 2026-06-01 04:23:28 +02:00
  • 4edaeeb21b switch quantization from llm-compressor to AutoRound marauder-actual 2026-06-01 04:22:07 +02:00
  • 934be8ce48 fix: load tokenizer from base repo for quant venv compat marauder-actual 2026-06-01 04:16:32 +02:00
  • 0fa46c9fed add AWQ quantization script (llm-compressor) marauder-actual 2026-06-01 04:15:15 +02:00
  • 26e776db71 add substrate v5 training and build scripts marauder-actual 2026-06-01 03:52:55 +02:00
  • 0e88c3c2ae add v5 substrate training data (40 examples) marauder-actual 2026-05-31 14:32:19 +02:00
  • 175d28a5aa add bt7274 lora v2 and v3 adapter configs madcat 2026-05-31 11:39:12 +02:00
  • cc7b5c88a3 add training scripts: v3, qwen3-8b, qlora-7b madcat 2026-05-31 11:39:08 +02:00
  • 0231a4d078 add bt7274 qwen3.5-27b lora v4 adapter config marauder-actual 2026-05-31 11:38:55 +02:00
  • 00025dc804 add gonzales SD LoRA training images (LFS) marauder-actual 2026-05-31 11:38:49 +02:00
  • 4cef9386b1 add docs: system lora plan, specialist specs, training review marauder-actual 2026-05-31 11:38:46 +02:00
  • 4678816795 add training scripts: memory, specialist, mining, smoke test marauder-actual 2026-05-31 11:38:42 +02:00
  • df0d4a6eac add training datasets: memory, persona, agent tools, serpent data marauder-actual 2026-05-31 11:38:37 +02:00
  • eee6aff617 enable git-lfs, track *.png marauder-actual 2026-05-31 11:38:25 +02:00
  • 61d3e46ff1 add madcat-ml as submodule at ./docker marauder-actual 2026-05-31 11:31:02 +02:00
  • 60f47d0379 fix: Qwen3.5 VL processor has no .encode() — use .tokenizer attr marauder-actual 2026-05-26 14:28:08 +02:00
  • 5388df0075 fix: manual JSONL loader — pyarrow chokes on mixed tool_calls types marauder-actual 2026-05-26 14:20:37 +02:00
  • a562d753de fix: adamw_8bit → adamw_torch — bitsandbytes has no cu132 binary marauder-actual 2026-05-26 12:10:30 +02:00
  • 94515e7f6d feat: bt7274 LoRA v4 — Hermes format, think blocks, 802 examples marauder-actual 2026-05-26 04:03:38 +02:00
  • 122e73860b feat: tts-norm LoRA — dataset generator + training script marauder-actual 2026-05-26 00:14:51 +02:00
  • 8137d278db fix: parse tool_call arguments from JSON string to dict for Qwen3.5 template marauder-actual 2026-05-25 19:16:07 +02:00
  • 4476520d5c v3 dataset (582 ex, 14% direct) + Qwen3.5-27B training script marauder-actual 2026-05-25 18:57:17 +02:00
  • 5deb9bd2b4 docs: bt7274 persona, specialist plan, tts-clean LoRA marauder-actual 2026-05-25 16:40:09 +02:00
  • 9bcf1c2d9a init: bt7274 lora training — v1 adapter, v2 dataset (500 examples), justfile madcat 2026-05-25 16:37:30 +02:00