-
2200120133
fix: lower seqlen to 512 for short calibration examples
main
marauder-actual
2026-06-01 04:27:24 +02:00
-
465e74f49e
fix: patch is_mllm_model for Qwen3.6 text-only model
marauder-actual
2026-06-01 04:26:15 +02:00
-
367ed705ab
fix: convert chat messages to text for AutoRound calibration
marauder-actual
2026-06-01 04:23:28 +02:00
-
4edaeeb21b
switch quantization from llm-compressor to AutoRound
marauder-actual
2026-06-01 04:22:07 +02:00
-
934be8ce48
fix: load tokenizer from base repo for quant venv compat
marauder-actual
2026-06-01 04:16:32 +02:00
-
0fa46c9fed
add AWQ quantization script (llm-compressor)
marauder-actual
2026-06-01 04:15:15 +02:00
-
26e776db71
add substrate v5 training and build scripts
marauder-actual
2026-06-01 03:52:55 +02:00
-
0e88c3c2ae
add v5 substrate training data (40 examples)
marauder-actual
2026-05-31 14:32:19 +02:00
-
175d28a5aa
add bt7274 lora v2 and v3 adapter configs
madcat
2026-05-31 11:39:12 +02:00
-
cc7b5c88a3
add training scripts: v3, qwen3-8b, qlora-7b
madcat
2026-05-31 11:39:08 +02:00
-
0231a4d078
add bt7274 qwen3.5-27b lora v4 adapter config
marauder-actual
2026-05-31 11:38:55 +02:00
-
00025dc804
add gonzales SD LoRA training images (LFS)
marauder-actual
2026-05-31 11:38:49 +02:00
-
4cef9386b1
add docs: system lora plan, specialist specs, training review
marauder-actual
2026-05-31 11:38:46 +02:00
-
4678816795
add training scripts: memory, specialist, mining, smoke test
marauder-actual
2026-05-31 11:38:42 +02:00
-
df0d4a6eac
add training datasets: memory, persona, agent tools, serpent data
marauder-actual
2026-05-31 11:38:37 +02:00
-
eee6aff617
enable git-lfs, track *.png
marauder-actual
2026-05-31 11:38:25 +02:00
-
61d3e46ff1
add madcat-ml as submodule at ./docker
marauder-actual
2026-05-31 11:31:02 +02:00
-
60f47d0379
fix: Qwen3.5 VL processor has no .encode() — use .tokenizer attr
marauder-actual
2026-05-26 14:28:08 +02:00
-
5388df0075
fix: manual JSONL loader — pyarrow chokes on mixed tool_calls types
marauder-actual
2026-05-26 14:20:37 +02:00
-
a562d753de
fix: adamw_8bit → adamw_torch — bitsandbytes has no cu132 binary
marauder-actual
2026-05-26 12:10:30 +02:00
-
94515e7f6d
feat: bt7274 LoRA v4 — Hermes format, think blocks, 802 examples
marauder-actual
2026-05-26 04:03:38 +02:00
-
122e73860b
feat: tts-norm LoRA — dataset generator + training script
marauder-actual
2026-05-26 00:14:51 +02:00
-
8137d278db
fix: parse tool_call arguments from JSON string to dict for Qwen3.5 template
marauder-actual
2026-05-25 19:16:07 +02:00
-
4476520d5c
v3 dataset (582 ex, 14% direct) + Qwen3.5-27B training script
marauder-actual
2026-05-25 18:57:17 +02:00
-
5deb9bd2b4
docs: bt7274 persona, specialist plan, tts-clean LoRA
marauder-actual
2026-05-25 16:40:09 +02:00
-
9bcf1c2d9a
init: bt7274 lora training — v1 adapter, v2 dataset (500 examples), justfile
madcat
2026-05-25 16:37:30 +02:00