122e73860bd01280d00ec32a657f20515385fb62
gen_tts_dataset.py: 4960 synthetic examples, 22 categories (numbers, currencies, dates, times, temperatures, acronyms, NATO phonetic, URLs, markdown, etc). Bilingual EN/PL with explicit [lang] tag prefix. train_tts_norm.py: Unsloth LoRA training for Qwen2.5-7B-Instruct. Rank 16, 3 epochs, packing, max_seq 768. Trained on H100 in 20m38s, final loss 0.091. Adapter: 154MB.
Description
LoRA fine-tuning pipeline for building persona, specialist, and voice adapters.
Languages
Python
96.7%
Jinja
2%
Just
1.3%