Commit Graph

231 Commits

Author SHA1 Message Date
github-actions[bot] 0092a74129 format: auto-format code [skip ci] 2026-05-17 16:45:07 +00:00
github-actions[bot] 280f88e0d9 style: auto-fix ruff warnings [skip ci] 2026-05-17 16:44:53 +00:00
aladac 1908cf91c4 feat(style-sweep): --parallel-queue/-P for concurrent ComfyUI submissions
Adds client-side concurrent queueing to style-sweep. -P N submits N
prompts to ComfyUI's HTTP queue concurrently via ThreadPoolExecutor.
The GPU still processes one prompt at a time (ComfyUI's queue is
single-worker), but the HTTP submission, websocket polling, image
download, and disk-write phases pipeline with the next prompt's
submission.

Expected speedup: 5-15% on a typical Flux sweep where per-image GPU
time is ~25-30s and overhead is ~3-5s. Real benefit grows with
slower networks or larger images.

Design choices:
- Default P=1 preserves the exact existing sequential behavior and
  log output (no "(submit #N)" suffix in messages).
- P>1 uses ThreadPoolExecutor.as_completed for completion-order
  reporting; the manifest is re-sorted to source-list order after.
- Skip-existing + dry-run cases are handled synchronously before the
  executor even starts (no point pipelining no-ops).
- --abort-on-error is incompatible with parallelism (can't reliably
  stop in-flight workers); we warn and continue.
- Per-task console output WILL interleave under -P>1 because
  _run_generation prints its own progress; users are pointed at the
  manifest for clean per-slug timing.

Why not full async multi-GPU-workflow parallelism:
- ComfyUI processes its queue strictly sequentially; we can't
  actually run two Flux UNets concurrently without a second ComfyUI
  instance, second port, second model dir, etc.
- Even with two instances on one GPU, the CUDA cores time-slice and
  you get ~1.1x not 2x.
- Memory math is tighter than it looks even on Spark's 80GB unified
  pool: two Flux dev instances = 64GB fixed before any activations.
- Maintenance burden is real; speed gain is marginal.

Client-side pipelining gets the practical wins (overhead hiding,
cleaner progress feedback for long sweeps) without the complexity
or OOM risk.

7 new tests covering: invalid P=0, P=1 equivalence with sequential,
multi-style execution, source-order manifest preservation under
chaotic completion, skip-existing in parallel mode, individual
failure containment, and abort-on-error warning.

267 -> 274 tests.
2026-05-17 18:44:15 +02:00
github-actions[bot] e063911178 format: auto-format code [skip ci] 2026-05-17 16:40:09 +00:00
github-actions[bot] 592b6858eb style: auto-fix ruff warnings [skip ci] 2026-05-17 16:39:54 +00:00
aladac 6935491081 feat(generate): validate model availability against live ComfyUI before queueing
Catches mismatches between local intent and what's actually loaded on the
ComfyUI host. Replaces ComfyUI's generic 400 'prompt_outputs_failed_validation'
with a clear "model X not available on host — did you mean Y?" suggestion.

Why: when a user types `tsr generate -m getphatFLUXReality_v5Hardcore` but
only v11Softcore is installed, they got a 30-line raw API error buried in
node validation output. Now they get one red line plus three fuzzy-matched
candidates from the actual loader bucket.

Implementation:
- Extends get_loaded_models() in comfyui.py to include the diffusion_models
  bucket (UNETLoader -> unet_name). Previously only checkpoints, loras, vae,
  clip, controlnet, upscale_models were exposed.
- New _validate_model_available() helper in cli.py runs after family
  detection, before prompt enhancement. Maps family -> loader bucket:
  flux_unet / flux2_klein -> diffusion_models/, else checkpoints/. Uses
  difflib.get_close_matches for the "did you mean" hint.
- Validates LoRA presence too when -l is passed.
- Special hint: if the requested file IS in checkpoints/ but the family
  requires diffusion_models/, suggests the symlink command the user needs
  to run on the host. Common case for newly-uploaded UNet-only checkpoints.
- Network failures are non-fatal — falls through to let ComfyUI surface
  the error itself rather than blocking on a stale endpoint.
- Skipped in --json mode (machine callers) and --remote dispatches (the
  server validates remotely).

8 new tests covering: unknown model in checkpoints bucket, unknown in
diffusion_models, flux2_klein routing, happy path, missing LoRA, network
failure, symlink hint, and a source-level check that the
diffusion_models bucket is wired into get_loaded_models.

259 -> 267 tests.
2026-05-17 18:37:13 +02:00
aladac b7263d1229 feat(workflow): support Flux.2 Klein 9B checkpoints (CLIPLoader type=flux2)
Adds a `flux2_klein` model family for Black Forest Labs' Flux.2 Klein 9B
release. Different architecture from Flux.1 D — required a separate
workflow rather than extending flux_unet.

Architecture differences from Flux.1 D:
- Single Qwen3-8B text encoder via CLIPLoader(type=flux2), producing
  12288-dim conditioning (3 stacked hidden layers). NOT DualCLIPLoader.
- EmptyFlux2LatentImage instead of EmptySD3LatentImage (different
  latent shape).
- Custom-sampling pipeline: Flux2Scheduler -> SIGMAS, fed into
  SamplerCustomAdvanced together with BasicGuider + RandomNoise +
  KSamplerSelect. No standalone KSampler node, so the caller's
  scheduler argument is ignored.
- Dedicated VAE: flux2-vae.safetensors (not Flux.1's ae.safetensors).

Detection:
- New FLUX2_KLEIN_PATTERNS constant lists known Klein filenames
  ("lust_", "moodydesire"). _is_flux2_klein() checks base_model field
  first ("flux.2 klein" / "flux2 klein") then filename pattern.
- detect_model_family() runs the Klein check BEFORE flux_unet, so
  Klein checkpoints that also match UNet-only patterns (lust_v10,
  moodyDesireMix) correctly route to flux2_klein.

Affected checkpoints reclassified from flux_unet -> flux2_klein:
- lust_v10.safetensors (Flux.2 Klein 9B-base per CivitAI DB)
- moodyDesireMix_v20PRO.safetensors (Flux.2 Klein 9B)

Still flux_unet (genuinely Flux.1 D UNet-only):
- cyberrealisticFlux_v25, fcFluxPonyPerfectBase,
  getphatFLUXReality_v11Softcore.

Required ComfyUI host setup (one-time):
- /home/madcat/comfyui/models/text_encoders/qwen_3_8b_fp8mixed.safetensors
  (8.1 GB, from Comfy-Org/vae-text-encorder-for-flux-klein-9b on HF)
- /home/madcat/comfyui/models/vae/flux2-vae.safetensors (321 MB)

Verified end-to-end on madcat: lust_v10 generated successfully through
the new flux2_klein workflow (~85s per image at 1024x1024, 20 steps).

7 new tests; 253 -> 259 total. Existing flux_unet tests retargeted to
genuine Flux.1 D checkpoints (getphat, fcFluxPony, cyberrealisticFlux).
2026-05-17 18:22:23 +02:00
aladac 3e04929515 feat(workflow): support UNet-only Flux.1 D checkpoints via DualCLIPLoader
Adds a `flux_unet` model family for Flux.1 D checkpoints that ship only the
UNet weights (no baked-in CLIP/T5/VAE) and require external encoder loading.

Affected checkpoints: lust_v10, cyberrealisticFlux_v25,
getphatFLUXReality_v11Softcore, moodyDesireMix_v20PRO,
fcFluxPonyPerfectBase. These live in `models/diffusion_models/` (or
`models/unet/`) on the ComfyUI host and are loaded via UNETLoader instead
of CheckpointLoaderSimple.

Detection:
- New `FLUX_UNET_ONLY_PATTERNS` constant in config.py lists known
  UNet-only filename substrings.
- `detect_model_family()` returns `flux_unet` when any pattern matches,
  taking precedence over base_model field (same architecture-override
  pattern used for FluxPony hybrids).

Workflow:
- New flux_unet workflow in comfyui.py uses UNETLoader + DualCLIPLoader
  (clip_l.safetensors + t5xxl_fp16.safetensors, type=flux) + VAELoader
  (ae.safetensors) wired into the same Flux KSampler graph as the
  monolithic flux family.
- Family defaults inherit from flux (sampler/scheduler/steps/cfg) with
  external_clip flag set.

Smoke-tested end-to-end on madcat with getphatFLUXReality_v11Softcore
producing valid output. Flux.2 Klein checkpoints (lust_v10,
moodyDesireMix) still fail because they require a different text encoder
(qwen_3_8b) — see follow-up commit.

13 new tests; 240 -> 253 total.
2026-05-17 18:15:33 +02:00
aladac af61ba79c9 feat(cli): add --list and --style filter flags to style-sweep
--list/-L prints the resolved styles list as a two-column rich table
(slug + suffix truncated to ~80 chars) and exits without generating.
Template becomes optional when --list is paired with an explicit
--styles source, so you can inspect any styles file standalone.

--style/-S SLUG selects a single style by exact slug match; repeatable
for multiple. Unknown slugs error red with the available slug list.
Filter applies before --limit and preserves the source file's order.

Both flags compose with --limit and --dry-run; when filtering down to
a subset, the manifest is still written for the smaller run.
2026-05-17 16:47:52 +02:00
aladac 0a2da5a98b feat(cli): add style-sweep command for batched style variation
New `tsr style-sweep` command renders one image per style suffix from a
template JSON, composing prompt = template.prompt + ', ' + style.suffix
and writing to {output_dir}/{slug}.png.

- Template JSON mirrors `generate --input` keys plus output_dir + styles.
- Styles source can be a path or inline list/object on either CLI or
  template. Relative styles paths in the template resolve against the
  template's directory (so templates can ship with their styles file).
- Skips existing outputs by default (--no-skip-existing to force).
- --dry-run prints planned prompts/paths without invoking generate.
- --limit N caps the sweep for fast iteration.
- --continue-on-error keeps going on individual failures; final exit code
  is non-zero if any style failed and failed slugs are reported.
- --remote propagates to the underlying generation, same as `generate`.
- Writes a manifest {output_dir}/_sweep.json with per-style results
  (slug, prompt, output, seed, duration_sec, success, error).

Delegates to the `_run_generation` helper extracted from `generate`.
2026-05-17 16:33:13 +02:00
aladac e1422b8f51 refactor(cli): extract _run_generation helper from generate
Move the post-merge body of generate() into a module-level _run_generation
helper so it can be invoked directly by other commands (next: style-sweep)
without going through Typer argv reconstruction.

No behaviour change. generate() still owns the --input JSON merge and CLI
parameter-source detection, then delegates to _run_generation.
2026-05-17 16:32:51 +02:00
aladac b66bc98386 feat(generate): expose --guidance flag for Flux models
Threads guidance value through CLI → remote payload → server schema →
generate_image() → FluxGuidance node. Ignored for non-Flux families.

Use cases:
- Lower guidance (2.0-3.0) for looser, more photorealistic Flux output
- Higher guidance (4.0-6.0) for tighter prompt adherence
2026-05-17 15:54:47 +02:00
aladac 338a7fe267 fix(generate): dispatch hybrid Flux models to Flux workflow
Models like gonzalomoXLFluxPony are architecturally Flux but CivitAI
tags them as 'Pony', causing the SDXL workflow to be sent to ComfyUI
which fails validation. The filename now overrides base_model when it
contains 'flux'.

Also adds:
- Full Flux Dev/Schnell workflow template (ModelSamplingFlux,
  FluxGuidance, ConditioningZeroOut, EmptySD3LatentImage); KSampler
  cfg locked to 1.0, caller cfg routed to FluxGuidance
- --family/-F flag to manually override family detection
- queue_prompt now surfaces ComfyUI node_errors from 400 responses
- Tests for Flux workflow builder (8 cases) and updated family defaults
2026-05-17 15:50:25 +02:00
aladac 78b46b9a13 release: v0.1.21 2026-05-16 00:44:23 +02:00
aladac b731a88beb fix: populate model cache tables after download so db list resolves names
The CLI download flow only set civitai_model_id/version_id on local_files
without caching the full model payload, so 'tsr db list' joined against
empty models/versions/creators tables and showed every linked file as
'unlinked'. The server's _auto_link_file path had additional bugs:
resolved-vs-unresolved path comparison after rescan, redundant CivitAI
hash lookup, and silent failure swallowed by 'completed' status.

- New Database.register_downloaded_file() consolidates hashing, metadata
  storage, FK linking, and cache_model() into a single idempotent call
  shared by both CLI and server paths.
- Server _do_download now passes version_info straight through and
  surfaces db_file_id/db_linked/db_cached/db_error onto _active_downloads.
- Drops the broken _auto_link_file rescan helper.
2026-05-16 00:44:12 +02:00
aladac ec080803fc release: v0.1.20 2026-05-16 00:03:56 +02:00
github-actions[bot] 05892cf287 format: auto-format code [skip ci] 2026-04-24 17:03:09 +00:00
github-actions[bot] b078e4f936 style: auto-fix ruff warnings [skip ci] 2026-04-24 17:02:53 +00:00
aladac 329b0d849e fix null params when no model family detected in tsr generate
Co-Authored-By: marauder-os <marauder@saiden.dev>
2026-04-24 19:02:31 +02:00
github-actions[bot] 55358e7b5a format: auto-format code [skip ci] 2026-04-20 20:08:33 +00:00
aladac 7162db1ab9 dynamic checkpoint presets with orientation, correct VAEs, and auto-resolved sampler/scheduler/cfg/steps 2026-04-20 22:08:01 +02:00
aladac b074a7dd50 Fix -m model flag causing 502 on /api/comfyui/generate
Family detection was force-injecting a default VAE (e.g. sdxl_vae.safetensors)
when a model was specified without an explicit --vae. If that VAE file didn't
exist on the ComfyUI server, the workflow was silently rejected. Now only
overrides VAE when the user explicitly passes --vae.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-07 16:18:03 +02:00
aladac 4a2fdce115 Add top-level generate and models commands with --remote support
Add `tsr generate` and `tsr models` as top-level CLI commands that call
ComfyUI library functions directly or HTTP to a remote tensors server.
Add `--remote` flag to existing `tsr search` and `tsr dl` commands.

New file `tensors/remote.py` provides HTTP client functions for all four
operations against the remote tensors API (generate, models, search,
download with progress polling).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-06 01:26:55 +02:00
aladac 56d5233962 Make CORS origins configurable via CORS_ORIGINS env var
Replaces hardcoded localhost origins with env-driven config.
Accepts comma-separated origins or wildcard (*). Defaults to
["*"] for backward compatibility.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-06 01:02:17 +02:00
aladac a349f6bc93 Add AI Agent Disclaimer section 2026-04-01 17:16:05 +02:00
aladac 53285e46ec LICENSE: add attribution and case-by-case options 2026-04-01 17:12:09 +02:00
aladac be14d29144 LICENSE: add BSL starting version 2026-04-01 17:00:48 +02:00
aladac 7f0ab19e4e Update README: BSL-1.1 license 2026-04-01 16:58:26 +02:00
aladac 8ddc265319 Release v0.1.19 - BSL baseline 2026-04-01 16:40:54 +02:00
aladac fcd1d1ac6e BSL: clarify version-based change dates 2026-04-01 16:31:34 +02:00
aladac 0035c432c3 Relicense to BSL 1.1 2026-04-01 16:29:59 +02:00
aladac 9f37b87909 Fix Dockerfile: use separate start.sh instead of heredoc 2026-04-01 14:42:21 +02:00
aladac 611f92a868 Rework Dockerfile for Tengu addon deployment (no bundled ComfyUI) 2026-04-01 14:29:06 +02:00
aladac 66f60989e4 Add .coverage to gitignore 2026-03-30 13:49:25 +02:00
aladac 2a704aa677 Add model family-specific sampler, scheduler, and VAE defaults
- Add sampler/scheduler/steps/vae to MODEL_FAMILY_DEFAULTS for all families
- Add zimage family detection for ZImageTurbo models
- Flux and zimage families use ae.safetensors VAE
- SD 1.5 families use checkpoint built-in VAE
- SDXL families use sdxl_vae.safetensors
- API auto-applies family defaults when request uses default values

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-20 09:22:41 +01:00
github-actions[bot] 3432e5cb99 format: auto-format code [skip ci] 2026-03-20 08:07:50 +00:00
aladac 372133edcc Update 2026-03-20 09:07 2026-03-20 09:07:19 +01:00
aladac 420d260936 Add LoRA support to generate API endpoint
- Add lora_name and lora_strength parameters
- Include LoRA info in request logging

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 18:57:38 +01:00
aladac f04e7b5cfa Configure logging for tensors package
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 18:55:20 +01:00
aladac c8d1efa641 Add logging for generation requests and results
- Log prompt, model, size, steps on generate request
- Log completion with prompt_id and image count
- Log warnings/errors on failures

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 18:49:26 +01:00
Adam Ladachowski 9de133b672 Include trigger words in /api/db/files response
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 16:13:45 +01:00
Adam Ladachowski 1ad0a09474 Add separate VAE loader for SDXL/Illustrious/Pony models
- Use sdxl_vae.safetensors by default instead of checkpoint VAE
- Add vae parameter to generate_image and API endpoint
- Better quality for modern SDXL-based models

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 16:09:42 +01:00
Adam Ladachowski 3d08d6c5e5 Enable CORS for local development
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-03-03 16:03:32 +01:00
github-actions[bot] 39551f7f05 format: auto-format code [skip ci] 2026-02-22 06:23:52 +00:00
github-actions[bot] 762b3f4eac style: auto-fix ruff warnings [skip ci] 2026-02-22 06:23:38 +00:00
Adam Ladachowski 82eb0d3b5c Update 2026-02-22 06:23:25 +00:00
Adam Ladachowski 5ddfb07448 Add WebSocket-based progress tracking for ComfyUI generation
- Replace polling with WebSocket connection for real-time progress
- Show step-by-step progress during sampling (Step 1/20, etc.)
- Display progress bar with actual completion percentage
- Fall back to polling if WebSocket connection fails
- Import websocket-client for sync WebSocket support

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-22 04:07:07 +00:00
Adam Ladachowski 3888558214 💬 Commit message: Update 2026-02-21 20:16:00, 1 files, 103 lines
📁 Files changed: 1
📝 Lines changed: 103

  • setup-comfyui.sh
2026-02-21 20:16:00 +01:00
Adam Ladachowski b1995cb0e4 Add key.txt to gitignore for local API key storage
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-21 05:37:54 +01:00
Adam Ladachowski d8d788c9b3 💬 Commit message: Update 2026-02-21 05:22:37, 2 files, 299 lines
📁 Files changed: 2
📝 Lines changed: 299

  • .gitignore
  • models-db.md
2026-02-21 05:22:37 +01:00