fix: lower seqlen to 512 for short calibration examples
Training examples are ~500-1500 tokens. seqlen=2048 causes 'no data has been cached' error. Also remove deprecated format param.
This commit is contained in:
+1
-2
@@ -46,10 +46,9 @@ rounder = AutoRound(
|
||||
dataset=calib,
|
||||
bits=4,
|
||||
group_size=128,
|
||||
seqlen=2048,
|
||||
seqlen=512,
|
||||
nsamples=min(128, len(calib)),
|
||||
iters=200,
|
||||
format="auto_round",
|
||||
)
|
||||
|
||||
print("Running AutoRound INT4 quantization...")
|
||||
|
||||
Reference in New Issue
Block a user