fix: lower seqlen to 512 for short calibration examples
Training examples are ~500-1500 tokens. seqlen=2048 causes 'no data has been cached' error. Also remove deprecated format param.
This commit is contained in:
+1
-2
@@ -46,10 +46,9 @@ rounder = AutoRound(
|
|||||||
dataset=calib,
|
dataset=calib,
|
||||||
bits=4,
|
bits=4,
|
||||||
group_size=128,
|
group_size=128,
|
||||||
seqlen=2048,
|
seqlen=512,
|
||||||
nsamples=min(128, len(calib)),
|
nsamples=min(128, len(calib)),
|
||||||
iters=200,
|
iters=200,
|
||||||
format="auto_round",
|
|
||||||
)
|
)
|
||||||
|
|
||||||
print("Running AutoRound INT4 quantization...")
|
print("Running AutoRound INT4 quantization...")
|
||||||
|
|||||||
Reference in New Issue
Block a user