Generating evaluation queries...
Preprocessing data...
Using naive dequantization...
Training model...

Evaluating model...
GMQ: 2.1544432608222936;
50%: 2.1131607190719963;
90%: 2.390177847321101;
95%: 3.5446638521435294;
MAX: 4.285541153729266
Generating finetune queries...
Finetuning model...

Evaluating model...
GMQ: 1.518089337520198;
50%: 1.145707388189137;
90%: 1.359151843719285;
95%: 44.102549254918095;
MAX: 88.17149024981862