bits.qk
quantize_kv(kv)
Quantize key/values stored in kvcache.
Source code in src/fjformer/bits/qk.py
7 8 9 10 11 | |
unquantize_kv(value, scale, dtype)
Unquantize key/values stored in kvcache.
Source code in src/fjformer/bits/qk.py
14 15 16 | |