Pre-computed Q-Filters for efficient KV cache compression.
Nathan Godey
nthngdy
Recent Activity
updated a model about 22 hours ago: nthngdy/matryoshka-baselines
published a model about 22 hours ago: nthngdy/matryoshka-baselines
updated a model about 23 hours ago: nthngdy/matryoshka-1B