Codebase to test Top-k Attention and Top-theta Attention on Large Language Models using the lm-eval-harness [1] framework, and text generation tasks including HumanEval [2] and LongBench [3] ...