October 28, 2024

Steering Knowledge Selection Behaviours in LLMs

Large language models (LLMs) often face conflicts between stored knowledge and contextual information, which can lead to outdated or incorrect responses. Analyzing LLMs’ internal activations, we […]
August 25, 2024

Are We Done with MMLU?

Are We Done with MMLU? Read Paper Our analysis uncovers significant issues with the Massive Multitask Language Understanding (MMLU) benchmark, which is widely used to assess […]