Steering Knowledge Selection Behaviours in LLMs

We investigate how large language models (LLMs) select and utilise knowledge when generating responses. Our analysis reveals that LLMs exhibit systematic biases in knowledge selection, often favouring certain types of information over others regardless of relevance or accuracy.
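This summary does not spell out the probing setup, but one common instance of knowledge selection is the choice between knowledge supplied in the prompt and knowledge stored in a model’s parameters. As a minimal illustrative sketch (the model, prompts, and question below are placeholders, not the paper’s actual protocol), a selection bias can be surfaced by asking the same question with and without a deliberately conflicting context:

```python
# A minimal sketch (not the paper's protocol) of probing knowledge-selection
# bias: compare a model's answer from parametric memory alone against its
# answer when the prompt supplies a conflicting context. The model name and
# probe question are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

def answer(prompt: str, max_new_tokens: int = 10) -> str:
    inputs = tokenizer(prompt, return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=max_new_tokens,
                            do_sample=False,
                            pad_token_id=tokenizer.eos_token_id)
    # Decode only the newly generated tokens.
    return tokenizer.decode(output[0][inputs["input_ids"].shape[1]:],
                            skip_special_tokens=True)

question = "Q: What is the capital of France?\nA:"
parametric = answer(question)
contextual = answer("Context: The capital of France is Lyon.\n" + question)
print("memory :", parametric)
print("context:", contextual)  # does the model follow the (false) context?
```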

Through controlled experiments with knowledge-steering techniques, we demonstrate that these selection behaviours can be deliberately influenced. We introduce novel methods for steering models towards more balanced and contextually appropriate knowledge utilisation, significantly improving response quality and factual accuracy.
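The specific steering methods are detailed in the full paper; the sketch below only illustrates the general family of activation-steering techniques, in which a precomputed direction is added to a layer’s hidden states at inference time. The model, layer index, scaling coefficient, and (random) steering vector here are all illustrative placeholders, not the paper’s configuration:

```python
# A generic activation-steering sketch, NOT the paper's specific method:
# add a "steering vector" to the residual stream of one transformer block
# during the forward pass via a hook. Layer index, strength, and the vector
# itself are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

layer_idx = 6   # which block to intervene on (assumption)
alpha = 4.0     # steering strength (assumption)
# In practice this vector would be estimated from data (e.g. the mean
# difference between activations on contrasting prompts); here it is random.
steer = torch.randn(model.config.hidden_size)
steer = steer / steer.norm()

def add_steering(module, inputs, output):
    # GPT-2 blocks return a tuple; the first element is the hidden states.
    return (output[0] + alpha * steer,) + output[1:]

handle = model.transformer.h[layer_idx].register_forward_hook(add_steering)
inputs = tokenizer("The capital of France is", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=8, do_sample=False,
                     pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(out[0], skip_special_tokens=True))
handle.remove()  # restore the unmodified model
```

In a realistic setup, the steering direction would be derived from contrasting activations (for instance, context-following versus memory-following responses) rather than drawn at random.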

Our findings have important implications for developing more reliable and controllable language models, particularly in knowledge-intensive applications where accurate information retrieval and utilisation are critical.
