- Published on
Measuring temporal effects of agent knowledge by date-controlled tool use
- Authors
- Name
- R. Patrick Xian
- Name
- Qiming Cui
- Name
- Stefan Bauer
- Name
- Reza Abbasi-Asl
- Affiliation
- UC San Francisco
- Affiliation
- UC Berkeley
- Affiliation
- Technical University of Munich & Helmholtz AI
Temporal progression is an integral part of knowledge accumulation and update. Web search is frequently adopted as grounding for agent knowledge, yet an improper configuration affects the quality of the agent’s responses. Here, we assess the agent behavior using distinct date-controlled tools (DCTs) as stress test to measure the knowledge variability of large language model (LLM) agents. We demonstrate the temporal effects of an LLM agent as a writing assistant, which uses web search to complete scientific publication abstracts. We show that the temporality of search engine translates into tool-dependent agent performance but can be alleviated with base model choice and explicit reasoning instructions such as chain-of-thought prompting. Our results indicate that agent design and evaluations should take a dynamical view and implement measures to account for the temporal influence of external resources to ensure reliability.