Falk

Learning & Feedback

The agent does not learn from conversations. It improves when humans update config files based on feedback and evals.

Feedback loop

User asks question
     ↓
Agent answers → trace in Logfire
     ↓
User reacts 👍 or 👎 → score in Logfire
     ↓
Data steward reviews low scores
     ↓
Updates config/context files → agent improves
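
The loop above can be pictured with a small sketch. It is not Falk's implementation and does not call the Logfire API. The `Trace` record, the 1/0 score encoding, and the `record_reaction` and `needs_review` helpers are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Trace:
    """Hypothetical stand-in for one answered question (not a Logfire type)."""
    trace_id: str
    question: str
    answer: str
    score: Optional[int] = None  # None until the user reacts


def record_reaction(trace: Trace, reaction: str) -> Trace:
    """Attach a score to a trace: thumbs up -> 1, thumbs down -> 0 (assumed encoding)."""
    scores = {"👍": 1, "👎": 0}
    if reaction not in scores:
        raise ValueError(f"unsupported reaction: {reaction!r}")
    trace.score = scores[reaction]
    return trace


def needs_review(traces: list[Trace]) -> list[Trace]:
    """Return the traces a data steward would look at: those scored 0."""
    return [t for t in traces if t.score == 0]
```

Traces without a reaction keep `score = None` and are left out of the review list in this sketch. Whether unscored traces should also be sampled for review is a separate decision.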

The improvement cycle

1. Find issues: filter Logfire traces by low scores
2. Understand why: see the full trace (query → tools → response)
3. Fix the source: update synonyms, gotchas, rules, or context
4. Write a test: add a case to evals/ to prevent regression
5. Verify: run falk test
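
To make the "write a test" step concrete, here is a minimal sketch of what a regression case could check. The real format of files under evals/ and the behavior of falk test are not described here, so `EvalCase`, `must_contain`, and `failing_cases` are hypothetical names, and the substring check is an assumed, deliberately simple pass criterion.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class EvalCase:
    """Hypothetical regression case: a question and a phrase the answer must include."""
    question: str
    must_contain: str


def failing_cases(
    answer_fn: Callable[[str], str], cases: list[EvalCase]
) -> list[EvalCase]:
    """Run every case through answer_fn and return the ones whose answer
    lacks the required phrase (case-insensitive substring match)."""
    return [
        case
        for case in cases
        if case.must_contain.lower() not in answer_fn(case.question).lower()
    ]
```

Here `answer_fn` stands in for whatever produces the agent's answer. An empty result means every recorded case still passes after the config change.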

Everything is files

All agent knowledge lives in version-controlled files: no database, no migrations. Every change goes through PR review.

See also

Context — where vocabulary, gotchas, and domain knowledge live
Memory — what persists (knowledge vs session vs feedback)
