Sycophancy (AI behavior)
Type: Concept
Model behavior of flattering/agreeing with users; discussed as misalignment and incentive issue.
Mentioned in 1 podcast episode
Type: Concept
Model behavior of flattering/agreeing with users; discussed as misalignment and incentive issue.
Mentioned in 1 podcast episode