
Real-world alignment needs direct user feedback to improve the model and thus the product. As people interact with AI systems, their behaviours shape and influence the outputs they receive in the future.
Thus, creating a continuous feedback loop where both the system and user behaviour adapt over time. E.g., ChatGPT uses Reaction buttons and Comment boxes to collect user feedback.
How to use this pattern
Account for implicit feedback: Capture user actions such as skips, dismissals, edits, or interaction frequency. These passive signals provide valuable behavioral cues that can tune recommendations or surface patterns of disinterest.
Ask for explicit feedback: Collect direct user input through thumbs-up/down, NPS rating widgets or quick surveys after actions. Use this to improve both model behavior and product fit.
Communicate how feedback is used: Let users know how their feedback shapes future experiences. This increases trust and encourages ongoing contribution.