NLP-powered components - search bars, tag clouds, voice assistants, auto-translate - are embedded in almost every product we use daily. Yet standard UX testing methods were designed for visual interfaces, not for systems whose core interaction happens through language.
I set out to answer two questions:
- How do people actually behave when interacting with linguistic interface components?
- Can we adapt established UX methods to evaluate these systems effectively?
This thesis bridges UX research, cognitive psychology, and NLP - developing a reusable methodology for evaluating systems where language, not pixels, is the primary interface.