41:["$","div",null,{"className":"px-4","children":[["$","h2",null,{"className":"mb-4 text-sm font-medium text-balance text-muted-foreground tabular-nums","children":[0," comments"]}],["$","$L43",null,{"comments":[{"item":{"id":43349885,"type":"comment","by":"gffrd","time":1741833505,"title":"$undefined","url":"$undefined","text":"A real user will be worse … but that’s kinda the point.

The most valuable thing you learn in usability/research is not if your experience works, but the way it’ll be misinterpreted, abused, and bent to do things it wasn’t designed to.","score":"$undefined","descendants":"$undefined","kids":[43354989],"parent":43349525,"dead":"$undefined","deleted":"$undefined"},"children":[{"item":{"id":43354989,"type":"comment","by":"cpeterso","time":1741884082,"title":"$undefined","url":"$undefined","text":"Enter "Drunk User Testing". Host a happy hour event and give some buzzed users some scenarios to test.

https://www.newyorker.com/magazine/2018/04/30/an-open-bar-fo...

https://uxpamagazine.org/boozeability/","score":"$undefined","descendants":"$undefined","kids":"$undefined","parent":43349885,"dead":"$undefined","deleted":"$undefined"},"children":[]}]},{"item":{"id":43350247,"type":"comment","by":"Tepix","time":1741838320,"title":"$undefined","url":"$undefined","text":"More consistent? That's not a given with LLMs unless you set the temperature to 0.","score":"$undefined","descendants":"$undefined","kids":[43362972],"parent":43349525,"dead":"$undefined","deleted":"$undefined"},"children":[{"item":{"id":43362972,"type":"comment","by":"CaffeineLD50","time":1741962610,"title":"$undefined","url":"$undefined","text":"You are right. LLMs are totally random and useless.

Thanks for playing.","score":"$undefined","descendants":"$undefined","kids":[43363104],"parent":43350247,"dead":"$undefined","deleted":"$undefined"},"children":[{"item":{"id":43363104,"type":"comment","by":"Tepix","time":1741963454,"title":"$undefined","url":"$undefined","text":"You seem to disagree. Here's an interesting study where the researchers used an OpenAI-LLM-based tool to grade student papers and by grading them 10 times in a row, they got vastly different results:

https://rainermuehlhoff.de/en/fobizz-AI-grading-assistant-te...

Quote: "The results reveal significant shortcomings: The tool’s numerical grades and qualitative feedback are often random and do not improve even when its suggestions are incorporated."","score":"$undefined","descendants":"$undefined","kids":"$undefined","parent":43362972,"dead":"$undefined","deleted":"$undefined"},"children":[]}]}]}],"op":"CaffeineLD50"}]]}]

0 comments

0 comments