ModerateImprovement@sh.itjust.works to Technology@lemmy.worldEnglish · 4 months agoEveryone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.themarkup.orgexternal-linkmessage-square26fedilinkarrow-up1131arrow-down17
arrow-up1124arrow-down1external-linkEveryone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.themarkup.orgModerateImprovement@sh.itjust.works to Technology@lemmy.worldEnglish · 4 months agomessage-square26fedilink
minus-squarewater@lemmy.worldlinkfedilinkEnglisharrow-up2·4 months agoThis is the way: https://chat.lmsys.org/?arena
This is the way:
https://chat.lmsys.org/?arena