29-03-2025 12:00 via zdnet.com

With AI models clobbering every benchmark, it's time for human evaluation

The latest frontier in AI research is having more humans in the loop assessing just how good the models are.