[4/9] LLMs generate at scale; humans verify easily. UniScientist exploits this asymmetry via Evolving Polymathic Synthesis:
Models generate research problems from expert-validated claims; domain experts verify quality.
Dataset: 4,700+ instances, 50+ disciplines, 20+ rubric items each. 📊