Submitted by Sy-Tuyen Ho 6 SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones? Furong Huang's Lab at UMD 0 2