SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Papers citing "SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science"