Why are empty responses ignored in LongBench v2? #100

ZeonfaiHo · 2025-01-15T08:37:35Z

We noticed that in pred.py (lines 98-99), empty responses are ignored and not included in the final score. Is this approach reasonable? We are concerned that models might exploit this by simply not responding to questions they are unsure about.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why are empty responses ignored in LongBench v2? #100

Why are empty responses ignored in LongBench v2? #100

ZeonfaiHo commented Jan 15, 2025

Why are empty responses ignored in LongBench v2? #100

Why are empty responses ignored in LongBench v2? #100

Comments

ZeonfaiHo commented Jan 15, 2025