Overview
NLP Lab Evidence Verification is an individual challenge built from a fictional English laboratory corpus. Participants predict a claim label and cite sentence-level evidence from the released corpus.
Competition
Each test claim must be classified as SUPPORTED, REFUTED, or NOT_ENOUGH_INFO. For supported and refuted claims, submissions should include the evidence sentences that justify the prediction.
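As an illustration of the prediction format described above, the following sketch builds one record per claim and serializes the records as JSONL. The field names (`claim_id`, `label`, `evidence`), the evidence identifier format, and the choice to send an empty evidence list for NOT_ENOUGH_INFO are assumptions made for this example; the official schema is not given in this overview and may differ.

```python
import json

# The three labels named in the task description.
VALID_LABELS = {"SUPPORTED", "REFUTED", "NOT_ENOUGH_INFO"}


def make_record(claim_id, label, evidence=()):
    """Build one prediction record.

    Field names are hypothetical; check the official submission
    schema before relying on them.
    """
    if label not in VALID_LABELS:
        raise ValueError(f"unknown label: {label!r}")
    # Assumption: NOT_ENOUGH_INFO predictions carry no evidence.
    if label == "NOT_ENOUGH_INFO":
        evidence = []
    return {"claim_id": claim_id, "label": label, "evidence": list(evidence)}


def to_jsonl(records):
    """Serialize records as JSONL: one JSON object per line."""
    return "".join(json.dumps(record) + "\n" for record in records)


if __name__ == "__main__":
    records = [
        make_record("claim-001", "SUPPORTED", ["doc-12:sent-3"]),
        make_record("claim-002", "NOT_ENOUGH_INFO"),
    ]
    print(to_jsonl(records), end="")
```

Validating the label before writing the file catches typos locally rather than at upload time.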
The public leaderboard is scored on the public split only. Final reveal scores are computed server-side and remain hidden until the end.
Evaluation
The public score combines label macro F1 and evidence F1. Submit predictions through the Submission page; only valid JSONL files are added to the public leaderboard.
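The two metrics can be sketched locally as follows. This is a minimal illustration, not the server-side scorer: the overview does not say how the two F1 values are combined or how evidence F1 is aggregated, so the equal weighting (`weight=0.5`) and the micro-averaging of evidence across claims are assumptions, and all function names are hypothetical.

```python
LABELS = ("SUPPORTED", "REFUTED", "NOT_ENOUGH_INFO")


def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when there is nothing to count."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def label_macro_f1(gold, pred):
    """Unweighted mean of the per-label F1 scores."""
    scores = []
    for label in LABELS:
        tp = sum(g == label and p == label for g, p in zip(gold, pred))
        fp = sum(g != label and p == label for g, p in zip(gold, pred))
        fn = sum(g == label and p != label for g, p in zip(gold, pred))
        scores.append(f1(tp, fp, fn))
    return sum(scores) / len(scores)


def evidence_f1(gold_sets, pred_sets):
    """Evidence F1 over sets of sentence IDs.

    Assumption: counts are pooled across claims (micro-averaged).
    """
    tp = sum(len(g & p) for g, p in zip(gold_sets, pred_sets))
    fp = sum(len(p - g) for g, p in zip(gold_sets, pred_sets))
    fn = sum(len(g - p) for g, p in zip(gold_sets, pred_sets))
    return f1(tp, fp, fn)


def combined_score(gold_labels, pred_labels, gold_ev, pred_ev, weight=0.5):
    """Weighted mean of the two metrics; the weight is an assumption."""
    return (weight * label_macro_f1(gold_labels, pred_labels)
            + (1 - weight) * evidence_f1(gold_ev, pred_ev))
```

Macro averaging gives each of the three labels equal influence regardless of how often it occurs, so a model that ignores a rare label is penalized even if its overall accuracy is high.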
Participation
This competition is strictly individual. Use the assigned ID to log in, submit one prediction file at a time, and review your own submission history from the Submission page.