Default stop flag to None #117

Open
RiddleHe wants to merge 1 commit into LiveCodeBench:main from collinear-ai:stop-flag

@RiddleHe

We identified a bug in the parser.py file, where the default stop token for model generation is "###":

parser.add_argument(
    "--stop",
    default="###",
    type=str,
    help="Stop token (use `,` to separate multiple tokens)",
)

This cuts off the model response prematurely. For example, in our evaluations Qwen3-8B often outputs a heading such as "### Solution code", so generation stops at "###" and the solution code that follows is discarded.
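To make the failure mode concrete, here is a minimal sketch of how a stop string truncates a response. `truncate_at_stop` is a hypothetical helper written for illustration, not code from the repository; it only mimics the usual behaviour of stop sequences, where output ends at the first occurrence of any stop string.

```python
from typing import List


def truncate_at_stop(text: str, stop_tokens: List[str]) -> str:
    """Cut `text` at the earliest occurrence of any stop string.

    Hypothetical helper: it imitates what a generation backend does
    when it is given stop sequences.
    """
    cut = len(text)
    for token in stop_tokens:
        if not token:
            continue  # ignore empty stop strings
        idx = text.find(token)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


# A response shaped like the one described above (made-up content).
response = (
    "Let me reason about the problem first.\n\n"
    "### Solution code\n"
    "def solve():\n"
    "    return 42\n"
)

with_default_stop = truncate_at_stop(response, ["###"])
without_stop = truncate_at_stop(response, [])

print(repr(with_default_stop))      # the heading and the code are gone
print("def solve" in without_stop)  # the code survives when no stop is set
```

With "###" as the stop string, everything from the Markdown heading onward is lost, including the code block the evaluation needs.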

Currently, this stop token is still applied in the vLLM runner, which is the default entry point for most open-source models.

We changed the default of the flag to None and updated the subsequent logic to handle the None case.
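A rough sketch of what such a change could look like is shown below. It is not the actual diff from this PR: `build_parser` and `parse_stop` are hypothetical names, and splitting on `,` is assumed from the help text of the `--stop` argument.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Parser with only the --stop argument, defaulting to None (sketch)."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--stop",
        default=None,  # previously "###"
        type=str,
        help="Stop token (use `,` to separate multiple tokens)",
    )
    return parser


def parse_stop(raw: Optional[str]) -> Optional[List[str]]:
    """Turn the raw flag value into a list of stop strings, or None.

    Hypothetical helper: both None and "" mean "no stop strings".
    """
    if not raw:
        return None
    tokens = [tok for tok in raw.split(",") if tok]
    return tokens or None


# No flag given: nothing is cut off.
default_stop = parse_stop(build_parser().parse_args([]).stop)

# Explicit opt-in still works.
explicit_stop = parse_stop(build_parser().parse_args(["--stop", "###,END"]).stop)

print(default_stop)
print(explicit_stop)
```

Returning None instead of an empty list keeps "no stop strings" unambiguous for downstream code, which then only has to check for a single sentinel value.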

@Naman-ntc
Contributor

These are default values and not the recommended values (especially for newer models). Please pass an empty string "" for the stop argument.
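As an illustration of this workaround, the sketch below rebuilds only the `--stop` argument with its current default and passes an empty string. How the repository turns the raw string into stop tokens is not shown in this thread, so `naive_split` is purely an assumption used to flag one thing worth checking: in Python, splitting an empty string on `,` yields a list holding one empty string, not an empty list.

```python
import argparse
from typing import List

# Stand-in for the real parser: only the --stop argument, current default.
parser = argparse.ArgumentParser()
parser.add_argument(
    "--stop",
    default="###",
    type=str,
    help="Stop token (use `,` to separate multiple tokens)",
)

default_value = parser.parse_args([]).stop            # flag omitted
empty_value = parser.parse_args(["--stop", ""]).stop  # explicit empty string


def naive_split(raw: str) -> List[str]:
    """Assumed (not confirmed) way of splitting the flag on commas."""
    return raw.split(",")


def filtered_split(raw: str) -> List[str]:
    """Same split, but dropping empty entries."""
    return [tok for tok in raw.split(",") if tok]


print(repr(default_value))
print(repr(empty_value))
print(naive_split(empty_value))     # contains one empty string
print(filtered_split(empty_value))  # empty list
```

Whether an empty stop string is harmless depends on the generation backend, so filtering empty entries as in `filtered_split` is the more defensive choice.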

@RiddleHe
Author

Thanks for the info. We have talked to a few labs, and none of them were aware of this default value. As a result, many of them fail to reproduce paper results because they simply accept the default. Since it is fairly safe to assume that most people do not know that "###" is the default, we suggest changing it to make things easier for researchers!

@acsezen

acsezen commented Sep 14, 2025

These are default values and not the recommended values (especially for newer models). Please pass an empty string "" for the stop argument.

Will anything break if the default value is left empty?
