Is llama2-7B-chat weaker than llama2-7B? #13

Open

sunyuhan19981208 opened this issue

I got a pass@1 of only about 9.8% for llama2-7B-chat on HumanEval using your script:

{'pass@1': 0.0975609756097561}
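For anyone checking the number, here is a minimal sketch of how a pass@1 value like this can be computed from per-problem results. It assumes the script uses the standard unbiased pass@k estimator, which I have not verified. The function names and the per-problem breakdown (164 problems, one sample each, 16 solved) are hypothetical, chosen only because they give the same value.

```python
import math


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem with n samples, c of them correct."""
    if n - c < k:
        # Fewer than k incorrect samples: every draw of k contains a correct one.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)


def mean_pass_at_k(results, k=1):
    """Average pass@k over problems; results is a list of (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical breakdown, not taken from the actual run:
# 164 problems, 1 sample per problem, 16 problems solved.
results = [(1, 1)] * 16 + [(1, 0)] * 148
score = mean_pass_at_k(results, k=1)
print({"pass@1": score})
```

With one sample per problem, pass@1 reduces to the fraction of problems solved, so a gap between the chat and base models may come from how completions are extracted from the chat output rather than from the metric itself.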
