Berkeley Function Calling Leaderboard Updates (v1.2) #869
ShishirPatil announced in Announcements
Highlights
🏆 Berkeley Function Calling Leaderboard V3 with Multi-step and Multi-turn function call evaluation
What's Changed
- [BFCL] Add New Model `o1-preview-2024-09-12` and `o1-mini-2024-09-12` by @HuanzhiMao in #635
- [BFCL] Robustness Patch for `_multi_threaded_inference` by @HuanzhiMao in #754
- [BFCL] Remove `Llama-3.2-3B-Instruct-FC` and `Llama-3.2-1B-Instruct-FC` from Leaderboard by @HuanzhiMao in #749
- [BFCL Chore] Supply `data_multi_turn.csv` for Multi-Turn Evaluation Results by @HuanzhiMao in #762
- [BFCL] Remove Duplicate Line in `record_cost_latency` by @HuanzhiMao in #767
- [BFCL] Add `claude-3-5-haiku-20241022`, `claude-3-5-haiku-20241022-FC`, `claude-3-5-sonnet-20241022`, `claude-3-5-sonnet-20241022-FC` by @HuanzhiMao in #750
- [BFCL] Add New Model `Qwen/Qwen2.5-72B-Instruct` by @HuanzhiMao in #787
- [BFCL Chore] Add `@final` and `@overrides` Decorators to Class Methods in Model Handler by @VishnuSuresh27 in #790
- [BFCL Chore] Quick fix change of decorators from `@overrides` to `@override` by @VishnuSuresh27 in #797
- [BFCL] Add Amazon Models `nova-pro-v1.0`, `nova-lite-v1.0`, and `nova-micro-v1.0` by @HuanzhiMao in #815
- [BFCL Chore] Revamp `README.md` for Clearer Instructions by @HuanzhiMao in #819
- [BFCL] Add New Model `Llama-3.3-70B-Instruct`, `Llama-3.3-70B-Instruct-FC` by @HuanzhiMao in #837
- [BFCL] Add `o1-2024-12-17` and `o1-2024-12-17-FC` by @HuanzhiMao in #840
- [BFCL] Add `Qwen2.5-0.5B-Instruct`, `Qwen2.5-3B-Instruct`, `Qwen2.5-14B-Instruct`, `Qwen2.5-32B-Instruct` by @HuanzhiMao in #842
- [BFCL] Add New Model `watt-tool-8B` and `watt-tool-70B` by @zhanghanduo in #847
- [BFCL] Add `gemini-2.0-flash-exp-FC`, `gemini-2.0-flash-exp`, `gemini-exp-1206-FC`, `gemini-exp-1206` by @HuanzhiMao in #843
- [BFCL] Use `N/A` in Score Report for Unevaluated Categories by @HuanzhiMao in #849
- [BFCL] Add Mistral Local Serving Handler and Add New Model `mistralai/Ministral-8B-Instruct-2410` by @HuanzhiMao in #855
- [BFCL] Add New Model `DeepSeek-V3` by @HuanzhiMao in #857
- [BFCL] Rename Directories: `proprietary_model` -> `api_inference`, `oss_model` -> `local_inference` for Better Clarity by @HuanzhiMao in #859
New Contributors
- @zhanghanduo made their first contribution in #847
Full Changelog: v1.1...v1.2
This discussion was created from the release Berkeley Function Calling Leaderboard Updates (v1.2).