`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) #11585

ochafik · 2025-02-02T01:58:37Z

Command R7B native tool call format:
- Extract normal responses vs. tool calls w/ planning
- Returning message.tool_plan in the API if available)
  - Note: may need to revisit this when supporting thinking tokens of R1 and such: will we want a single thinking field? What does DeepSeek's API do?
- Ensured neither CohereForAI/c4ai-command-r-v01 nor CohereForAI/c4ai-command-r-plus trigger detection (different format)
Cleaned up false triggers --> introduced preserved_tokens
Cleaned up tests: only test grammars from first trigger
Updated README w/ models that work well with this agent tutorial

Note

Needs a template override:

llama-server --jinja -fa -hf bartowski/c4ai-command-r7b-12-2024-GGUF:Q6_K_L \
  --chat-template-file <( python scripts/get_chat_template.py CohereForAI/c4ai-command-r7b-12-2024 tool_use )

cf. #9639

…st cleanup

ggerganov · 2025-02-02T07:19:57Z

examples/server/server.cpp

+                    if (ids.size() == 1) {
+                        LOG_DBG("Preserved token: %d\n", ids[0]);
+                        params.sampling.preserved_tokens.insert(ids[0]);
+                    }


Here we don't handle ids.size() > 1

Ah good point, just added a comment + ~~debug~~ warning log for now, should only happen when using a native tool call format with an incompatible model (e.g. wrong template override)

…I) (ggml-org#11585) * `tool-call`: support Command R7B (w/ tool_plan return) * `tool-call`: cleaner preservation of tokens + warn when likely bad chat template override * `tool-call`: test cleanup / handle lazy grammar triggers

tool-call: Command R7B (w/ tool_plan return), preserved_tokens & te…

e44a8eb

…st cleanup

github-actions bot added testing Everything test related examples server labels Feb 2, 2025

ochafik added 3 commits February 2, 2025 02:02

fix test-chat

548ac5a

rm msg.thoughts (that's for later / R1)

a28d9be

use set::find

5fd28b3

ochafik marked this pull request as ready for review February 2, 2025 02:07

ochafik requested a review from ngxson as a code owner February 2, 2025 02:07

ochafik mentioned this pull request Feb 2, 2025

Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars #9639

Merged

41 tasks

ggerganov approved these changes Feb 2, 2025

View reviewed changes

ochafik added 3 commits February 2, 2025 09:00

comment / warn about preserved tokens not being single tokens

3a37ae4

make multi token not preserved warning more actionable

a278637

bump multi token not preserved log to warning

4a5b654

ochafik merged commit bfcce4d into ggml-org:master Feb 2, 2025
45 checks passed

ochafik deleted the command-r7b branch February 2, 2025 09:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) #11585

`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) #11585

ochafik commented Feb 2, 2025 •

edited

Loading

ggerganov Feb 2, 2025

ochafik Feb 2, 2025 •

edited

Loading

tool-call: support Command R7B (+ return tool_plan "thoughts" in API) #11585

tool-call: support Command R7B (+ return tool_plan "thoughts" in API) #11585

Conversation

ochafik commented Feb 2, 2025 • edited Loading

ggerganov Feb 2, 2025

Choose a reason for hiding this comment

ochafik Feb 2, 2025 • edited Loading

Choose a reason for hiding this comment

`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) #11585

`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) #11585

ochafik commented Feb 2, 2025 •

edited

Loading

ochafik Feb 2, 2025 •

edited

Loading