One .NET library to consume OpenAI, Anthropic, Cohere, Google, Azure, Groq, and self-hosed APIs.
-
Updated
Feb 28, 2025 - C#
One .NET library to consume OpenAI, Anthropic, Cohere, Google, Azure, Groq, and self-hosed APIs.
Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a small set of examples and anti-examples, then detect which item truly fits that theme among a collection of misleading candidates.
Benchmark that evaluates LLMs using 436 NYT Connections puzzles
Add a description, image, and links to the sonnet3-7 topic page so that developers can more easily learn about it.
To associate your repository with the sonnet3-7 topic, visit your repo's landing page and select "manage topics."