Peter Steinberger
6ce17db11a
fix: gate max thinking by model support
2026-04-21 07:02:43 +01:00
Peter Steinberger
0da5e0e34e
fix(openai): tighten gpt prompt contract
2026-04-21 06:14:54 +01:00
Peter Steinberger
2641b052dc
fix: align OpenAI reasoning effort handling
2026-04-21 04:58:31 +01:00
Peter Steinberger
bd0c9024a2
docs: document Kimi cost live smoke
2026-04-21 03:10:56 +01:00
Sliverp
b938e6398b
feat: add tiered model pricing support ( #67605 )
...
Adds tiered model pricing support for cost tracking, keeps configured pricing ahead of cached catalog values, and includes latest Moonshot Kimi K2.6/K2.5 cost estimates.\n\nThanks @sliverp.
2026-04-21 03:02:57 +01:00
Peter Steinberger
525e66e513
fix(openai): use tagged GPT-5 prompt contract
2026-04-21 02:45:17 +01:00
Peter Steinberger
82b8a4aab6
docs(openai): clarify GPT-5 prompt defaults
2026-04-21 02:36:16 +01:00
Peter Steinberger
ab03d4e037
fix(openai): default GPT-5 prompt overlay
2026-04-21 02:36:16 +01:00
aniaan
c8e5150fd4
feat(moonshot): default to Kimi K2.6 with K2.6-only thinking.keep support ( #68816 )
...
Merged via squash.
Prepared head SHA: ed54e02842
Co-authored-by: aniaan <40813941+aniaan@users.noreply.github.com >
Co-authored-by: odysseus0 <8635094+odysseus0@users.noreply.github.com >
Reviewed-by: @odysseus0
2026-04-20 18:04:49 -07:00
Peter Steinberger
8dc756747b
docs: update GitHub Copilot default model
2026-04-20 14:19:26 +01:00
Peter Steinberger
28fe0296c4
fix: support Gemini latest thinking config
2026-04-18 19:22:27 +01:00
Barron Roth
bf59917cd1
fix: add Google Gemini TTS provider ( #67515 ) (thanks @barronlroth)
...
* Add Google Gemini TTS provider
* Remove committed planning artifact
* Explain Google media provider type shape
* google: distill Gemini TTS provider
* fix: add Google Gemini TTS provider (#67515 ) (thanks @barronlroth)
* fix: honor cfg-backed Google TTS selection (#67515 ) (thanks @barronlroth)
* fix: narrow Google TTS directive aliases (#67515 ) (thanks @barronlroth)
---------
Co-authored-by: Ayaan Zaidi <hi@obviy.us >
2026-04-16 11:54:35 +05:30
Ayaan Zaidi
33154ce745
fix: simplify ollama onboarding ( #67005 )
...
* feat(ollama): split interactive cloud and local setup
* test(ollama): cover cloud onboarding flow
* docs(ollama): simplify provider setup docs
* docs(onboarding): update ollama wizard copy
* fix(ollama): restore web search auth helper
* fix(ollama): harden setup auth and ssrf handling
* fix(ollama): address review regressions
* fix(ollama): scope ssrf hardening to ollama
* feat(ollama): add hybrid onboarding mode
* fix(ollama): tighten cloud credential setup
* refactor(ollama): distill host-backed setup modes
* fix(ollama): preserve cloud api key in config
* fix: simplify ollama onboarding (#67005 )
2026-04-15 19:06:21 +05:30
Pengfei Ni
88d3620a85
feat(github-copilot): add embedding provider for memory search ( #61718 )
...
Merged via squash.
Prepared head SHA: 05a78ce7f2
Co-authored-by: feiskyer <676637+feiskyer@users.noreply.github.com >
Co-authored-by: vincentkoc <25068+vincentkoc@users.noreply.github.com >
Reviewed-by: @vincentkoc
2026-04-15 10:39:28 +01:00
Rugved Somwanshi
0cfb83edfa
feat: LM Studio Integration ( #53248 )
...
* Feat: LM Studio Integration
* Format
* Support usage in streaming true
Fix token count
* Add custom window check
* Drop max tokens fallback
* tweak docs
Update generated
* Avoid error if stale header does not resolve
* Fix test
* Fix test
* Fix rebase issues
Trim code
* Fix tests
Drop keyless
Fixes
* Fix linter issues in tests
* Update generated artifacts
* Do not have fatal header resoltuion for discovery
* Do the same for API key as well
* fix: honor lmstudio preload runtime auth
* fix: clear stale lmstudio header auth
* fix: lazy-load lmstudio runtime facade
* fix: preserve lmstudio shared synthetic auth
* fix: clear stale lmstudio header auth in discovery
* fix: prefer lmstudio header auth for discovery
* fix: honor lmstudio header auth in warmup paths
* fix: clear stale lmstudio profile auth
* fix: ignore lmstudio env auth on header migration
* fix: use local lmstudio setup seam
* fix: resolve lmstudio rebase fallout
---------
Co-authored-by: Frank Yang <frank.ekn@gmail.com >
2026-04-13 15:22:44 +08:00
Vincent Koc
90fac50987
docs(providers): fill undocumented capability gaps (TTS, media understanding, embeddings, xSearch, env vars)
2026-04-12 12:06:18 +01:00
Vincent Koc
93f2da8426
docs(providers): fix missing titles, tidy sidebar names, alphabetize provider nav
2026-04-12 11:41:31 +01:00
Vincent Koc
571c4db5d4
docs(providers): improve openrouter, nvidia, deepseek, opencode-go with Mintlify components
2026-04-12 11:37:09 +01:00
Vincent Koc
7de76ac6e3
docs(providers): improve opencode, glm, runway, perplexity-provider, vercel-ai-gateway with Mintlify components
2026-04-12 11:34:59 +01:00
Vincent Koc
0d9eca0e1a
docs(providers): improve mistral, zai, alibaba, cloudflare-ai-gateway, fireworks with Mintlify components
2026-04-12 11:31:43 +01:00
Vincent Koc
4d3ce427ad
docs(providers): improve qianfan, xiaomi, kilocode, arcee, github-copilot with Mintlify components
2026-04-12 11:28:32 +01:00
Vincent Koc
4081603ad5
docs(providers): improve chutes, synthetic, together, volcengine, deepgram with Mintlify components
2026-04-12 11:24:24 +01:00
Vincent Koc
e7076617f9
docs(providers): improve sglang, fal, groq, bedrock-mantle, vllm with Mintlify components
2026-04-12 11:20:58 +01:00
Vincent Koc
81d32c05f4
docs(providers): improve claude-max-api-proxy, litellm, stepfun, vydra, xai with Mintlify components
2026-04-12 11:17:49 +01:00
Vincent Koc
2b68af784f
docs(providers): improve moonshot, qwen, comfy, huggingface, inferrs with Mintlify components
2026-04-12 11:10:48 +01:00
Vincent Koc
279f82ba5f
docs(providers): improve ollama, google, bedrock, minimax, venice with Mintlify components
2026-04-12 11:01:48 +01:00
Vincent Koc
af38536fb9
docs(providers): improve Anthropic doc with Mintlify Steps, Tabs, Accordions, and Cards
2026-04-12 10:47:44 +01:00
Vincent Koc
1cff54c783
docs(providers): improve OpenAI doc with Mintlify Steps, Tabs, Accordions, and Cards
2026-04-12 10:44:59 +01:00
Peter Steinberger
e1b2ae235a
docs: clarify strict-agentic and codex modes
2026-04-11 17:13:40 +01:00
Peter Steinberger
c3aeb71f74
feat(fal): add HeyGen video-agent model
2026-04-11 02:58:04 +01:00
Peter Steinberger
b56cd114e7
feat: add Seedance 2 fal video models
2026-04-11 02:18:31 +01:00
Peter Steinberger
39cc6b7dc7
fix: stabilize character eval and Qwen model routing
2026-04-09 01:04:09 +01:00
Eric Curtin
0de5db8772
docs(inferrs): fix Gemma model id from gg-hf-gg to google ( #62586 )
2026-04-08 10:15:07 -04:00
Vincent Koc
3e7e6f2f60
docs: cover 2026.4.7 changelog gaps
2026-04-08 07:26:56 +01:00
Serg
b2456e8037
fix(zai): default to GLM-5.1 instead of GLM-5
2026-04-08 04:38:39 +01:00
Bruce MacDonald
86f35a9bc0
chore(ollama): update suggested onboarding models ( #62626 )
...
Merged via squash.
Prepared head SHA: 48c083b88a
Co-authored-by: BruceMacD <5853428+BruceMacD@users.noreply.github.com >
Co-authored-by: BruceMacD <5853428+BruceMacD@users.noreply.github.com >
Reviewed-by: @BruceMacD
2026-04-07 11:42:29 -07:00
Peter Steinberger
9d4b0d551d
fix: support inferrs string-only completions
2026-04-07 15:55:20 +01:00
nv-kasikritc
d43cc470c6
refactor(nvidia-endpoints): updated language & default models ( #59866 )
...
* fix(nvidia-endpoints): updated language & default models
* fix(nvidia-endpoints): updated link for api key
* fix(nvidia-endpoints): removed unused const
* fix(nvidia-endpoints): edited max tokens
* fix(nvidia-endpoints): fixed typo
---------
Co-authored-by: Devin Robison <drobison00@users.noreply.github.com >
2026-04-07 08:47:29 -06:00
Peter Steinberger
c2f9de3935
feat: unify live cli backend probes
2026-04-07 10:35:24 +01:00
Neerav Makwana
b9179ee4b6
Docs: match Greptile wording for magistral-* line
...
Made-with: Cursor
2026-04-07 12:52:47 +05:30
Neerav Makwana
68bfc6fcf5
Mistral: enable reasoning_effort for mistral-small-latest
...
Made-with: Cursor
2026-04-07 12:52:47 +05:30
Peter Steinberger
bc18e69fbf
fix: separate arcee auth envs from openrouter
2026-04-06 19:53:27 +01:00
arthurbr11
95106be59b
feat: enhance Arcee AI provider with OpenRouter support and update onboarding instructions
2026-04-06 19:53:27 +01:00
arthurbr11
5ac2f58c57
feat: add Arcee AI provider plugin
...
Add a bundled Arcee AI provider plugin with ARCEEAI_API_KEY onboarding,
Trinity model catalog (mini, large-preview, large-thinking), and
OpenAI-compatible API support.
- Trinity Large Thinking: 256K context, reasoning enabled
- Trinity Large Preview: 128K context, general-purpose
- Trinity Mini 26B: 128K context, fast and cost-efficient
2026-04-06 19:53:27 +01:00
Peter Steinberger
f9c721d5bf
fix: add vydra kling live lane
2026-04-06 19:47:43 +01:00
Vincent Koc
e7fe087677
fix(openai): normalize prompt overlay personality config
2026-04-06 17:24:51 +01:00
Peter Steinberger
0c5e6037b0
fix(openai): clarify auth routes in picker and docs
2026-04-06 16:14:51 +01:00
Peter Steinberger
ac38f332c5
fix(anthropic): prefer claude cli over setup-token
2026-04-06 15:31:07 +01:00
Peter Steinberger
d378a504ac
fix: restore claude cli guidance and doctor behavior
2026-04-06 14:21:11 +01:00
Peter Steinberger
c39f061003
Revert "refactor(cli): remove bundled cli text providers"
...
This reverts commit 05d351c430 .
2026-04-06 13:40:41 +01:00