503 commits

Author SHA1 Message Date
djholtby
73bfc936a0 Close response generator when stopping API generation (#7014) 2025-05-26 22:39:03 -03:00
oobabooga
83bfd5c64b Fix API issues 2025-05-18 12:45:01 -07:00
oobabooga
076aa67963 Fix API issues 2025-05-17 22:22:18 -07:00
oobabooga
470c822f44 API: Hide the uvicorn access logs from the terminal 2025-05-16 12:54:39 -07:00
oobabooga
fd61297933 Lint 2025-05-15 21:19:19 -07:00
oobabooga
c375b69413 API: Fix llama.cpp generating after disconnect, improve disconnect detection, fix deadlock on simultaneous requests 2025-05-13 11:23:33 -07:00
oobabooga
0c5fa3728e Revert "Fix API failing to cancel streams (attempt), closes #6966"
This reverts commit 006a866079.
2025-05-10 19:12:40 -07:00
oobabooga
006a866079 Fix API failing to cancel streams (attempt), closes #6966 2025-05-10 17:55:48 -07:00
Jonas
fa960496d5 Tools support for OpenAI compatible API (#6827) 2025-05-08 12:30:27 -03:00
Alireza Ghasemi
99bd66445f SuperboogaV2: minor update to avoid json serialization errors #6945 2025-05-05 19:04:06 -03:00
oobabooga
f82667f0b4 Remove more multimodal extension references 2025-05-05 14:17:00 -07:00
oobabooga
85bf2e15b9 API: Remove obsolete multimodal extension handling
Multimodal support will be added back once it's implemented in llama-server.
2025-05-05 14:14:48 -07:00
oobabooga
d10bded7f8 UI: Add an enable_thinking option to enable/disable Qwen3 thinking 2025-04-28 22:37:01 -07:00
oobabooga
bbcaec75b4 API: Find a new port if the default one is taken (closes #6918) 2025-04-27 21:13:16 -07:00
oobabooga
35717a088c API: Add an /v1/internal/health endpoint 2025-04-26 15:42:27 -07:00
oobabooga
bc55feaf3e Improve host header validation in local mode 2025-04-26 15:42:17 -07:00
oobabooga
d9de14d1f7 Restructure the repository (#6904) 2025-04-26 08:56:54 -03:00
Ziya
60ac495d59 extensions/superboogav2: existing embedding check bug fix (#6898) 2025-04-24 12:42:05 -03:00
oobabooga
d5e1bccef9 Remove the SpeechRecognition requirement 2025-04-20 11:47:28 -07:00
oobabooga
ae02ffc605 Refactor the transformers loader (#6859) 2025-04-20 13:33:47 -03:00
oobabooga
ae54d8faaa New llama.cpp loader (#6846) 2025-04-18 09:59:37 -03:00
oobabooga
4ed0da74a8 Remove the obsolete 'multimodal' extension 2025-04-09 20:09:48 -07:00
Shixian Sheng
cbffcf67ef Fix links in the ngrok extension README (#6826) 2025-04-02 14:28:29 -03:00
SeanScripts
60d67994d9 Perplexity colors extension updates (#6764) 2025-03-14 16:45:53 -03:00
oobabooga
5bcd2d7ad0 Add the top N-sigma sampler (#6796) 2025-03-14 16:45:11 -03:00
SeanScripts
b131f86584 Perplexity colors extension v2 (#6756) 2025-02-18 11:56:28 -03:00
Alireza Ghasemi
01f20d2d9f Improve SuperboogaV2 with Date/Time Embeddings, GPU Support, and Multiple File Formats (#6748) 2025-02-17 22:38:15 -03:00
Manuel Schmid
b54bf359bf sd_api_pictures model reload fix (#6720) 2025-02-03 00:11:49 -03:00
oobabooga
edbe0af647 Minor fixes after 0360f54ae8 2025-02-02 17:04:56 -08:00
oobabooga
0360f54ae8 UI: add a "Show after" parameter (to use with DeepSeek </think>) 2025-02-02 15:30:09 -08:00
oobabooga
f01cc079b9 Lint 2025-01-29 14:00:59 -08:00
FP HAM
5d6f3e6f92 Training pro- removed monkeypatch references (#6695) 2025-01-24 11:23:44 -03:00
FP HAM
4bd260c60d Give SillyTavern a bit of leaway the way the do OpenAI (#6685) 2025-01-22 12:01:44 -03:00
oobabooga
1ef748fb20 Lint 2025-01-14 16:44:15 -08:00
Lounger
ed16374ece Fix the gallery extension (#6656) 2025-01-11 23:35:22 -03:00
oobabooga
83c426e96b Organize internals (#6646) 2025-01-10 18:04:32 -03:00
BPplays
619265b32c add ipv6 support to the API (#6559) 2025-01-09 10:23:44 -03:00
FP HAM
03a0f236a4 Training_PRO fix: add if 'quantization_config' in shared.model.config.to_dict() 2025-01-08 16:54:09 -03:00
oobabooga
11af199aff Add a "Static KV cache" option for transformers 2025-01-04 17:52:57 -08:00
Aluísio Pires
2bea4dfa96 Fix an issue caused during the installation of tts (#6496) 2024-12-18 18:16:56 -03:00
hronoas
9b3a3d8f12 openai extension fix: Handle Multiple Content Items in Messages (#6528) 2024-11-18 11:59:52 -03:00
SeanScripts
e1338a1804 Add whisper turbo (#6423) 2024-10-01 17:49:35 -03:00
Philipp Emanuel Weidmann
301375834e Exclude Top Choices (XTC): A sampler that boosts creativity, breaks writing clichés, and inhibits non-verbatim repetition (#6335) 2024-09-27 22:50:12 -03:00
Jean-Sylvain Boige
4924ee2901 typo in OpenAI response format (#6365) 2024-09-05 21:42:23 -03:00
Stefan Merettig
9a150c3368 API: Relax multimodal format, fixes HuggingFace Chat UI (#6353) 2024-09-02 23:03:15 -03:00
FartyPants (FP HAM)
41a8eb4eeb Training pro update script.py (#6359) 2024-09-02 23:00:15 -03:00
oobabooga
addcb52c56 Make --idle-timeout work for API requests 2024-07-28 18:31:40 -07:00
TimStrauven
8074fba18d Whisper stt overhaul js (#6194)
Co-authored-by: RandoInternetPreson <aaronalai1@gmail.com>
2024-07-01 23:27:18 -03:00
oobabooga
3e3f8637d6 Fix the AUTOMATIC1111 request in sd-api-pictures (closes #5993) 2024-06-29 11:43:57 -07:00
mamei16
cc825dd1f4 Addressing Whisper STT issues (#5929) 2024-06-29 01:32:54 -03:00