All releases
Auxot Server v0.1.25 April 15, 2026
File attachments, multi-model support, and resilient job infrastructure
v0.1.25 brings file attachment support in agent chat, multi-model provider selection for Claude Code, Discord/Slack pass() tool with reprompt loop, prompt cache token metrics, and sweeping improvements to job queue resilience and cloud inference stability.
Features
- File attachments in agent chat — Added a unified file attach button that routes images to the vision pipeline and all other file types to the VM, with production hardening for edge cases.
- Multi-model provider support — Agents can now be configured with multiple model providers. Includes a Claude Code CLI integration using short aliases (
sonnet,opus,haiku) and a checklist UI for managing allowed and default models. - Discord/Slack:
pass()tool and reprompt loop — Introduced apass()tool for Discord and Slack integrations, along with a reprompt loop and fixes to message threading behavior. - Prompt cache token metrics — Inference events now surface prompt cache token counts, giving operators visibility into caching efficiency.
- Cloud provider refresh UI — Added a cloud provider refresh action and an allowed-models checklist to the provider settings UI.
- Resilient job recovery and client-abandon cancellation — The job queue now recovers stalled jobs and cancels work when a client abandons a session.
- OpenUI datasource cache and coordinator API routing — OpenUI gained a datasource cache layer, scoped app/dashboard views, and proper routing through the coordinator API.
Improvements
- User profiles now include first and last name fields; applied a UI refresh alongside updated design system documentation.
- Pre-inference queue depth is now visible in the chat interface, and the job poller is capped by per-consumer PEL size to prevent runaway polling.
- Allowed and default model lists are now persisted correctly for cloud providers, with future-proof model list handling.
Fixes
- Hardened cloud inference against hangs and connection races during LLM and Redis operations.
- Fixed GPU job lifecycle to prevent stuck PEL entries and infinite retry loops on failed jobs.
- Tool loop errors now hard-fail instead of silently continuing; all ACKs now include an
XDelto prevent message redelivery. - OAuth public base URL is now correctly derived from proxy headers when the server runs behind an ingress.
- File input filter was broadened from images-only to accept all file types.
Internal
- Added a
make environmenttarget to streamline worktree and CI setup. - Fixed stub interface drift across Discord, Slack, and WebSocket packages in the test suite.