* Remove CLI command hint from unknown error messages
Since #6303 introduced specific error messages for many cases,
the generic "check with 'ha supervisor logs'" hint in unknown
error messages is no longer as useful. Remove the CLI command
part while keeping the "Check supervisor logs for details" rider.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Use "Supervisor logs" capitalization consistently
Co-authored-by: Jan Čermák <sairon@users.noreply.github.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Jan Čermák <sairon@users.noreply.github.com>
* Add specific error message for registry authentication failures
When a Docker image pull fails with 401 Unauthorized and registry
credentials are configured, raise DockerRegistryAuthError instead of
a generic DockerError. This surfaces a clear message to the user
("Docker registry authentication failed for <registry>. Check your
registry credentials") instead of "An unknown error occurred with
addon <name>".
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
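A minimal sketch of the pattern this commit describes, assuming aiodocker's
DockerError carries the HTTP status; the exception names match the commit,
but the pull plumbing is illustrative:

```python
from aiodocker.exceptions import DockerError as AioDockerError


class DockerError(Exception):
    """Generic Docker error (Supervisor-side)."""


class DockerRegistryAuthError(DockerError):
    """Registry authentication failed."""


async def pull_image(docker, image: str, registry: str, has_credentials: bool):
    """Pull an image, surfacing 401s as a specific auth error."""
    try:
        await docker.images.pull(image)
    except AioDockerError as err:
        if err.status == 401 and has_credentials:
            raise DockerRegistryAuthError(
                f"Docker registry authentication failed for {registry}. "
                "Check your registry credentials"
            ) from err
        raise DockerError(str(err)) from err
```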
* Add tests for registry authentication error handling
Test that a 401 during image pull raises DockerRegistryAuthError when
credentials are configured, and falls back to generic DockerError
when no credentials are present.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Add tests for addon install/update/rebuild auth failure handling
Test that DockerRegistryAuthError propagates correctly through
addon install, update, and rebuild paths without being wrapped
in a generic AddonUnknownError.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Include Docker registry configurations in backups
Docker registry credentials were removed from backup metadata in a prior
change to avoid exposing secrets in unencrypted data. Now that the encrypted
supervisor.tar inner archive exists, add docker.json alongside mounts.json
to securely backup and restore registry configurations.
On restore, registries from the backup are merged with any existing ones.
Old backups without docker.json are handled gracefully.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Increase test coverage by testing more error paths
* Address review feedback for Docker registry backup
Remove unnecessary dict() copy when serializing registries for backup
since the property already returns a dict.
Change DockerConfig.registries to use direct key access instead of
.get() with a default. The schema guarantees the key exists, and
.get() with a default would return a detached temporary dict that
silently discards updates.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
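To illustrate the .get() pitfall mentioned above, a reduced sketch (a generic
config holder, not Supervisor's actual DockerConfig class):

```python
class DockerConfig:
    def __init__(self, data: dict):
        self._data = data  # the schema guarantees "registries" exists

    @property
    def registries(self) -> dict:
        # With .get("registries", {}) a missing key would return a fresh,
        # detached dict, so writes to it would be silently discarded.
        # Direct access returns the live dict backed by the stored config.
        return self._data["registries"]


config = DockerConfig({"registries": {}})
config.registries["ghcr.io"] = {"username": "me", "password": "secret"}
assert config._data["registries"]["ghcr.io"]["username"] == "me"
```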
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Reuse IMAGE_REGISTRY_REGEX for docker_image validation
Replace the monolithic regex in docker_image validator with a
function-based approach that reuses get_registry_from_image() from
docker.utils for robust registry detection. This properly handles
domains, IPv4/IPv6 addresses, ports, and localhost while still
rejecting tags (managed separately by the add-on system).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* Address review feedback: reorder checks for efficiency
Check falsy value before isinstance, and empty path before tag check,
as suggested in PR review.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
aiodocker derives ServerAddress for X-Registry-Auth by doing
image.partition("/"). For Docker Hub images like
"homeassistant/amd64-supervisor", this extracts "homeassistant"
(the namespace) instead of "docker.io" (the registry).
With the classic graphdriver image store, ServerAddress was never
checked and credentials were sent regardless. With the containerd
image store (default since Docker v29 / HAOS 15), the resolver
compares ServerAddress against the actual registry host and silently
drops credentials on mismatch, falling back to anonymous access.
Fix by prefixing Docker Hub images with "docker.io/" when registry
credentials are configured, so aiodocker sets ServerAddress correctly.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
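A sketch of the normalization this commit describes; the helper name and the
host heuristic are illustrative, while the partition behavior matches
aiodocker's:

```python
def normalize_hub_image(image: str, has_credentials: bool) -> str:
    """Prefix Docker Hub images so aiodocker derives the right ServerAddress."""
    first, _, _ = image.partition("/")
    # A first component containing a dot or colon (or "localhost") is a
    # registry host; anything else is a Docker Hub namespace.
    is_hub_image = "." not in first and ":" not in first and first != "localhost"
    if has_credentials and is_hub_image:
        return f"docker.io/{image}"
    return image


# aiodocker's image.partition("/") would otherwise treat "homeassistant"
# (the namespace) as the registry address instead of "docker.io".
assert (
    normalize_hub_image("homeassistant/amd64-supervisor", True)
    == "docker.io/homeassistant/amd64-supervisor"
)
```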
* Added support for negative numbers in options
* Do not allow -. as float
* Added tests for integers and floats in options.
* Fixed ruff errors
* Added tests for outside of int/float limits
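A hedged sketch of the kind of pattern involved (the actual option-schema
validation in Supervisor may differ): negatives are accepted, but a bare
"-." is not a float:

```python
import re

_INT_PATTERN = re.compile(r"^-?\d+$")
# Require at least one digit so "-." alone never matches.
_FLOAT_PATTERN = re.compile(r"^-?(\d+\.?\d*|\.\d+)$")

assert _INT_PATTERN.match("-42")
assert _FLOAT_PATTERN.match("-3.14")
assert _FLOAT_PATTERN.match("-.5")
assert not _FLOAT_PATTERN.match("-.")
```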
* Include network storage mount configurations in backups
When creating backups, Supervisor now stores mount configurations (CIFS/NFS shares)
including server, share, credentials, and other settings. When restoring
from a backup, mount configurations are automatically restored.
This fixes the issue where network storage definitions for backups
were lost after restoring from a backup, requiring users to manually
reconfigure their network storage mounts.
Changes:
- Add ATTR_MOUNTS schema to backup validation
- Add store_mounts() method to save mount configs during backup
- Add restore_mounts() method to restore mount configs during restore
- Add MOUNTS stage to backup/restore job stages
- Update BackupManager to call mount backup/restore methods
- Add tests for mount backup/restore functionality
Fixes home-assistant/core#148663
* Address reviewer feedback for mount backup/restore
Changes based on PR review:
- Store mount configs in encrypted mounts.tar instead of unencrypted
backup metadata (security fix for passwords)
- Separate mount restore into config save + async activation tasks
(mounts activate in background, failures don't block restore)
- Add replace_default_backup_mount parameter to control whether to
overwrite existing default mount setting
- Remove unnecessary broad exception handler for default mount setter
- Simplify schema: ATTR_MOUNTS is now just a boolean flag since
actual data is in the encrypted tar file
- Update tests to reflect new async API and return types
* Fix code review issues in mount backup/restore
- Add bind mount handling for MEDIA and SHARE usage types in
_activate_restored_mount() to mirror MountManager.create_mount()
- Fix double save_data() call by using needs_save flag
- Import MountUsage const for usage type checks
* Add pylint disable comments for protected member access
* Tighten broad exception handlers in mount backup restore
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Address second round of reviewer feedback
- Catch OSError separately and check errno.EBADMSG for drive health
- Validate mounts JSON against SCHEMA_MOUNTS_CONFIG before importing
- Use mount_data[ATTR_NAME] instead of .get("name", "unknown")
- Overwrite existing mounts on restore instead of skipping
- Move restore_mount/activate logic to MountManager (no more
protected-access in Backup)
- Drop unused replace_default_backup_mount parameter
- Fix test_backup_progress: add mounts stage to expected events
- Fix test_store_mounts: avoid create_mount which requires dbus
* Rename mounts.tar to supervisor.tar for generic supervisor config
Rename the inner tar from mounts.tar to supervisor.tar so it can hold
multiple config files (mounts.json now, docker credentials later).
Rename store_mounts/restore_mounts to store_supervisor_config/
restore_supervisor_config and update stage names accordingly.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
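The shape of the nested archive, sketched with plain tarfile for brevity;
Supervisor uses securetar here so supervisor.tar is encrypted along with the
rest of the backup:

```python
import io
import json
import tarfile


def add_supervisor_config(backup_tar: tarfile.TarFile, mounts: list[dict]) -> None:
    """Write mounts.json into a nested supervisor.tar member."""
    inner = io.BytesIO()
    with tarfile.open(fileobj=inner, mode="w") as supervisor_tar:
        payload = json.dumps({"mounts": mounts}).encode()
        info = tarfile.TarInfo("mounts.json")
        info.size = len(payload)
        supervisor_tar.addfile(info, io.BytesIO(payload))
    inner.seek(0)
    member = tarfile.TarInfo("supervisor.tar")
    member.size = inner.getbuffer().nbytes
    backup_tar.addfile(member, inner)
```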
* Fix pylint protected-access and test timeouts in backup tests
- Add pylint disable comment for _mounts protected access in test_backup.py
- Mock restore_supervisor_config in test_full_backup_to_mount and
test_partial_backup_to_mount to avoid D-Bus mount activation during
restore that causes timeouts in the test environment
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Address agners review feedback
- Change create_inner_tar() to create_tar() per #6575
- Remove "r" argument from SecureTarFile (now read-only by default)
- Change warning to info for missing mounts tar (old backups won't have it)
- Narrow exception handler to (MountError, vol.Invalid, KeyError, OSError)
* Update supervisor/backups/backup.py
* Address agners feedback: remove metadata flag, add mount feature check
- Remove ATTR_SUPERVISOR boolean flag from backup metadata; instead
check for physical presence of supervisor.tar (like folder backups)
- Remove has_supervisor_config property
- Always attempt supervisor config restore (tar existence check handles it)
- Add HostFeature.MOUNT check in _activate_restored_mount before
attempting to activate mounts on systems without mount support
---------
Co-authored-by: Stefan Agner <stefan@agner.ch>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Mike Degatano <michael.degatano@gmail.com>
systemd only emits bus signals (including PropertiesChanged) when at
least one client has called Subscribe() on the Manager interface. On
regular HAOS systems, systemd-logind calls Subscribe which enables
signals for all bus clients. However, in environments without
systemd-logind (such as the Supervisor devcontainer with systemd), no
signals are emitted, causing the firewall unit wait to time out.
Explicitly calling Subscribe() has no downsides and makes it clear
that the Supervisor relies on these signals. There is no need to call
Unsubscribe() as systemd automatically tracks clients and stops
emitting signals when all subscribers have disconnected from the bus.
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
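A sketch of the call, assuming dbus-fast (which Supervisor uses for D-Bus);
error handling omitted:

```python
from dbus_fast import BusType
from dbus_fast.aio import MessageBus


async def subscribe_to_systemd_signals() -> None:
    bus = await MessageBus(bus_type=BusType.SYSTEM).connect()
    introspection = await bus.introspect(
        "org.freedesktop.systemd1", "/org/freedesktop/systemd1"
    )
    proxy = bus.get_proxy_object(
        "org.freedesktop.systemd1", "/org/freedesktop/systemd1", introspection
    )
    manager = proxy.get_interface("org.freedesktop.systemd1.Manager")
    # Without this, systemd emits no PropertiesChanged signals unless some
    # other client (normally systemd-logind) has already subscribed.
    await manager.call_subscribe()
```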
Add iptables rules via a systemd transient unit to drop traffic
addressed to the bridge gateway IP from non-bridge interfaces.
The firewall manager waits for the transient unit to complete and
verifies success via D-Bus property change signals. On failure, the
system is marked unhealthy and host-network add-ons are prevented
from booting.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
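The rule being installed, sketched as an argument list; the bridge name and
gateway IP are the usual hassio defaults (assumptions here) and the
transient-unit plumbing is elided:

```python
BRIDGE_INTERFACE = "hassio"     # Supervisor's Docker bridge (assumed)
BRIDGE_GATEWAY = "172.30.32.1"  # default bridge gateway IP (assumed)


def firewall_rule_args() -> list[str]:
    """iptables arguments: drop traffic to the bridge gateway IP that
    arrives on any interface other than the bridge itself."""
    return [
        "iptables",
        "-I", "INPUT",
        "!", "-i", BRIDGE_INTERFACE,
        "-d", BRIDGE_GATEWAY,
        "-j", "DROP",
    ]
```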
* Wait for addon startup task before unload to prevent data access race
Replace the cancel-based approach in unload() with an await of the outer
_wait_for_startup_task. The container removal and state change resolve the
startup event naturally, so we just need to ensure the task completes
before addon data is removed. This prevents a KeyError on self.name access
when _wait_for_startup times out after data has been removed.
Also simplify _wait_for_startup by removing the unnecessary inner task
wrapper — asyncio.wait_for can await the event directly.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Drop asyncio.sleep() in test_manager.py
* Only clear startup task reference if still the current task
Prevent a race where an older _wait_for_startup task's finally block
could wipe the reference to a newer task, causing unload() to skip
the await.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
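The identity check, sketched with illustrative attribute names:

```python
import asyncio


class Addon:
    def __init__(self) -> None:
        self._startup_task: asyncio.Task | None = None
        self._startup_event = asyncio.Event()

    async def _wait_for_startup(self) -> None:
        try:
            await asyncio.wait_for(self._startup_event.wait(), timeout=120)
        finally:
            # Only drop the reference if it still points at *this* task;
            # otherwise a stale task's cleanup could wipe a newer task's
            # reference and unload() would skip awaiting it.
            if self._startup_task is asyncio.current_task():
                self._startup_task = None
```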
* Reuse existing pending startup wait task when addon is already running
If start() is called while the addon is already running and a startup
wait task is still pending, return the existing task instead of creating
a new one.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Fix fallback time sync, create repair issue if time is out of sync
The "poor man's NTP" using the whois service didn't work because it attempted
to sync the time when the NTP service was enabled, which is rejected by the
timedated service. To fix this, Supervisor now first disables the
systemd-timesyncd service and creates a repair issue before adjusting the time.
The timesyncd service stays disabled until the fixup is submitted. Theoretically,
if the time moves backwards from an invalid time in the future,
systemd-timesyncd could otherwise restore the wrong time from its saved
timestamp if it were only disabled after the time was set.
Also, the sync is now performed if the time is more than 1 hour off, in
both directions (previously it only intervened if the time was more than
3 days in the past).
Fixes #6015, refs #6549
* Update test_adjust_system_datetime_if_time_behind
The core_security check (HA < 2021.1.5 with custom components) and the
ResolutionNotify class that created persistent notifications for it are
no longer needed. The minimum supported HA version is well past 2021.1.5,
so this check can never trigger. The notify module was the only consumer
of persistent notifications and had no other users.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Treat empty string password as None in backup restore
Work around a securetar 2026.2.0 bug where an empty string password
sets encrypted=True but fails to derive a key, leading to an
AttributeError on restore. This also restores consistency with backup
creation which uses a truthiness check to skip encryption for empty
passwords.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
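The normalization in a nutshell (set_password matches Supervisor's method
name; the body is reduced to the relevant check):

```python
class Backup:
    """Sketch: only the password handling from set_password."""

    def set_password(self, password: str | None) -> None:
        # securetar 2026.2.0 marks "" as encrypted but cannot derive a
        # key from it, so treat an empty string as no password, matching
        # the truthiness check already used on backup creation.
        self._password = password or None
```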
* Explicitly mention that "" is treated as no password
* Add tests for empty string password handling in backups
Verify that empty string password is treated as no password on both
backup creation (not marked as protected) and restore (normalized to
None in set_password).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Improve comment
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Use Python 3.14(.3) in CI and base image
Update base image to the latest tag using Python 3.14.3 and update Python
version in CI workflows to 3.14.
With Python 3.14, backports.zstd is no longer necessary as it's now available
in the standard library.
* Update wheels ABI in the wheels builder to cp314
* Use explicit Python fix version in GH actions
Specify Python 3.14.3 explicitly, as the setup-python action otherwise
defaults to 3.14.2 even when 3.14.3 is available, leading to different
versions in CI and in production.
* Update Python version references in pyproject.toml
* Fix all ruff quoted-annotation (UP037) errors
* Revert unquoting of DBus types in tests and ignore UP037 where needed
* Respect auto-update setting for plug-in auto-updates
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Also skip auto-updating plug-ins in decorator
* Raise if auto-update flag is not set and plug-in is not up to date
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
aiohttp's BasicAuth.decode() raises ValueError for any non-Basic auth
method (e.g. Bearer tokens). This propagated as an unhandled exception,
causing a 500 response instead of the expected 401 Unauthorized.
Catch the ValueError in _process_basic() and raise HTTPUnauthorized with
the WWW-Authenticate realm header so clients get a proper 401 response.
Fixes SUPERVISOR-BFG
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
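A sketch of the handler change, assuming aiohttp's server API; the realm
value is illustrative:

```python
from aiohttp import BasicAuth, hdrs, web


def _process_basic(request: web.Request) -> BasicAuth:
    try:
        return BasicAuth.decode(request.headers[hdrs.AUTHORIZATION])
    except ValueError:
        # Non-Basic schemes (e.g. "Bearer <token>") raise ValueError;
        # answer with a proper 401 instead of bubbling up as a 500.
        raise web.HTTPUnauthorized(
            headers={hdrs.WWW_AUTHENTICATE: 'Basic realm="Home Assistant"'}
        ) from None
```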
The _migrate function in addons/validate.py is the first validator in the
SCHEMA_ADDON_CONFIG All() chain and was called directly with raw config data.
If a malformed add-on config file contained a non-dict value (e.g. a string),
config.get() would raise an AttributeError instead of a proper voluptuous
Invalid error, causing an unhandled exception.
Add an isinstance check at the top of _migrate to raise vol.Invalid for
non-dict inputs, letting validation fail gracefully.
Fixes SUPERVISOR-HMP
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
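The guard, sketched; _migrate sits first in the vol.All() chain, so it sees
raw, unvalidated input:

```python
import voluptuous as vol


def _migrate(config):
    if not isinstance(config, dict):
        raise vol.Invalid("Add-on config must be a dictionary")
    # ... actual migration of legacy keys continues here ...
    return config
```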
* Drop unsupported architectures and machines from Supervisor
Since #5620 Supervisor no longer updates the version information on
unsupported architectures and machines. This means users can no longer
update to a newer version of Supervisor since that PR was released.
Furthermore since #6347 we also no longer build for these
architectures. With this, any code related to these architectures
becomes dead code and should be removed.
This commit removes all references to the deprecated architectures and
machines from Supervisor.
This affects the following architectures:
- armhf
- armv7
- i386
And the following machines:
- odroid-xu
- qemuarm
- qemux86
- raspberrypi
- raspberrypi2
- raspberrypi3
- raspberrypi4
- tinker
* Create issue if an add-on using a deprecated architecture is installed
This adds a check to the resolution system to detect if an add-on is
installed that uses a deprecated architecture. If so, it will show a
warning to the user and recommend uninstalling the add-on.
* Formally deprecate machine add-on configs as well
Not only deprecate add-on configs for unsupported architectures, but
also for unsupported machines.
* For installed add-ons architecture must always exist
Fail hard in case of missing architecture, as this is a required field
for installed add-ons. This will prevent the Supervisor from running
with an unsupported configuration and causing further issues down the
line.
* Fix add-on build using wrong architecture for non-native arch add-ons
When building a locally-built add-on (no image tag), the architecture
was always set to sys_arch.default (e.g. amd64 on x86_64) instead of
matching against the add-on's declared architectures. This caused an
i386-only add-on to incorrectly build as amd64.
Use sys_arch.match() against the add-on's declared arch list in all
code paths: the arch property, image name generation, BUILD_ARCH build
arg, and default base image selection.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Use CpuArch enums to fix tests
* Explicitly set _supported_arch as new list to fix tests
* Fix pytests
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Use verbose log output for plug-ins
All three plug-ins which support logging (dns, multicast and audio)
should use the verbose log format by default to make sure the log lines
are annotated with timestamps. Introduce a new default_verbose flag for
advanced logs.
* Use default_verbose for host logs as well
Use the new default_verbose flag for advanced logs, to make it more
explicit that we want timestamps for host logs as well.
The /os/info API endpoint was using the D-Bus property TimeUSec, which is
cached between requests, so the time returned was not always the current
time on the host system at the time of the request. Since there is no
reason to use the D-Bus API for the time (Supervisor runs on the same
machine and time is global), simply format the current datetime object
with Python and return it in the response.
Fixes #6581
* Handle missing Accept header in host logs
Avoid indexing request headers directly in the host advanced logs handler when Accept is absent, preventing KeyError crashes on valid requests without that header. Fixes SUPERVISOR-1939.
* Add pytest
* Ensure uuid of dismissed suggestion/issue matches an existing one
* Fix lint, test and feedback issues
* Adjust existing tests and remove new ones for not found errors
* Fix device access issue usage
* Bump securetar from 2025.12.0 to 2026.2.0
Adapt to the new securetar API:
- Use SecureTarArchive for outer backup tar (replaces SecureTarFile
with gzip=False for the outer container)
- create_inner_tar() renamed to create_tar(), password now inherited
from the archive rather than passed per inner tar
- SecureTarFile no longer accepts a mode parameter (read-only by
default, InnerSecureTarFile for writing)
- Pass create_version=2 to keep protected backups at version 2
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Reformat imports
* Rename _create_cleanup to _create_finalize and update docstring
* Use constant for SecureTar create version
* Add test for SecureTarReadError in validate_backup
securetar >= 2026.2.0 raises SecureTarReadError instead of
tarfile.ReadError for invalid passwords. Catching this exception
and raising BackupInvalidError is required so Core shows the
encryption key dialog to the user.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Handle InvalidPasswordError for v3 backups
* Address typos
* Add securetar v3 encrypted password test fixture
Add a test fixture for a securetar v3 encrypted backup with password.
This will be used in the test suite to verify that the backup
extraction process correctly handles encrypted backups.
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Harden backup tar extraction with Python data filter
Replace filter="fully_trusted" with a custom backup_data_filter that
wraps tarfile.data_filter. This adds protection against symlink attacks
(absolute targets, destination escapes), device node injection, and
path traversal, while resetting uid/gid and sanitizing permissions.
Unlike using data_filter directly, the custom filter skips problematic
entries with a warning instead of aborting the entire extraction. This
ensures existing backups containing absolute symlinks (e.g. in shared
folders) still restore successfully with the dangerous entries omitted.
Also removes the now-redundant secure_path member filtering, as
data_filter is a strict superset of its protections. Fixes a standalone
bug in _folder_restore which had no member filtering at all.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
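A sketch of the wrapper this commit introduces (later replaced by the
built-in tar filter, see below), assuming Python's tarfile.data_filter:

```python
import logging
import tarfile

_LOGGER = logging.getLogger(__name__)


def backup_data_filter(
    member: tarfile.TarInfo, dest_path: str
) -> tarfile.TarInfo | None:
    """Apply tarfile.data_filter, but skip bad members instead of aborting."""
    try:
        return tarfile.data_filter(member, dest_path)
    except tarfile.FilterError as err:
        # Returning None tells extraction to omit just this member, so
        # old backups with e.g. absolute symlinks still restore.
        _LOGGER.warning("Skipping unsafe backup member %s: %s", member.name, err)
        return None
```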
* Simplify security tests to test backup_data_filter directly
Test the public backup_data_filter function with plain tarfile
extraction instead of going through Backup internals. Removes
protected-access pylint warnings and unnecessary coresys setup.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Switch to tar filter instead of custom data filter wrapper
Replace backup_data_filter (which wrapped data_filter and skipped
problematic entries) with the built-in tar filter. The tar filter
rejects path traversal and absolute names while preserving uid/gid
and file permissions, which is important for add-ons running as
non-root users.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Apply suggestions from code review
Co-authored-by: Erik Montnemery <erik@montnemery.com>
* Use BackupInvalidError instead of BackupError for tarfile.TarError
Make sure FilterErrors lead to BackupInvalidError instead of BackupError,
as they are not related to the backup process itself but rather to the
integrity of the backup data.
* Improve test coverage and use pytest.raises
* Only make FilterError a BackupInvalidError
* Add test case for FilterError during Home Assistant Core restore
* Add test cases for Add-ons
* Fix pylint warnings
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Erik Montnemery <erik@montnemery.com>
* Unify Core user listing with HomeAssistantUser model
Replace the ingress-specific IngressSessionDataUser with a general
HomeAssistantUser dataclass that models the Core config/auth/list WS
response. This deduplicates the WS call (previously in both auth.py
and module.py) into a single HomeAssistant.list_users() method.
- Add HomeAssistantUser dataclass with fields matching Core's user API
- Remove get_users() and its unnecessary 5-minute Job throttle
- Auth and ingress consumers both use HomeAssistant.list_users()
- Auth API endpoint uses typed attribute access instead of dict keys
- Migrate session serialization from legacy "displayname" to "name"
- Accept both keys in schema/deserialization for backwards compat
- Add test for loading persisted sessions with legacy displayname key
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Tighten list_users() to trust Core's auth/list contract
Core's config/auth/list WS command always returns a list, never None.
Replace the silent `if not raw: return []` (which also swallowed empty
lists) with an assert, remove the dead AuthListUsersNoneResponseError
exception class, and document the HomeAssistantWSError contract in the
docstring.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Remove | None from async_send_command return type
The WebSocket result is always set from data["result"] in _receive_json,
never explicitly to None. Remove the misleading | None from the return
type of both WSClient and HomeAssistantWebSocket async_send_command, and
drop the now-unnecessary assert in list_users.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Use HomeAssistantWSConnectionError in _ensure_connected
_ensure_connected and connect_with_auth raise on connection-level
failures, so use the more specific HomeAssistantWSConnectionError
instead of the broad HomeAssistantWSError. This allows callers to
distinguish connection errors from Core API errors (e.g. unsuccessful
WebSocket command responses). Also document that _ensure_connected can
propagate HomeAssistantAuthError from ensure_access_token.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Remove user list cache from _find_user_by_id
Drop the _list_of_users cache to avoid stale auth data in ingress
session creation. The method now fetches users fresh each time and
returns None on any API error instead of serving potentially outdated
cached results.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Add periodic progress logging during initial Core installation
Log installation progress every 15 seconds while downloading the
Home Assistant Core image during initial setup (landing page to core
transition). Uses asyncio.Event with wait_for timeout to produce
time-based logs independent of Docker pull events, ensuring visibility
even when the network stalls.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
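The time-based loop, sketched; get_progress is a hypothetical callable
standing in for the pull-progress state:

```python
import asyncio
import logging

_LOGGER = logging.getLogger(__name__)


async def log_install_progress(finished: asyncio.Event, get_progress) -> None:
    """Log every 15s until the pull completes, even if Docker events stall."""
    while not finished.is_set():
        try:
            await asyncio.wait_for(finished.wait(), timeout=15)
        except TimeoutError:  # asyncio.TimeoutError on Python < 3.11
            _LOGGER.info(
                "Home Assistant Core install in progress: %.0f%%", get_progress()
            )
```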
* Add test coverage
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Jan Čermák <sairon@users.noreply.github.com>
* Fix getting Supervisor IP address in testing
Newer Docker versions (probably newer than 29.x) no longer have a global
IPAddress attribute under .NetworkSettings. Instead there is a
network-specific map under Networks; in our case the hassio network has
the relevant IP address. This network-specific map already existed
before, hence the new inspect parsing works for old as well as new
Docker versions.
While at it, also adjust the test fixture.
* Actively wait for hassio IPAddress to become valid
* Remove blocking I/O added to import_image
* Add scanned modules to extra blockbuster functions
* Use same cast avoidance approach in export_image
* Remove unnecessary local image_writer variable
* Remove unnecessary local image_tar_stream variable
---------
Co-authored-by: Stefan Agner <stefan@agner.ch>
* Raise HomeAssistantWSError when Core WebSocket is unreachable
Previously, async_send_command silently returned None when Home Assistant
Core was not reachable, leading to misleading error messages downstream
(e.g. "returned invalid response of None instead of a list of users").
Refactor _can_send to _ensure_connected which now raises
HomeAssistantWSError on connection failures while still returning False
for silent-skip cases (shutdown, unsupported version). async_send_message
catches the exception to preserve fire-and-forget behavior.
Update callers that don't handle HomeAssistantWSError: _hardware_events
and addon auto-update in tasks.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Simplify HomeAssistantWebSocket command/message distinction
The WebSocket layer had a confusing split between "messages" (fire-and-forget)
and "commands" (request/response) that didn't reflect Home Assistant Core's
architecture where everything is just a WS command.
- Remove dead WSClient.async_send_message (never called)
- Rename async_send_message → _async_send_command (private, fire-and-forget)
- Rename send_message → send_command (sync wrapper)
- Simplify _ensure_connected: drop message param, always raise on failure
- Simplify async_send_command: always raise on connection errors
- Remove MIN_VERSION gating (minimum supported Core is now 2024.2+)
- Remove begin_backup/end_backup version guards for Core < 2022.1.0
- Add debug logging for silently ignored connection errors
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Wait for Core to come up before backup
This is crucial since the WebSocket command to Core now fails with the
new error handling if Core is not running yet.
* Wait for Core install job instead
* Use CLI to fetch jobs instead of Supervisor API
The Supervisor API needs an authentication token, which is not available
at this point in the workflow. Instead of fetching the token, we can use
the CLI, which is available in the container.
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
When an addon updates from having no ingress to having ingress, the
ingress token map was never rebuilt. Both update() and rebuild() called
_check_ingress_port() to assign a dynamic port but skipped the
sys_ingress.reload() call that registers the token. This caused
Ingress.get() to return None, resulting in a 503 error.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
* Add D-Bus tolerant enum base classes to prevent crashes on unknown values
D-Bus services (systemd, NetworkManager, RAUC, UDisks2) can introduce
new enum values at any time via OS updates. Standard Python enum
construction raises ValueError for unknown values, which would crash
the Supervisor.
Introduce DBusStrEnum and DBusIntEnum base classes that use Python's
_missing_ hook to create pseudo-members for unknown values. These
pseudo-members pass isinstance checks (satisfying typeguard), preserve
the original value, don't pollute __members__, and report unknown
values to Sentry (deduplicated per class+value) for observability.
Migrate 17 D-Bus enums in dbus/const.py and udisks2/const.py to the
new base classes. Enums only sent TO D-Bus (StopUnitMode, StartUnitMode,
etc.) are left unchanged. Remove the manual try/except workaround in
NetworkInterface.type now that DBusIntEnum handles it automatically.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Add explicit enum conversions for systemd-resolved D-Bus properties
The resolved properties (dns_over_tls, dns_stub_listener, dnssec, llmnr,
multicast_dns, resolv_conf_mode) were returning raw string values from
D-Bus without converting to their declared enum types. This would fail
runtime type checking with typeguard.
Now safe to add explicit conversions since these enums use DBusStrEnum,
which tolerates unknown values from D-Bus without crashing.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Avoid blocking I/O in D-Bus enum Sentry reporting
Move sentry_sdk.capture_message out of the event loop by adding a
fire_and_forget_capture_message helper that offloads the call to the
executor when a running loop is detected.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Handle exceptions when reporting message to Sentry
* Narrow typing of reported values
Use str/int explicitly since that is what the two existing Enum classes
can actually report.
* Adjust test style
* Apply suggestions from code review
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Add a type check for device options in AddonOptions._single_validate
to ensure the value is a string before passing it to Path(). When a
non-string value (e.g. a dict) is provided for a device option, this
now raises a proper vol.Invalid error instead of an unhandled TypeError.
Fixes SUPERVISOR-175H
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
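The guard, sketched outside the AddonOptions class; the error text is
illustrative:

```python
from pathlib import Path

import voluptuous as vol


def validate_device_option(value) -> Path:
    if not isinstance(value, str):
        raise vol.Invalid(
            f"Device option must be a string, got {type(value).__name__}"
        )
    return Path(value)
```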
Replace the dynamic `getattr(self.sys_websession, method)(...)` pattern
with the explicit `self.sys_websession.request(method, ...)` call. This
is type-safe and avoids runtime failures from typos in method names.
Also wrap the timeout parameter in `aiohttp.ClientTimeout` for
consistency with the typed `request()` signature.
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
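Before and after, sketched against a generic aiohttp session:

```python
import aiohttp


async def call(session: aiohttp.ClientSession, method: str, url: str, timeout: int):
    # Before: getattr(session, method)(url, ...) - a typo in `method` only
    # surfaces at runtime as an AttributeError.
    # After: the typed request() call takes the method as plain data.
    async with session.request(
        method, url, timeout=aiohttp.ClientTimeout(total=timeout)
    ) as response:
        return await response.json()
```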
* Fix environment variable type errors by converting IP addresses to strings
Environment variables must be strings, but IPv4Address and IPv4Network
objects were being passed directly to container environment dictionaries,
causing typeguard validation errors.
Changes:
- Convert IPv4Address objects to strings in homeassistant.py for
SUPERVISOR and HASSIO environment variables
- Convert IPv4Network object to string in observer.py for
NETWORK_MASK environment variable
- Update tests to expect string values instead of IP objects in
environment dictionaries
- Remove unused ip_network import from test_observer.py
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
* Use explicit string conversion for extra_hosts IP addresses
Use the !s format specifier in the f-string to explicitly convert
IPv4Address objects to strings when building the ExtraHosts list.
While f-strings implicitly convert objects to strings, using !s makes
the conversion explicit and consistent with the environment variable
fixes in the previous commit.
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
---------
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
Add the Docker storage driver (e.g., overlay2, vfs) to the context
information sent with Sentry error reports. This helps correlate
issues with specific storage backends and improves debugging of
Docker-related problems.
The storage driver is now included in both SETUP and RUNNING state
error reports under contexts.docker.storage_driver.
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
* Fix MCP API proxy support for streaming and headers
This commit fixes two issues with using Core's MCP API (core/api/mcp)
through the API proxy:
1. **Streaming support**: The proxy now detects text/event-stream responses
and properly streams them instead of buffering all data. This is required
for MCP's Server-Sent Events (SSE) transport.
2. **Header forwarding**: Added MCP-required headers to the forwarded headers:
- Accept: Required for content negotiation
- Last-Event-ID: Required for resuming broken SSE connections
- Mcp-Session-Id: Required for session management across requests
The proxy now also preserves MCP-related response headers (Mcp-Session-Id)
and sets X-Accel-Buffering to "no" for streaming responses to prevent
buffering by intermediate proxies.
Tests added to verify:
- MCP headers are properly forwarded to Home Assistant
- Streaming responses (text/event-stream) are handled correctly
- Response headers are preserved
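The SSE pass-through in outline, assuming aiohttp on both sides (upstream is
the ClientResponse from Core); headers and buffer size are illustrative:

```python
from aiohttp import web


async def proxy_response(request: web.Request, upstream) -> web.StreamResponse:
    if upstream.content_type == "text/event-stream":
        response = web.StreamResponse(status=upstream.status)
        response.content_type = "text/event-stream"
        # Prevent intermediate proxies from buffering the stream.
        response.headers["X-Accel-Buffering"] = "no"
        await response.prepare(request)
        async for chunk in upstream.content.iter_chunked(4096):
            await response.write(chunk)
        return response
    return web.Response(
        status=upstream.status,
        body=await upstream.read(),
        content_type=upstream.content_type,
    )
```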
* Refactor: reuse stream logic for SSE responses (#3)
* Fix ruff format + cover streaming payload error
* Fix merge error
* Address review comments (headers / streaming proxy) (#4)
* Address review: header handling for streaming/non-streaming
* Forward MCP-Protocol-Version and Origin headers
* Do not forward Origin header through API proxy (#5)
---------
Co-authored-by: Stefan Agner <stefan@agner.ch>
The CpuArch enum was being used inconsistently throughout the codebase,
with some code expecting enum values and other code expecting strings.
This caused type checking issues and potential runtime errors.
Changes:
- Fix match_base() to return CpuArch enum instead of str
- Add explicit string conversions using !s formatting where arch values
are used in f-strings (build.py, model.py)
- Convert CpuArch to str explicitly in contexts requiring strings
(docker/addon.py, misc/filter.py)
- Update all tests to use CpuArch enum values instead of strings
- Update test mocks to return CpuArch enum values
This ensures type consistency and improves MyPy type checking accuracy
across the architecture detection and management code.
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
The manifest fetcher was using docker.io as the registry API endpoint,
but Docker Hub's actual registry API is at registry-1.docker.io. When
trying to access https://docker.io/v2/..., requests were being redirected
to https://www.docker.com/ (the marketing site), which returned HTML
instead of JSON, causing manifest fetching to fail.
This matches exactly what Docker itself does internally - see
daemon/pkg/registry/config.go:49 where Docker hardcodes
DefaultRegistryHost = "registry-1.docker.io" for registry operations.
Changes:
- Add DOCKER_HUB_API constant for the actual API endpoint
- Add _get_api_endpoint() helper to translate docker.io to
registry-1.docker.io for HTTP API calls
- Update _get_auth_token() and _fetch_manifest() to use the API endpoint
- Keep docker.io as the registry identifier for naming and credentials
- Add tests to verify the API endpoint translation
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
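The translation helper in outline; constant names follow the commit:

```python
DOCKER_HUB = "docker.io"
DOCKER_HUB_API = "registry-1.docker.io"


def _get_api_endpoint(registry: str) -> str:
    """Return the host serving the v2 registry API.

    docker.io stays the identifier for naming and credentials, but the
    actual Docker Hub registry API lives at registry-1.docker.io.
    """
    return DOCKER_HUB_API if registry == DOCKER_HUB else registry


assert _get_api_endpoint("docker.io") == "registry-1.docker.io"
assert _get_api_endpoint("ghcr.io") == "ghcr.io"
```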
* Migrate info and events to aiodocker
* Migrate container logs to aiodocker
* Fix dns plugin loop test
* Fix mocking for docker info
* Fixes from feedback
* Harden monitor error handling
* Deleted failing tests because they were not useful
* Fix Docker exec exit code handling by using detach=False
When executing commands inside containers using `container_run_inside()`,
the exec metadata did not contain a valid exit code because `detach=True`
starts the exec in the background and returns immediately before completion.
Root cause: With `detach=True`, Docker's exec start() returns an awaitable
that yields output bytes. However, the await only waits for the HTTP/REST
call to complete, NOT for the actual exec command to finish. The command
continues running in the background after the HTTP response is received.
Calling `inspect()` immediately after returns `ExitCode: None` because
the exec hasn't completed yet.
Solution: Use `detach=False` which returns a Stream object that:
- Automatically waits for exec completion by reading from the stream
- Provides actual command output (not just empty bytes)
- Makes exit code immediately available after stream closes
- No polling needed
Changes:
- Switch from `detach=True` to `detach=False` in container_run_inside()
- Read output from stream using async context manager
- Add defensive validation to ensure ExitCode is never None
- Update tests to mock the Stream interface using AsyncMock
- Add debug log showing exit code after command execution
Fixes #6518
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
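A sketch against aiodocker's exec API as described above; error handling
trimmed:

```python
async def container_run_inside(container, command: str) -> tuple[int, bytes]:
    execute = await container.exec(command, stdout=True, stderr=True)
    output = b""
    # detach=False yields a Stream; reading it to EOF waits for the
    # command to actually finish (with detach=True the await returned as
    # soon as the HTTP call completed, so inspect() saw ExitCode: None).
    async with execute.start(detach=False) as stream:
        while message := await stream.read_out():
            output += message.data
    exit_code = (await execute.inspect()).get("ExitCode")
    if exit_code is None:
        raise RuntimeError("Exec finished without an exit code")
    return exit_code, output
```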
* Address review feedback
---------
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
The Supervisor's /core/api proxy previously only supported GET and POST
methods, returning 405 Method Not Allowed for DELETE requests. This
prevented addons from calling Home Assistant Core REST API endpoints
that require DELETE methods, such as deleting automations, scripts,
or scenes.
The underlying proxy implementation already supported passing through
any HTTP method via request.method.lower(), so only the route
registration was needed.
Fixes#6509
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
The aiodocker 0.25.0 upgrade (PR #6448) changed how DockerError handles
the message parameter. The library now extracts the message string from
Docker API JSON responses before passing it to DockerError, rather than
passing the entire dict.
The port conflict detection tests were written before this change and
incorrectly passed dicts to DockerError. This caused TypeErrors when
the port conflict detection code tried to match err.message with a
regex, expecting a string but receiving a dict.
Update both test_addon_start_port_conflict_error and
test_observer_start_port_conflict to pass message strings directly,
matching the real aiodocker 0.25.0 behavior.
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
* Map port conflict on start error into a known error
* Apply suggestions from code review
* Run ruff format
---------
Co-authored-by: Stefan Agner <stefan@agner.ch>
* Use count-based progress for Docker image pulls
Refactor Docker image pull progress to use a simpler count-based approach
where each layer contributes equally (100% / total_layers) regardless of
size. This replaces the previous size-weighted calculation that was
susceptible to progress regression.
The core issue was that Docker rate-limits concurrent downloads (~3 at a
time) and reports layer sizes only when downloading starts. With size-
weighted progress, large layers appearing late would cause progress to
drop dramatically (e.g., 59% -> 29%) as the total size increased.
The new approach:
- Each layer contributes equally to overall progress
- Per-layer progress: 70% download weight, 30% extraction weight
- Progress only starts after first "Downloading" event (when layer
count is known)
- Always caps at 99% - job completion handles final 100%
This simplifies the code by moving progress tracking to a dedicated
module (pull_progress.py) and removing complex size-based scaling logic
that tried to account for unknown layer sizes.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
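The count-based calculation in outline (70% download / 30% extract per layer,
capped at 99%); the real module feeds this from Docker pull events:

```python
def overall_progress(layers: list[dict]) -> float:
    """Each layer contributes equally to overall progress, regardless of size."""
    if not layers:
        return 0.0
    share = 100.0 / len(layers)
    total = sum(
        share * (0.7 * layer.get("download", 0.0) + 0.3 * layer.get("extract", 0.0))
        for layer in layers
    )
    return min(99.0, total)  # job completion reports the final 100%


# Two layers: one fully pulled, one halfway through its download.
assert overall_progress(
    [{"download": 1.0, "extract": 1.0}, {"download": 0.5, "extract": 0.0}]
) == 67.5
```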
* Exclude already-existing layers from pull progress calculation
Layers that already exist locally should not count towards download
progress since there's nothing to download for them. Only layers that
need pulling are included in the progress calculation.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Add registry manifest fetcher for size-based pull progress
Fetch image manifests directly from container registries before pulling
to get accurate layer sizes upfront. This enables size-weighted progress
tracking where each layer contributes proportionally to its byte size,
rather than equal weight per layer.
Key changes:
- Add RegistryManifestFetcher that handles auth discovery via
WWW-Authenticate headers, token fetching with optional credentials,
and multi-arch manifest list resolution
- Update ImagePullProgress to accept manifest layer sizes via
set_manifest() and calculate size-weighted progress
- Fall back to count-based progress when manifest fetch fails
- Pre-populate layer sizes from manifest when creating layer trackers
The manifest fetcher supports ghcr.io, Docker Hub, and private
registries by using credentials from Docker config when available.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Clamp progress to 100 to prevent floating point precision issues
Floating point arithmetic in weighted progress calculations can produce
values slightly above 100 (e.g., 100.00000000000001). This causes
validation errors when the progress value is checked.
Add min(100, ...) clamping to both size-weighted and count-based
progress calculations to ensure the result never exceeds 100.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Use sys_websession for manifest fetcher instead of creating new session
Reuse the existing CoreSys websession for registry manifest requests
instead of creating a new aiohttp session. This improves performance
and follows the established pattern used throughout the codebase.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Make platform parameter required and warn on missing platform
- Make platform a required parameter in get_manifest() and _fetch_manifest()
since it's always provided by the calling code
- Return None and log warning when requested platform is not found in
multi-arch manifest list, instead of falling back to first manifest
which could be the wrong architecture
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Log manifest fetch failures at warning level
Users will notice degraded progress tracking when manifest fetch fails,
so log at warning level to help diagnose issues.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Add pylint disable comments for protected access in manifest tests
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Separate download_current and total_size updates in pull progress
Update download_current and total_size independently in the DOWNLOADING
handler. This ensures download_current is updated even when total is
not yet available.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
* Reject invalid platform format in manifest selection
---------
Co-authored-by: Claude <noreply@anthropic.com>
During system shutdown (reboot/poweroff), the watchdog was incorrectly
detecting the Home Assistant Core container as failed and attempting to
restart it. This occurred because Docker was stopping all containers in
parallel with Supervisor's own shutdown sequence, causing the watchdog
to trigger while add-ons were still being stopped.
This led to an abrupt termination of Core before it could cleanly shut
down its SQLite database, resulting in a warning on the next startup:
"The system could not validate that the sqlite3 database was shutdown
cleanly".
The fix registers a supervisor state change listener that unregisters
the watchdog when entering any shutdown state (SHUTDOWN, STOPPING, or
CLOSE). This prevents restart attempts during both user-initiated
reboots (via API) and external shutdown signals (Docker SIGTERM,
console reboot commands).
Since SHUTDOWN, STOPPING, and CLOSE are terminal states with no reverse
transition back to RUNNING, no re-registration logic is needed.
Fixes#6511
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
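The listener in outline; CoreState values mirror Supervisor's, the watchdog
handle is illustrative:

```python
from enum import Enum


class CoreState(str, Enum):
    RUNNING = "running"
    SHUTDOWN = "shutdown"
    STOPPING = "stopping"
    CLOSE = "close"


SHUTDOWN_STATES = {CoreState.SHUTDOWN, CoreState.STOPPING, CoreState.CLOSE}


def on_supervisor_state_change(state: CoreState, watchdog) -> None:
    # These are terminal states with no transition back to RUNNING, so a
    # one-way unregister is enough; no re-registration logic is needed.
    if state in SHUTDOWN_STATES:
        watchdog.unregister()
```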