operating-system

mirror of https://github.com/home-assistant/operating-system.git synced 2026-02-15 07:29:08 +00:00

Author	SHA1	Message	Date
Stefan Agner	562471166e	Reenable systemd-resolved stub resolver Enable the systemd-resolved stub resolver and make it available on the hassio host network interface (172.30.32.1). This allows using systemd-resolved directly from all containers. Note that this makes /etc/resolv.conf point to the stub resolver running at 127.0.0.53 by default. This stub resolver isn't reachable from within containers. However, Docker does recognize this situation [1] and falls back to the alternate path at /run/systemd/resolve/resolv.conf, which is what /etc/resolv.conf is today. So this should not affect the initial /etc/resolv.conf in containers in practice. This will, however, bind to port 53 and may affect add-ons that attempt to use that port. Add-ons should not bind to 127.0.0.53 or the hassio host network (172.30.32.1). [1] https://github.com/moby/moby/blob/v28.0.4/libnetwork/internal/resolvconf/resolvconf_path.go#L51C32-L51C45	2025-05-27 12:29:39 +02:00
Jan Čermák	2916a1c247	Relocate HAOS Systemd drop-ins to /usr/lib/systemd (#3582 ) * Relocate HAOS Systemd drop-ins to /usr/lib/systemd With some exceptions, Systemd drop-ins overriding default unit configuration have been placed in `/etc/systemd/system`. This is meant for user overrides of those, or per `man 5 systemd.unit` for "system units created by the administrator". Relocate all of these to `/usr/lib/systemd` which should be used as path for units "installed by the distribution package manager" which is closer to what we're trying to achieve. This will make it easier to detect changes to unit files once we enable the possibility to edit the content of /etc. * Patch systemd-timesyncd.service instead of replacing it fully	2024-09-12 12:47:22 +02:00
Jan Čermák	b0bc71a835	Reduce log spam from Docker health checks (#3491 ) Reduce verbosity from deactivated Docker mounts, triggered by the Docker healthcheck. These messages do not carry any value for us and logs supplied by users are often spammed mostly with these. Moreover, they sometimes cause confusion that something is wrong, see for example #3021. Unfortunately, it's not possible to use LogFilterPatterns= here, because it's not applied to these messages, as explicitly said in the docs: Filtering is based on the unit for which LogFilterPatterns= is defined meaning log messages coming from systemd(1) about the unit are not taken into account. runc 1.2.0 supposedly should fix this, but it's unclear when it would be available, so let's stick to this solution (reducing verbosity from debug to notice for all units `run-docker-*.mount`) for the time being.	2024-07-24 16:34:37 +02:00
Stefan Agner	689faa0260	Improve service ordering for udisks2.service (#2632 ) UDisks2 requires D-Bus and the systemd-journald, hence add the two sockets as a requirement and order the service after them.	2023-07-05 02:40:50 +02:00
Stefan Agner	276f531f35	Avoid waiting for time synchronization too early (#2594 ) In case a system takes a bit longer to boot (e.g. due to SWAP initialization on first boot, especially on a system with lots of memory and not very fast storage, e.g. an ODROID-M1 using an SD card) we might time out waiting for time synchronization before the time synchronization service even got started. By ordering the systemd-time-wait-sync.service after the network is online, the timeout of this service should be started much later. With that the systemd-time-wait-sync.service shouldn't time out any longer.	2023-06-14 00:07:25 +02:00
Stefan Agner	9261843a27	Make sure rpcbind gets started after systemd-tmpfiles is ready (#2582 )	2023-06-10 00:43:35 +02:00
Stefan Agner	1edb5c8c9e	Limit systemd-journald log size to 500MB (#2226 ) (#2435 )	2023-03-28 18:18:27 +02:00
Stefan Agner	a8f6f7aa43	Don't kill ssh connection on OOM (#2424 ) By default systemd kills the service which causes an OOM. That makes sense for a typical service, however, for SSH we don't want this behavior: The connection should continue, just the command which caused OOM should be killed.	2023-03-23 21:45:57 +01:00
Stefan Agner	66c15adbbf	Move Docker configuration to daemon.json (#2116 ) This is more readable than passing arguments to the daemon directly. It also shortens the ExecStart command significantly, which is stored in every log entry in systemd-journald.	2022-09-07 19:13:47 +02:00
Stefan Agner	ed554f2a39	Check free disk space before starting Docker (#2097 ) It seems that Docker can fail to start if there is no space left on the device. Try to free up some space in that case by asking journald to limit its size to 256MiB. This should work for any storage larger than ~2.5GiB (as the journal's maximum size is 10% of the disk size). It still should leave enough logs to diagnose problems if necessary. Note: We could also limit the size of the journal in the first place, but that isn't sustainable: Once that space is used up, we run into the same problem again. By only asking journalctl to free up if necessary, we kinda (miss)use the journal as a way to "reserve" some space which we can free up at boot if necessary.	2022-08-31 23:04:23 +02:00
Stefan Agner	cd5e42341d	Start dropbear earlier (#2083 ) This can be helpful when debugging HAOS issues. Dropbear is only started for users which actually enabled it by configuring a SSH key, so this change won't have an effect for most people.	2022-08-25 00:09:13 +02:00
Stefan Agner	ea5acb0950	Fix delaying systemd-timesyncd start correctly (#2082 ) Unfortunately, orderings like Before= cannot be overridden by vendor settings. This is mentioned in "Example 2. Overriding vendor settings" on https://www.freedesktop.org/software/systemd/man/systemd.unit.html. Correctly fix ordering by overriding the entire unit.	2022-08-24 23:02:09 +02:00
Stefan Agner	f8c8198bb9	Fix delaying systemd-timesyncd start (#2069 ) * Fix delaying systemd-timesyncd Setting WantedBy=time-sync.target in a service.d config file does not clear previous assignments of WantedBy. This caused the services to still be pulled in by the sysinit.target, causing an ordering cycle and the system to not start essential services. * Remove sysinit.target from Before ordering	2022-08-18 15:51:07 +02:00
Stefan Agner	7a693bed46	Delay systemd-timesyncd start after network is deemed online (#2068 ) With commit `2d3119ef22` ("Delay Supervisor start until time has been synchronized (#1360)") systemd-time-wait-sync.service got enabled, which waits until systemd-timesyncd synchronizes time with an NTP server. By default systemd-timesyncd.service and systemd-time-wait-sync.service are pulled in by sysinit.target. This starts the services before full network connectivity is established. The first synchronization fails and systemd-timesyncd only retries after a ratelimit mechanism times out. This causes a delay of 30s during startup. While systemd-timesyncd has a mechanism to (re)try time synchronization when network becomes online, it seems that those only work properly when systemd-networkd is used, see also https://github.com/systemd/systemd/issues/24298. Simply reordering systemd-timesyncd.service after network-online.target does not work as it causes circular dependencies (NetworkManager itself depends ultimately on the sysinit.target). With this change, the services are only pulled in by time-sync.target. That allows ordering the service after network-online.target. With that the first synchronization succeeds. This mechanism also works when an NTP server is provided through DHCP. In that case, the systemd-timesyncd service is started by the dispatch script /usr/lib/NetworkManager/dispatcher.d/10-ntp before systemd even considers starting the service. Tests show that the default fallback NTP is not contacted, only the DHCP provided service.	2022-08-17 18:51:35 +02:00
Pascal Vizeli	05778a2d32	Support IPv6 NAT (#2051 ) * Support IPv6 NAT * Add experimental * Enable IPv6 NAT in kernel configuration Co-authored-by: Stefan Agner <stefan@agner.ch>	2022-08-12 17:43:49 +02:00
Stefan Agner	7729db1e11	Synchronize network time quicker on bootup (#2057 ) Currently systemd-timesyncd tries to connect to the NTP server quite early at boot-up. At this time the network connection has not been established yet. This causes resolving the NTP server to fail and a rate limit kicks in which makes systemd-timesyncd wait for 30s until the next attempt. Lowering the retry attempt to 10s makes systemd-timesyncd connect shortly after. Note: The rate limit is 10 attempts per 10s. Because the attempts are immediately exhausted, lowering the connection retry attempt below 10s adds no benefit. See also: https://github.com/systemd/systemd/issues/24298	2022-08-12 17:43:26 +02:00
Stefan Agner	2d8ec0c8ee	Use dbus-broker as default D-Bus broker (#2053 ) * Bump buildroot * buildroot 99b62b8bd3...97287bbebf (3): > package/dbus-broker: bump to release 32 > package/dbus-broker: new package > Merge pull request #3 from home-assistant/2022.02.x-haos-cgroup-v2 * Use dbus-broker as default D-Bus broker The dbus-broker (Linux D-Bus Message Broker) aims to be a high performance and reliable D-Bus broker which can be used as a drop-in replacement for the reference implementation D-Bus broker. In tests it showed significantly better performance especially when routing BLE messages. * Allow dbus-broker to start early For HAOS device wipe feature we need haos-agent.service and udisks2.service early. Both require a working D-Bus broker. The options PrivateTmp and PrivateDevices add additional After= orderings which don't allow dbus-broker to be started early. * Fix D-Bus dependency D-Bus services should just depend on dbus.socket.	2022-08-10 17:01:02 +02:00
Stefan Agner	5d0a61fafc	Set lower OOM Score for Supervisor (#2050 ) * Set lower OOM Score for Supervisor * Adjust OOM for Docker daemon	2022-08-10 13:56:45 +02:00
Stefan Agner	4d9b604c04	Use Control Group v2 (#1329 ) * Disable real-time scheduling It seems that Linux' cgroup v2 currently does not support RT scheduling. * Remove Supervisor RT support flag With CGroups v2 we can no longer support CPU resource allocation for realtime scheduling. * Bump OS Agent to 1.3.0 for CGroups v2 support	2022-08-09 11:29:12 +02:00
Stefan Agner	26bca2666d	Remove key.json file if it appears to be corrupted (#1706 ) (#1988 ) * Remove key.json file if it appears to be corrupted (#1706) * Check with jq if key.json is parsable	2022-06-25 09:30:20 -07:00
Stefan Agner	a9c2eedc71	Handle long-press system keys only (#1874 ) Only handle long-press system power, reboot and sleep keys.	2022-04-27 20:33:39 +02:00
redgryphon	f62fee2ff7	Add support for NTP configuration via DHCP (fixes #689 ) (#1798 ) * Add support for NTP configuration via DHCP. * Default fallback NTP pool is the Cloudflare's one	2022-03-21 00:40:43 +01:00
Stefan Agner	ce566167f2	Improve OS logging (#1590 ) * Avoid duplicate log entries So far the hassos-supervisor.service starts the hassos-supervisor script which in turn attaches to the Supervisor container. This causes stdout and stderr to be forwarded to the service unit, which in turn logs it in the journal. However, Docker too logs all stdout/stderr to the journal through the systemd-journald log driver. Do not attach to the Supervisor container to avoid logging the Supervisor twice. Note that this no longer forwards signals to the container. However, the hassos-supervisor.service uses the ExecStop= setting to make sure the container gets gracefully stopped. * Use image and container name as syslog identifier By default Docker uses the container id as syslog identifier. This leads to log messages which cannot easily be attributed to a particular container (since the container id is a random hex string). Use the image and container name as syslog identifier. Note that the Docker journald log driver still stores the container id as a separate field (CONTAINER_ID), in case the particular instance needs to be tracked.	2021-10-13 16:12:05 +02:00
Stefan Agner	40b4d5ca2e	Start Home Assistant CLI on tty1 without login (#1366 ) * Start ha-cli on tty1 instead of a getty Instead of starting a getty start the ha-cli directly. This will show the banner right on startup with the important information such as IP address of the instance or the URL to reach it. * Use default shell as root shell instead of HA CLI Instead of using the ha-cli.sh script as login shell use the regular shell. Amongst other things, this allows to run VS Code devcontainers remotely via SSH or using scp. The HA CLI is still available using the `ha` command.	2021-05-19 13:18:02 +02:00
Stefan Agner	2d3119ef22	Delay Supervisor start until time has been synchronized (#1360 ) * Enable systemd-time-wait-sync.service by default Enable the systemd-time-wait-sync.service by default. This allows using the time-sync.target, which makes sure services only get started once the time is synchronized. * Make sure time is synchronized when starting hassos-supervisor.service Use the time-sync.target to make sure that the Supervisor gets started after the time has been synchronized. * Set timeout for systemd-time-wait-sync.service Don't delay startup forever in case time synchronization doesn't work. This allows booting the system even without Internet connection.	2021-05-12 17:47:42 +02:00
Stefan Agner	82db02756c	Make D-Bus and udisks2 available early (#1291 ) * Disable systemd-logind support for udisks2 Currently udisks2 uses systemd-logind to prevent the system from rebooting or similar operations while udisks operations are ongoing. Unfortunately this stops us from using udisks2 during early boot since systemd-logind is not ready at this point. Make the dependency configurable so we can opt-out of using systemd-logind. * Make dbus.service/socket and udisks2.service/socket available early Disable default dependencies. This avoids those services being ordered after sysinit.target, and makes them available before local-fs.target is reached. All mounts like mnt-data.mount are ordered before local-fs.target, so breaking this dependency allows using D-Bus before mounting local file systems. This seems fine when using the system bus directly from /run (instead of /var/run, which is anyway a symlink to /run normally). It seems that udisks misses /var/lib/udisks2 but it seems not to be required for the features used so far.	2021-03-31 21:46:02 +02:00
Stefan Agner	ab19c8c6cf	Improve dropbear.service to avoid failed state after stop (#1286 ) So far the exit code has been evaluated, which seems to be non-zero even with a regular term signal. With that systemd assumed the service is in a failed state, when in fact this seems the regular behavior of dropbear when shutting it down.	2021-03-24 19:13:43 +01:00
Stefan Agner	a15671a530	Remove unnecessary avahi-daemon.service configs (#1279 ) The avahi-daemon.service was removed a while ago, so these extra systemd configurations are no longer necessary.	2021-03-18 16:06:51 +01:00
Stefan Agner	82a40de894	Add --cpu-rt-runtime to allow Docker allocate real-time CPU time (#1235 ) (#1236 ) * Add --cpu-rt-runtime to allow Docker allocate real-time CPU time (#1235) * Enable Supervisor's CPU bandwidth allocation feature (#1235) Since we have CONFIG_RT_GROUP_SCHED enabled in the Home Assistant OS kernel the Supervisor needs to enable CPU bandwidth allocation for Add-Ons which need real-time scheduling. Set the appropriate environment variable.	2021-02-25 18:40:24 +01:00
Stefan Agner	e3120df5d0	Disable DNS over TLS by default (#1113 ) (#1121 ) It seems that on certain setups the default DNS over TLS mode "opportunistic" causes delays of ~10s when trying to resolve names. This is probably caused by providers and/or firewall setups not properly rejecting connections on port 853. It seems that also other distributions (such as Arch Linux) still disable DNS over TLS currently. Side step issues with DNS over TLS by disabling it for now.	2020-12-22 18:42:58 +01:00
Stefan Agner	c515dd19f2	Don't suspend Home Assistant OS on closed lids (#1093 ) (#1100 ) Old Laptops are a popular choice to run Home Assistant: They have low power consumption, are relatively fast and cheap to come by. However, closing their lid by default puts a Linux system into suspend. This is not what the typical user of Home Assistant OS wants. Ignore lid activity in any state by default.	2020-12-17 16:29:45 +01:00
Stefan Agner	7c25f7c187	Fix DNS resolving (#1004 ) * Add resolved.conf to disable stub resolver and DNSSEC There are Add-Ons which try to bind port 53 on all interfaces including 127.0.0.53. Disable the stub resolver to make them continue working. We don't need the resolver currently anyway. Also disable DNSSEC to make sure the boards can access an NTP time server even when their time is incorrect (since DNSSEC validation may fail). This is a known chicken-egg problem with systemd-resolved/systemd-timesyncd and might be addressed in a future version, at which point we can reenable DNSSEC: https://github.com/systemd/systemd/issues/5873 * Make sure resolve gets added only once to nsswitch.conf Only add resolve to nsswitch.conf if not already present.	2020-11-18 09:56:38 +01:00
Pascal Vizeli	22f7777099	Make getty for serial/tty same (#850 ) * Make getty for serial/tty same * adjust log output	2020-09-06 22:18:19 +02:00
Pascal Vizeli	50176a0e3b	Add support for snapshots/restore on OS level (#801 )	2020-08-03 16:28:08 +02:00
Pascal Vizeli	b3530d1ce1	Allow easy move data partition (#651 ) * Allow easy move data partition * Cleanup handling systemd * Improve handling * fix pipeline * pipeline * fix shell handling * fix scripts * Add bin folder * fix lint * Fix service handling * Fix loading * hide output * Fix handling	2020-05-06 00:00:28 +02:00
Pascal Vizeli	db184bf5d3	Make machine-id persist (#649 ) * Make machine-id persist * update * Adjustments * cleanup * fix cleanups * small cleanup	2020-05-04 15:11:43 +02:00
Pascal Vizeli	4df2ab6013	Use default CloudFlare anycast time service (#629 )	2020-04-21 14:41:13 +02:00
Pascal Vizeli	febc4473d3	New supervisor / Adjust rauc handling (#530 ) * OS: Update supervisor 194 * Remove old rauc mark code * OS: Update supervisor 195 * Adjust dockerd	2020-01-09 23:02:46 +01:00
Pascal Vizeli	41d3f59002	Update Buildroot to 2019.02.3 (#415 ) * Update Buildroot to 2019-02.3 * Fix enter script * Update ova_defconfig * Fix network manager * Remove runc patches * Use same docker version * Fix build * Fix vmtools * Fix depens * Fix handling with tempfiles * Fix permission handling * Fix cp * Cleanup * Fix mounts	2019-06-27 11:58:50 +02:00
Maël Kimmerlin	5e82060124	Adding /etc/modprobe.d folder for module configuration (#397 ) * Add a /etc/modprobe.d folder populated using a config USB stick with folder modprobe * Add the documentation for the modprobe folder	2019-05-08 13:15:48 +02:00
Pascal Vizeli	1d40ffd4f9	OS: Fix boot problem (#370 ) * OS: Fix boot problem * Update var-lib-docker.mount * Update var-lib-docker.mount * Update var-lib-docker.mount	2019-03-18 09:53:22 +01:00
Pascal Vizeli	a21871f4af	OS: Disable coredump (#365 )	2019-02-22 21:24:58 +01:00
Bryan Berg	43828c2555	Migrate from LABEL= to /dev/disk/by-label/* (#318 )	2019-01-28 09:58:40 +01:00
Pascal Vizeli	bc3cc20629	Don't use a bind mount for docker data (#266 )	2018-11-29 15:20:58 +01:00
Pascal Vizeli	dc35208efe	OS: allow set NTP (#252 ) * OS: allow set NTP * Create etc-systemd-timesyncd.conf.mount * Update hassos-config * Update configuration.md * add link * Update Documentation/configuration.md Co-Authored-By: pvizeli <pascal.vizeli@syshack.ch>	2018-11-21 00:28:40 +01:00
Pascal Vizeli	dac76b1632	Allow udev rules & make it persistent (#210 ) * Install default to lib * Create hmip-rfusb.rules * Create etc-udev-rules.d.mount * Update hassos-config * Update configuration.md * Update rauc.sh * Add ln	2018-10-11 20:49:20 +02:00
Johannes Truschnigg	faa371191f	Implement basic ACPI shutdown support via busybox acpid (#209 ) On systems where ACPI support is present as indicated by the presence of /proc/acpi (e.g. on OVA compatible hypervisors), we want to properly shut down the system when the power button is pressed (or the hypervisor simulates this kind of event to the guest machine that executes hassos). This changeset provides the following basic infrastructure for this feature to work as expected: * a systemd service to start acpid, if ACPI support can be assumed * an acpid configuration directory * a trivial shutdown script to invoke when a PWR event is registered	2018-10-10 11:34:06 +02:00
Pascal Vizeli	7cffb1f45b	Native bluetooth support (#91 ) * Add bluez * rpi firmware * add rpi support * Make packages from rpi * Fix build * Fix target * persistent * fix package * Auto enable * Fix parameter * Fix rpi * Fix done	2018-07-10 01:14:07 +02:00
Pascal Vizeli	38c1df36c0	Fix time on overlay (#88 ) * Fix time on overlay * Fix rights * Fix timesync	2018-07-08 00:43:46 +02:00
Pascal Vizeli	c6d0fef0b4	Some cleanup (#84 )	2018-07-06 21:04:48 +02:00
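
The "~2.5GiB" figure in commit ed554f2a39 ("Check free disk space before starting Docker") follows from the two numbers given in its message: journald is asked to shrink to 256MiB, and the journal's maximum size is stated as 10% of the disk size. A minimal sketch of that arithmetic, assuming only those two values; the function and constant names are hypothetical, and any other cap journald may apply is ignored:

```python
MIB = 1024 ** 2
GIB = 1024 ** 3

# Both values are taken from the commit message, not from journald itself.
JOURNAL_LIMIT = 256 * MIB     # size journald is asked to shrink to
JOURNAL_DISK_PERCENT = 10     # journal maximum size as a share of the disk


def journal_cap(disk_bytes: int) -> int:
    """Largest size the journal may reach on a disk of the given size."""
    return disk_bytes * JOURNAL_DISK_PERCENT // 100


def vacuum_can_free_space(disk_bytes: int) -> bool:
    """True if the journal can grow beyond 256MiB, so shrinking it can help."""
    return journal_cap(disk_bytes) > JOURNAL_LIMIT


def break_even_disk_size() -> int:
    """Disk size at which the 10% cap equals the 256MiB limit."""
    return JOURNAL_LIMIT * 100 // JOURNAL_DISK_PERCENT


if __name__ == "__main__":
    print(break_even_disk_size() / GIB)  # 2.5
```

On disks at or below the break-even size the journal can never exceed 256MiB, so asking journald to shrink to that size frees nothing.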


104 Commits