Only pass directories and test_*.py files to pytest --collect-only so
helpers like tests/components/conftest.py and tests/components/common.py
are not treated as explicit collection targets, and bail out with a
clear error if no eligible paths are found instead of running pytest
with no arguments.
cProfile showed 99.6% of split_tests.py wall time was spent in the
single pytest --collect-only subprocess. Fan out the collection across
``os.cpu_count()`` workers; round-robin chunking keeps each batch
roughly equal, and tests/components is expanded one level deeper so
the ~1000 integration subdirectories distribute evenly. Local wall
time dropped from ~132s to ~11s on an 18-core box. Bucket output is
unchanged because we still parse the same pytest -qq output, just
aggregated from multiple invocations.