Merge pull request #355 from gyptazy/release/1.1.9.1

release: Create hotfix release 1.1.9.1
2026-04-06 04:41:58 +02:00 · 2025-10-30 17:53:14 +01:00 · 2025-10-30 17:46:54 +01:00 · 2025-10-30 17:39:42 +01:00 · 2025-10-30 17:18:17 +01:00 · 2025-10-30 07:55:45 +01:00
65 changed files with 1848 additions and 642 deletions
--- a/.changelogs/1.1.2/222_fix_force_type_cast_cpu_metrics_to_int.yml
+++ b/.changelogs/1.1.2/222_fix_force_type_cast_cpu_metrics_to_int.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Force type cast cpu count of guests to int for some corner cases where a str got returned (by @gyptazy). [#222]
--- a/.changelogs/1.1.2/release_meta.yml
+++ b/.changelogs/1.1.2/release_meta.yml
@@ -1 +1 @@
-date: TBD
+date: 2025-05-13
--- a/.changelogs/1.1.3/189_add_reload_function.yml
+++ b/.changelogs/1.1.3/189_add_reload_function.yml
@@ -0,0 +1,2 @@
+added:
+  - Add relaod (SIGHUP) function to ProxLB to reload the configuration (by @gyptazy). [#189]
--- a/.changelogs/1.1.3/232_align_maintenance_mode_proxmox_ha.yml
+++ b/.changelogs/1.1.3/232_align_maintenance_mode_proxmox_ha.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Align maintenance mode with Proxmox HA maintenance mode (by @gyptazy). [#232]
--- a/.changelogs/1.1.3/239_add_optional_delay_time_until_service_starts.yml
+++ b/.changelogs/1.1.3/239_add_optional_delay_time_until_service_starts.yml
@@ -0,0 +1,2 @@
+added:
+  - Add optional wait time parameter to delay execution until the service takes action (by @gyptazy). #239
--- a/.changelogs/1.1.3/241_make_amount_of_parallel_migrations_configurable.yml
+++ b/.changelogs/1.1.3/241_make_amount_of_parallel_migrations_configurable.yml
@@ -0,0 +1,2 @@
+added:
+  - Make the amount of parallel migrations configurable (by @gyptazy). [#241]
--- a/.changelogs/1.1.3/94_cpu_balancing_use_avg_instead_current_consumption.yml
+++ b/.changelogs/1.1.3/94_cpu_balancing_use_avg_instead_current_consumption.yml
@@ -0,0 +1,2 @@
+changed:
+  - Use the average CPU consumption of a guest within the last 60 minutes instead of the current CPU usage (by @philslab-ninja & @gyptazy). [#94]
--- a/.changelogs/1.1.3/release_meta.yml
+++ b/.changelogs/1.1.3/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-06-19
--- a/.changelogs/1.1.4/245_add_guest_pinning_to_groups_of_nodes.yml
+++ b/.changelogs/1.1.4/245_add_guest_pinning_to_groups_of_nodes.yml
@@ -0,0 +1,2 @@
+added:
+  - Allow pinning of guests to a group of nodes (@gyptazy). [#245]
--- a/.changelogs/1.1.4/248_fix_dry_run_combination_balancing_disabled.yml
+++ b/.changelogs/1.1.4/248_fix_dry_run_combination_balancing_disabled.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fixed an issue where balancing was performed in combination of deactivated balancing and dry-run mode (@gyptazy). [#248]
--- a/.changelogs/1.1.4/255_fix_loglevels.yml
+++ b/.changelogs/1.1.4/255_fix_loglevels.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Modified log levels to make output lighter at INFO level (@pmarasse) [#255]
--- a/.changelogs/1.1.4/release_meta.yml
+++ b/.changelogs/1.1.4/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-06-27
--- a/.changelogs/1.1.5/260_allow_custom_api_ports.yml
+++ b/.changelogs/1.1.5/260_allow_custom_api_ports.yml
@@ -0,0 +1,2 @@
+added:
+  - Allow custom API ports instead of fixed tcp/8006 (@gyptazy). [#260]
--- a/.changelogs/1.1.5/release_meta.yml
+++ b/.changelogs/1.1.5/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-07-14
--- a/.changelogs/1.1.6/268_fix_balancing_type_eval.yml
+++ b/.changelogs/1.1.6/268_fix_balancing_type_eval.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fix balancing evaluation of guest types (e.g., VM or CT) (@gyptazy). [#268]
--- a/.changelogs/1.1.6/290_validate_user_token_syntax.yml
+++ b/.changelogs/1.1.6/290_validate_user_token_syntax.yml
@@ -0,0 +1,2 @@
+added:
+  - Add validation for provided API user token id to avoid confusions (@gyptazy). [#291]
--- a/.changelogs/1.1.6/291_fix_stack_trace_no_user_permissions_testing.yml
+++ b/.changelogs/1.1.6/291_fix_stack_trace_no_user_permissions_testing.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fix stacktrace output when validating permissions on non existing users in Proxmox (@gyptazy). [#291]
--- a/.changelogs/1.1.6/295_fix_overprovisioning_first_node_if_anti_affinity_group_has_only_one-member.yml
+++ b/.changelogs/1.1.6/295_fix_overprovisioning_first_node_if_anti_affinity_group_has_only_one-member.yml
@@ -0,0 +1,3 @@
+fixed:
+  - Fix Overprovisioning first node if anti_affinity_group has only one member (@MiBUl-eu). [#295]
+  
--- a/.changelogs/1.1.6/296_fix_validate_node_presence_when_pinning_guests.yml
+++ b/.changelogs/1.1.6/296_fix_validate_node_presence_when_pinning_guests.yml
@@ -0,0 +1,3 @@
+fixed:
+  - Validate for node presence when pinning guests to avoid crashing (@gyptazy). [#296]
+  
--- a/.changelogs/1.1.6/release_meta.yml
+++ b/.changelogs/1.1.6/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-09-04
--- a/.changelogs/1.1.7/304_add_graceful_shutdown_sigint.yml
+++ b/.changelogs/1.1.7/304_add_graceful_shutdown_sigint.yml
@@ -0,0 +1,2 @@
+added:
+  - Add graceful shutdown for SIGINT (e.g., CTRL + C abort). (@gyptazy). [#304]
--- a/.changelogs/1.1.7/305_add_conntrack_support_for_migrations.yml
+++ b/.changelogs/1.1.7/305_add_conntrack_support_for_migrations.yml
@@ -0,0 +1,2 @@
+added:
+  - Add conntrack state aware migrations of VMs (@gyptazy). [#305]
--- a/.changelogs/1.1.7/308_fix_only_validate_valid_jobids.yml
+++ b/.changelogs/1.1.7/308_fix_only_validate_valid_jobids.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fix crash when validating absent migration job ids. (@gyptazy). [#308]
--- a/.changelogs/1.1.7/310_guest-object-names-not-being-evaluated-in-log.yml
+++ b/.changelogs/1.1.7/310_guest-object-names-not-being-evaluated-in-log.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fix guest object names are not being evaluated in debug log. (@gyptazy). [#310]
--- a/.changelogs/1.1.7/release_meta.yml
+++ b/.changelogs/1.1.7/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-09-19
--- a/.changelogs/1.1.8/317_container_image_non_root.yml
+++ b/.changelogs/1.1.8/317_container_image_non_root.yml
@@ -0,0 +1,3 @@
+changed:
+  - Container image does not run as root anymore (@mikaelkrantz945). [#317]
+  - Container image uses venv for running ProxLB (@mikaelkrantz945). [#317]
--- a/.changelogs/1.1.8/318_fix_conntrack_aware_migrations_api_pve8.yml
+++ b/.changelogs/1.1.8/318_fix_conntrack_aware_migrations_api_pve8.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Fix API errors when using conntrack aware migration with older PVE versions (@gyptazy). [#318]
--- a/.changelogs/1.1.8/329_add_log_prefix.yml
+++ b/.changelogs/1.1.8/329_add_log_prefix.yml
@@ -0,0 +1,2 @@
+fixed:
+  - Add a static ProxLB prefix to the log output when used by journal handler (@gyptazy). [#329]
--- a/.changelogs/1.1.8/release_meta.yml
+++ b/.changelogs/1.1.8/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-10-09
--- a/.changelogs/1.1.9/337_add_pressure_based_balancing.yml
+++ b/.changelogs/1.1.9/337_add_pressure_based_balancing.yml
@@ -0,0 +1,5 @@
+added:
+  - Add pressure (PSI) based balancing for memory, cpu, disk (req. PVE9 or greater) (@gyptazy). [#337]
+  - Pressure (PSI) based balancing for nodes
+  - Pressure (PSI) based balancing for guests
+  - Add PVE version evaluation
--- a/.changelogs/1.1.9/342_add_memory_balancing_threshold.yml
+++ b/.changelogs/1.1.9/342_add_memory_balancing_threshold.yml
@@ -0,0 +1,2 @@
+added:
+  - Add an optional memory balancing threshold (@gyptazy). [#342]
--- a/.changelogs/1.1.9/343_add_affinity_rules_support_by_pools.yml
+++ b/.changelogs/1.1.9/343_add_affinity_rules_support_by_pools.yml
@@ -0,0 +1,2 @@
+added:
+  - Add affinity/anti-affinity support by pools (@gyptazy). [#343]
--- a/.changelogs/1.1.9/release_meta.yml
+++ b/.changelogs/1.1.9/release_meta.yml
@@ -0,0 +1 @@
+date: 2025-10-30
--- a/.changelogs/1.2.0/141_add_power_management_for_cluster_nodes.yml
+++ b/.changelogs/1.2.0/141_add_power_management_for_cluster_nodes.yml
@@ -1,2 +0,0 @@
-added:
-  - Add power management feature for cluster nodes (by @gyptazy) [#141]
--- a/.changelogs/1.2.0/release_meta.yml
+++ b/.changelogs/1.2.0/release_meta.yml
@@ -1 +0,0 @@
-date: TBD
--- a/.github/workflows/30-pipeline-build-container-amd64.yml
+++ b/.github/workflows/30-pipeline-build-container-amd64.yml
@@ -0,0 +1,26 @@
+name: "Build Container Image: AMD64"
+on: [push]
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Checkout repository
+      uses: actions/checkout@v4
+    - name: Set up QEMU
+      uses: docker/setup-qemu-action@v3
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build amd64 image and save as tar
+      run: |
+        docker buildx build \
+          --platform linux/amd64 \
+          --load \
+          -t proxlb-image:amd64 \
+          .
+
+        docker save proxlb-image:amd64 -o proxlb_image_amd64.tar
+    - name: Upload Docker image artifact
+      uses: actions/upload-artifact@v4
+      with:
+        name: proxlb-image-amd64
+        path: proxlb_image_amd64.tar
--- a/.github/workflows/30-pipeline-build-container-arm64.yml
+++ b/.github/workflows/30-pipeline-build-container-arm64.yml
@@ -0,0 +1,26 @@
+name: "Build Container Image: ARM64"
+on: [push]
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Checkout repository
+      uses: actions/checkout@v4
+    - name: Set up QEMU
+      uses: docker/setup-qemu-action@v3
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build arm64 image and save as tar
+      run: |
+        docker buildx build \
+          --platform linux/arm64 \
+          --load \
+          -t proxlb-image:arm64 \
+          .
+
+        docker save proxlb-image:arm64 -o proxlb_image_arm64.tar
+    - name: Upload Docker image artifact
+      uses: actions/upload-artifact@v4
+      with:
+        name: proxlb-image-arm64
+        path: proxlb_image_arm64.tar
--- a/.github/workflows/30-pipeline-build-container-multi-arch.yml
+++ b/.github/workflows/30-pipeline-build-container-multi-arch.yml
@@ -0,0 +1,23 @@
+name: "Build Container Image: Multiarch"
+on: [push]
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: Checkout repository
+      uses: actions/checkout@v4
+    - name: Set up QEMU
+      uses: docker/setup-qemu-action@v3
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v3
+    - name: Build multi-arch image and save as tar
+      run: |
+        docker buildx build \
+          --platform linux/amd64,linux/arm64 \
+          --output type=tar,dest=proxlb_image_multiarch.tar \
+          .
+    - name: Upload Docker image artifact
+      uses: actions/upload-artifact@v4
+      with:
+        name: proxlb-image-multiarch
+        path: proxlb_image_multiarch.tar
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -5,6 +5,114 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

+## [1.1.9.1] - 2025-10-30
+
+### Added
+
+- Fix quoting in f-strings which may cause issues on PVE 8 / Debian Bookworm systems (@gyptazy). [#352]
+
+## [1.1.9] - 2025-10-30
+
+### Added
+
+- Add an optional memory balancing threshold (@gyptazy). [#342]
+- Add affinity/anti-affinity support by pools (@gyptazy). [#343]
+- Add pressure (PSI) based balancing for memory, cpu, disk (req. PVE9 or greater) (@gyptazy). [#337]
+    - Pressure (PSI) based balancing for nodes
+    - Pressure (PSI) based balancing for guests
+- Add PVE version evaluation
+
+## [1.1.8] - 2025-10-09
+
+### Fixed
+
+- Fix API errors when using conntrack aware migration with older PVE versions (@gyptazy). [#318]
+- Add a static ProxLB prefix to the log output when used by journal handler (@gyptazy). [#329]
+
+### Changed
+- Container image does not run as root anymore (@mikaelkrantz945). [#317]
+- Container image uses venv for running ProxLB (@mikaelkrantz945). [#317]
+
+## [1.1.7] - 2025-09-19
+
+### Added
+
+- Add conntrack state aware migrations of VMs (@gyptazy). [#305]
+- Add graceful shutdown for SIGINT (e.g., CTRL + C abort). (@gyptazy). [#304]
+
+### Fixed
+
+- Fix crash when validating absent migration job ids. (@gyptazy). [#308]
+- Fix guest object names are not being evaluated in debug log. (@gyptazy). [#310]
+
+## [1.1.6.1] - 2025-09-04
+
+### Fixed
+
+- Validate for node presence when pinning VMs to avoid crashing (@gyptazy). [#296]
+
+## [1.1.6] - 2025-09-04
+
+### Added
+
+- Add validation for provided API user token id to avoid confusions (@gyptazy). [#291]
+
+### Fixed
+
+- Fix stacktrace output when validating permissions on non existing users in Proxmox (@gyptazy). [#291]
+- Fix Overprovisioning first node if anti_affinity_group has only one member (@MiBUl-eu). [#295]
+- Validate for node presence when pinning guests to avoid crashing (@gyptazy). [#296]
+- Fix balancing evaluation of guest types (e.g., VM or CT) (@gyptazy). [#268]
+
+## [1.1.5] - 2025-07-14
+
+### Added
+
+- Allow custom API ports instead of fixed tcp/8006 (@gyptazy). [#260]
+
+
+## [1.1.4] - 2025-06-27
+
+### Added
+
+- Allow pinning of guests to a group of nodes (@gyptazy). [#245]
+
+### Fixed
+
+- Modified log levels to make output lighter at INFO level (@pmarasse) [#255]
+- Fixed an issue where balancing was performed in combination of deactivated balancing and dry-run mode (@gyptazy). [#248]
+
+
+## [1.1.3] - 2025-06-19
+
+### Added
+
+- Add relaod (SIGHUP) function to ProxLB to reload the configuration (by @gyptazy). [#189]
+- Add optional wait time parameter to delay execution until the service takes action (by @gyptazy). [#239]
+- Make the amount of parallel migrations configurable (by @gyptazy). [#241]
+
+### Changed
+
+- Use the average CPU consumption of a guest within the last 60 minutes instead of the current CPU usage (by @philslab-ninja & @gyptazy). [#94]
+
+### Fixed
+
+- Align maintenance mode with Proxmox HA maintenance mode (by @gyptazy). [#232]
+
+
+## [1.1.2] - 2025-05-13
+
+### Added
+
+- Add a configurable retry mechanism when connecting to the Proxmox API (by @gyptazy) [#157]
+- Add 1-to-1 relationships between guest and hypervisor node to ping a guest on a node (by @gyptazy) [#218]
+
+### Fixed
+
+- Force type cast cpu count of guests to int for some corner cases where a str got returned (by @gyptazy). [#222]
+- Fix systemd unit file to run after network on non PVE nodes (by @robertdahlem) [#137]
+
+
 ## [1.1.1] - 2025-04-20

 ### Added
--- a/35
+++ b/35
@@ -9,20 +9,33 @@ LABEL org.label-schema.vendor="gyptazy"
 LABEL org.label-schema.url="https://proxlb.de"
 LABEL org.label-schema.vcs-url="https://github.com/gyptazy/ProxLB"

-# Install Python3
-RUN apk add --no-cache python3 py3-pip
+# --- Step 1 (root): system deps, user, dirs ---
+RUN apk add --no-cache python3 py3-pip \
+  && addgroup -S plb \
+  && adduser -S -G plb -h /home/plb plb \
+  && mkdir -p /app/conf /opt/venv \
+  && chown -R plb:plb /app /home/plb /opt/venv

-# Create a directory for the app
 WORKDIR /app

-# Copy the python program from the current directory to /app
-COPY proxlb /app/proxlb
+# Copy only requirements first for better layer caching
+COPY --chown=plb:plb requirements.txt /app/requirements.txt

-# Copy requirements to the container
-COPY requirements.txt /app/requirements.txt
+# --- Step 2 (appuser): venv + deps + code ---
+USER plb

-# Install dependencies in the virtual environment
-RUN pip install --break-system-packages -r /app/requirements.txt
+# Create venv owned by appuser and put it on PATH
+RUN python3 -m venv /opt/venv
+ENV PATH="/opt/venv/bin:${PATH}"

-# Set the entry point to use the virtual environment's python
-ENTRYPOINT ["/usr/bin/python3", "/app/proxlb/main.py"]
+# Install Python dependencies into the venv (no PEP 668 issues)
+RUN pip install --no-cache-dir -r /app/requirements.txt
+
+# Copy application code (owned by appuser)
+COPY --chown=plb:plb proxlb /app/proxlb
+
+# Optional: placeholder config so a bind-mount can override cleanly
+RUN touch /app/conf/proxlb.yaml
+
+# Run as non-root using venv Python
+ENTRYPOINT ["/opt/venv/bin/python", "/app/proxlb/main.py"]
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
 # ProxLB - (Re)Balance VM Workloads in Proxmox Clusters
-<img align="left" src="https://cdn.gyptazy.com/images/Prox-LB-logo.jpg"/>
+<img align="left" src="https://cdn.gyptazy.com/img/ProxLB.jpg"/>
 <br>

 <p float="center"><img src="https://img.shields.io/github/license/gyptazy/ProxLB"/><img src="https://img.shields.io/github/contributors/gyptazy/ProxLB"/><img src="https://img.shields.io/github/last-commit/gyptazy/ProxLB/main"/><img src="https://img.shields.io/github/issues-raw/gyptazy/ProxLB"/><img src="https://img.shields.io/github/issues-pr/gyptazy/ProxLB"/></p>
@@ -8,31 +8,32 @@
 1. [Introduction](#introduction)
 2. [Features](#features)
 3. [How does it work?](#how-does-it-work)
-4. [Installation](#installation)
+4. [Documentation](#documentation)
+5. [Installation](#installation)
   1. [Requirements / Dependencies](#requirements--dependencies)
   2. [Debian Package](#debian-package)
   4. [Container / Docker](#container--docker)
   5. [Source](#source)
-5. [Usage / Configuration](#usage--configuration)
+6. [Usage / Configuration](#usage--configuration)
   1. [GUI Integration](#gui-integration)
   2. [Proxmox HA Integration](#proxmox-ha-integration)
   3. [Options](#options)
-6. [Affinity & Anti-Affinity Rules](#affinity--anti-affinity-rules)
+7. [Affinity & Anti-Affinity Rules](#affinity--anti-affinity-rules)
   1. [Affinity Rules](#affinity-rules)
   2. [Anti-Affinity Rules](#anti-affinity-rules)
   3. [Ignore VMs](#ignore-vms)
   4. [Pin VMs to Hypervisor Nodes](#pin-vms-to-hypervisor-nodes)
-7. [Maintenance](#maintenance)
-8. [Misc](#misc)
+8. [Maintenance](#maintenance)
+9. [Misc](#misc)
   1. [Bugs](#bugs)
   2. [Contributing](#contributing)
-   3. [Documentation](#documentation)
-   4. [Support](#support)
-9. [Author(s)](#authors)
+   3. [Support](#support)
+   4. [Enterprise-Support](#enterprise-support)
+10. [Author(s)](#authors)


 ## Introduction
-ProxLB is an advanced load balancing solution specifically designed for Proxmox clusters, addressing the absence of a Dynamic Resource Scheduler (DRS) that is familiar to VMware users. As a third-party solution, ProxLB enhances the management and efficiency of Proxmox clusters by intelligently distributing workloads across available nodes. Workloads can be balanced by different times like the guest's memory, CPU or disk usage or their assignment to avoid overprovisioning and ensuring resources.
+ProxLB is an advanced load balancing solution specifically designed for Proxmox clusters, addressing the absence of an intelligent and more advanced resource scheduler. As a third-party solution, ProxLB enhances the management and efficiency of Proxmox clusters by intelligently distributing workloads across available nodes. Workloads can be balanced by different times like the guest's memory, CPU or disk usage or their assignment to avoid overprovisioning and ensuring resources.

 One of the key advantages of ProxLB is that it is fully open-source and free, making it accessible for anyone to use, modify, and contribute to. This ensures transparency and fosters community-driven improvements. ProxLB supports filtering and ignoring specific nodes and guests through configuration files and API calls, providing administrators with the flexibility to tailor the load balancing behavior to their specific needs.

@@ -43,41 +44,51 @@ ProxLB can also return the best next node for guest placement, which can be inte
 Overall, ProxLB significantly enhances resource management by intelligently distributing workloads, reducing downtime through its maintenance mode, and providing improved flexibility with affinity and anti-affinity rules. Its seamless integration with CI/CD tools and reliance on the Proxmox API make it a robust and secure solution for optimizing Proxmox cluster performance.

 ### Video of Migration
-<img src="https://cdn.gyptazy.com/images/proxlb-rebalancing-demo.gif"/>
+<img src="https://cdn.gyptazy.com/img/proxlb-rebalancing-demo.gif"/>

 ## Features
-ProxLB's key features are by enabling automatic rebalancing of VMs and CTs across a Proxmox cluster based on memory, CPU, and local disk usage while identifying optimal nodes for automation. It supports maintenance mode, affinity rules, and seamless Proxmox API integration with ACL support, offering flexible usage as a one-time operation, a daemon, or through the Proxmox Web GUI. In addition, ProxLB also supports additional enterprise alike features like power managements for nodes (often also known as DPM) where nodes can be turned on/off on demand when workloads are higher/lower than usual. Also the automated security-patching of nodes within the cluster (known as ASPM) may help to reduce the manual work from cluster admins, where nodes will install patches, move guests across the cluster, reboot and then reblance the cluster again.
+ProxLB's key features are by enabling automatic rebalancing of VMs and CTs across a Proxmox cluster based on memory, CPU, and local disk usage while identifying optimal nodes for automation. It supports maintenance mode, affinity rules, and seamless Proxmox API integration with ACL support, offering flexible usage as a one-time operation, a daemon, or through the Proxmox Web GUI.

 **Features**
-* Re-Balancing (DRS)
-  * Supporting VMs & CTs
-  * Balancing by:
-    * CPU
-    * Memory
-    * Disk
-  * Affinity / Anti-Affinity Rules
-    * Affinity: Groups guests together
-    * Anti-Affinity: Ensuring guests run on different nodes
-  * Best node evaluation
-    * Get the best node for guest placement (e.g., CI/CD)
+* Rebalance VMs/CTs in the cluster by:
+  * Memory
+  * Disk (only local storage)
+  * CPU
+* Rebalance by different modes:
+  * Used resources
+  * Assigned resources
+  * PSI (Pressure) of resources
+* Get best nodes for further automation
+* Supported Guest Types
+  * VMs
+  * CTs
 * Maintenance Mode
-  * Evacuating a sinlge or multiple nodes
-* Node Power Management (DPM)
-* Auto Node Security-Patch-Management (ASPM)
+  * Set node(s) into maintenance
+  * Move all workloads to different nodes
+* Affinity / Anti-Affinity Rules
 * Fully based on Proxmox API
-  * Utilizing the Proxmox User Authentications
-  * Supporting API tokens
-  * No SSH or Agents required
-  * Can run everywhere
+  * Fully integrated into the Proxmox ACL
+  * No SSH required
+* Usage
+  * One-Time
+  * Daemon
+  * Proxmox Web GUI Integration

 ## How does it work?
 ProxLB is a load-balancing system designed to optimize the distribution of virtual machines (VMs) and containers (CTs) across a cluster. It works by first gathering resource usage metrics from all nodes in the cluster through the Proxmox API. This includes detailed resource metrics for each VM and CT on every node. ProxLB then evaluates the difference between the maximum and minimum resource usage of the nodes, referred to as "Balanciness." If this difference exceeds a predefined threshold (which is configurable), the system initiates the rebalancing process.

 Before starting any migrations, ProxLB validates that rebalancing actions are necessary and beneficial. Depending on the selected balancing mode — such as CPU, memory, or disk — it creates a balancing matrix. This matrix sorts the VMs by their maximum used or assigned resources, identifying the VM with the highest usage. ProxLB then places this VM on the node with the most free resources in the selected balancing type. This process runs recursively until the operator-defined Balanciness is achieved. Balancing can be defined for the used or max. assigned resources of VMs/CTs.

+## Documentation
+This `README.md` doesn't contain all information and only highlights the most important facts. Extended information, such like API permissions, creating dedicated user, best-practices in running ProxLB and much more can be found in the [docs/](https://github.com/gyptazy/ProxLB/tree/main/docs) directory. Please consult the documentation before creating issues.
+
 ## Installation

 ### Requirements / Dependencies
+* Proxmox
+    * Proxmox 7.x
+    * Proxmox 8.x
+    * Proxmox 9.x
 * Python3.x
 * proxmoxer
 * requests
@@ -131,7 +142,7 @@ wget -O /etc/apt/trusted.gpg.d/proxlb.asc https://repo.gyptazy.com/repository.gp

 #### Debian Packages (.deb files)
 If you do not want to use the repository you can also find the debian packages as a .deb file on gyptazy's CDN at:
-* https://cdn.gyptazy.com/files/os/debian/proxlb/
+* https://cdn.gyptazy.com/debian/

 Afterwards, you can simply install the package by running:
 ```bash
@@ -162,6 +173,16 @@ docker run -it --rm -v $(pwd)/proxlb.yaml:/etc/proxlb/proxlb.yaml proxlb
 | Version | Image |
 |------|:------:|
 | latest | cr.gyptazy.com/proxlb/proxlb:latest |
+| v1.1.9.1 | cr.gyptazy.com/proxlb/proxlb:v1.1.9.1 |
+| v1.1.9 | cr.gyptazy.com/proxlb/proxlb:v1.1.9 |
+| v1.1.8 | cr.gyptazy.com/proxlb/proxlb:v1.1.8 |
+| v1.1.7 | cr.gyptazy.com/proxlb/proxlb:v1.1.7 |
+| v1.1.6.1 | cr.gyptazy.com/proxlb/proxlb:v1.1.6.1 |
+| v1.1.6 | cr.gyptazy.com/proxlb/proxlb:v1.1.6 |
+| v1.1.5 | cr.gyptazy.com/proxlb/proxlb:v1.1.5 |
+| v1.1.4 | cr.gyptazy.com/proxlb/proxlb:v1.1.4 |
+| v1.1.3 | cr.gyptazy.com/proxlb/proxlb:v1.1.3 |
+| v1.1.2 | cr.gyptazy.com/proxlb/proxlb:v1.1.2 |
 | v1.1.1 | cr.gyptazy.com/proxlb/proxlb:v1.1.1 |
 | v1.1.0 | cr.gyptazy.com/proxlb/proxlb:v1.1.0 |
 | v1.0.6 | cr.gyptazy.com/proxlb/proxlb:v1.0.6 |
@@ -215,7 +236,7 @@ docker run -it --rm -v $(pwd)/proxlb.yaml:/etc/proxlb/proxlb.yaml proxlb
 Running ProxLB is straightforward and versatile, as it only requires `Python3` and the `proxmoxer` library. This means ProxLB can be executed directly on a Proxmox node or on dedicated systems such as Debian, RedHat, or even FreeBSD, provided that the Proxmox API is accessible from the client running ProxLB. ProxLB can also run inside a Container - Docker or LXC - and is simply up to you.

 ### GUI Integration
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-GUI-integration.jpg"/> ProxLB can also be accessed through the Proxmox Web UI by installing the optional `pve-proxmoxlb-service-ui` package, which depends on the proxlb package. For full Web UI integration, this package must be installed on all nodes within the cluster. Once installed, a new menu item - `Rebalancing`, appears in the cluster level under the HA section. Once installed, it offers two key functionalities:
+<img align="left" src="https://cdn.gyptazy.com/img/rebalance-ui.jpg"/> ProxLB can also be accessed through the Proxmox Web UI by installing the optional `pve-proxmoxlb-service-ui` package, which depends on the proxlb package. For full Web UI integration, this package must be installed on all nodes within the cluster. Once installed, a new menu item - `Rebalancing`, appears in the cluster level under the HA section. Once installed, it offers two key functionalities:
 * Rebalancing VM workloads
 * Migrate VM workloads away from a defined node (e.g. maintenance preparation)

@@ -238,7 +259,7 @@ The following options can be set in the configuration file `proxlb.yaml`:
 | Section | Option | Sub Option | Example | Type | Description |
 |---------|:------:|:----------:|:-------:|:----:|:-----------:|
 | `proxmox_api` |  |  |  |  |  |
-|  | hosts |  | ['virt01.example.com', '10.10.10.10', 'fe01::bad:code::cafe'] | `List` | List of Proxmox nodes. Can be IPv4, IPv6 or mixed. |
+|  | hosts |  | ['virt01.example.com', '10.10.10.10', 'fe01:bad:code::cafe', 'virt01.example.com:443', '[fc00::1]', '[fc00::1]:443', 'fc00::1:8006'] | `List` | List of Proxmox nodes. Can be IPv4, IPv6 or mixed. You can specify custom ports. In case of IPv6 without brackets the port is considered after the last colon |
 |  | user |  | root@pam | `Str` | Username for the API. |
 |  | pass |  | FooBar | `Str` | Password for the API. (Recommended: Use API token authorization!) |
 |  | token_id |  | proxlb | `Str` | Token ID of the user for the API. |
@@ -248,38 +269,41 @@ The following options can be set in the configuration file `proxlb.yaml`:
 |  | retries |  | 1 | `Int` | How often a connection attempt to the defined API host should be performed. |
 |  | wait_time |  | 1 | `Int` | How many seconds should be waited before performing another connection attempt to the API host. |
 | `proxmox_cluster` |  |  |  |  |  |
-|  | maintenance_nodes |  | ['virt66.example.com'] | `List` | A list of Proxmox nodes that are defined to be in a maintenance. |
+|  | maintenance_nodes |  | ['virt66.example.com'] | `List` | A list of Proxmox nodes that are defined to be in a maintenance. (must be the same node names as used within the cluster) |
 |  | ignore_nodes |  | [] | `List` | A list of Proxmox nodes that are defined to be ignored. |
 |  | overprovisioning |  | False | `Bool` | Avoids balancing when nodes would become overprovisioned. |
 | `balancing` |  |  |  |  |  |
 |  | enable |  | True | `Bool` | Enables the guest balancing.|
 |  | enforce_affinity |  | True | `Bool` | Enforcing affinity/anti-affinity rules but balancing might become worse. |
 |  | parallel |  | False | `Bool` | If guests should be moved in parallel or sequentially.|
+|  | parallel_jobs |  | 5 | `Int` | The amount if parallel jobs when migrating guests. (default: `5`)|
 |  | live |  | True | `Bool` | If guests should be moved live or shutdown.|
 |  | with_local_disks |  | True | `Bool` | If balancing of guests should include local disks.|
+|  | with_conntrack_state |  | True | `Bool` | If balancing of guests should including the conntrack state.|
 |  | balance_types |  | ['vm', 'ct'] | `List` | Defined the types of guests that should be honored. [values: `vm`, `ct`]|
 |  | max_job_validation |  | 1800 | `Int` | How long a job validation may take in seconds. (default: 1800) |
 |  | balanciness |  | 10 | `Int` | The maximum delta of resource usage between node with highest and lowest usage. |
+|  | memory_threshold |  | 75 | `Int` | The maximum threshold (in percent) that needs to be hit to perform balancing actions. (Optional) |
 |  | method |  | memory | `Str` | The balancing method that should be used.  [values: `memory` (default), `cpu`, `disk`]|
-|  | mode |  | used | `Str` | The balancing mode that should be used. [values: `used` (default), `assigned`] |
-| `dpm` |  |  |  |  |  |
-|  | enable |  | True | `Bool` | Enables the Dynamic Power Management functions.|
-|  | method |  | memory | `Str` | The balancing method that should be used.  [values: `memory` (default), `cpu`, `disk`]|
-|  | mode |  | static | `Str` | The balancing mode that should be used. [values: `static` (default), `auto`] |
-|  | cluster_min_free_resources |  | 60 | `Int` | Representing the minimum required free resouzrces in percent within the cluster. [values: `60`% (default)] |
-|  | cluster_min_nodes |  | 3 | `Int` | The minimum of required nodes that should remain in a cluster. [values: `3` (default)] |
+|  | mode |  | used | `Str` | The balancing mode that should be used. [values: `used` (default), `assigned`, `psi` (pressure)] |
+|  | psi |  | { nodes: { memory: { pressure_full: 0.20, pressure_some: 0.20, pressure_spikes: 1.00 }}} | `Dict` | A dict of PSI based thresholds for nodes and guests |
+|  | pools |  | pools: { dev: { type: affinity }, de-nbg01-db: { type: anti-affinity }} | `Dict` | A dict of pool names and their type for creating affinity/anti-affinity rules |
 | `service` |  |  |  |  |  |
 |  | daemon |  | True | `Bool` | If daemon mode should be activated. |
 |  | `schedule` |  |  | `Dict` | Schedule config block for rebalancing. |
 |  |  | interval | 12 | `Int` | How often rebalancing should occur in daemon mode.|
 |  |  | format | hours | `Str` | Sets the time format. [values: `hours` (default), `minutes`]|
+|  | `delay` |  |  | `Dict` | Schedule config block for an optional delay until the service starts. |
+|  |  | enable | False | `Bool` | If a delay time should be validated.|
+|  |  | time | 1 | `Int` | Delay time until the service starts after the initial execution.|
+|  |  | format | hours | `Str` | Sets the time format. [values: `hours` (default), `minutes`]|
 |  | log_level |  | INFO | `Str` | Defines the default log level that should be logged. [values: `INFO` (default), `WARNING`, `CRITICAL`, `DEBUG`] |


 An example of the configuration file looks like:
 ```
 proxmox_api:
-  hosts: ['virt01.example.com', '10.10.10.10', 'fe01::bad:code::cafe']
+  hosts: ['virt01.example.com', '10.10.10.10', 'fe01:bad:code::cafe']
  user: root@pam
  pass: crazyPassw0rd!
  # API Token method
@@ -302,26 +326,60 @@ balancing:
  parallel: False
  live: True
  with_local_disks: True
+  with_conntrack_state: True
  balance_types: ['vm', 'ct']
  max_job_validation: 1800
+  memory_threshold: 75
  balanciness: 5
  method: memory
  mode: used
-
-dpm:
-  # DPM requires you to define the WOL (Wake-on-Lan)
-  # MAC address for each node in Proxmox.
-  enable: True
-  method: memory
-  mode: static
-  cluster_min_free_resources: 60
-  cluster_min_nodes: 1
+# # PSI thresholds only apply when using mode 'psi'
+# # PSI based balancing is currently in beta and req. PVE >= 9
+# psi:
+#   nodes:
+#     memory:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+#     cpu:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+#     disk:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+#   guests:
+#     memory:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+#     cpu:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+#     disk:
+#       pressure_full: 0.20
+#       pressure_some: 0.20
+#       pressure_spikes: 1.00
+  pools:
+    dev:
+      type: affinity
+    de-nbg01-db
+      type: anti-affinity
+      pin:
+        - virt66
+        - virt77

 service:
  daemon: True
  schedule:
    interval: 12
    format: hours
+  delay:
+    enable: False
+    time: 1
+    format: hours
  log_level: INFO
 ```

@@ -342,19 +400,33 @@ ProxLB provides an advanced mechanism to define affinity and anti-affinity rules
 ProxLB implements affinity and anti-affinity rules through a tag-based system within the Proxmox web interface. Each guest (virtual machine or container) can be assigned specific tags, which then dictate its placement behavior. This method maintains a streamlined and secure approach to managing VM relationships while preserving Proxmox’s inherent permission model.

 ### Affinity Rules
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-affinity-rules.jpg"/> Affinity rules are used to group certain VMs together, ensuring that they run on the same host whenever possible. This can be beneficial for workloads requiring low-latency communication, such as clustered databases or application servers that frequently exchange data.
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-affinity-rules.jpg"/> Affinity rules are used to group certain VMs together, ensuring that they run on the same host whenever possible. This can be beneficial for workloads requiring low-latency communication, such as clustered databases or application servers that frequently exchange data. In general, there're two ways to manage affinity rules:

+#### Affinity Rules by Tags
 To define an affinity rule which keeps all guests assigned to this tag together on a node, users assign a tag with the prefix `plb_affinity_$TAG`:

 #### Example for Screenshot
 ```
 plb_affinity_talos
 ```
-
 As a result, ProxLB will attempt to place all VMs with the `plb_affinity_web` tag on the same host (see also the attached screenshot with the same node).

-### Anti-Affinity Rules
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-anti-affinity-rules.jpg"/> Conversely, anti-affinity rules ensure that designated VMs do not run on the same physical host. This is particularly useful for high-availability setups, where redundancy is crucial. Ensuring that critical services are distributed across multiple hosts reduces the risk of a single point of failure.
+#### Affinity Rules by Pools
+Antoher approach is by using pools in Proxmox. This way, it can easily also combined with other resources like backup jobs. However, in this approach you need to modify the ProxLB config file to your needs. Within the `balancing` section you can create a dict of pools, including the pool name and the affinity type. Please see the example for further details:
+
+**Example Config**
+```
+balancing:
+  [...]
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    dev:                              # Pool name: dev
+      type: affinity                  # Type: affinity (keeping VMs together)
+      pin:                            # Pin VMs to Nodes
+        - virt77                        # Pinning to 'virt77' which is maybe an older system for dev labs
+```
+
+### Anti-Affinity Rules by Tags
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-anti-affinity-rules.jpg"/> Conversely, anti-affinity rules ensure that designated VMs do not run on the same physical host. This is particularly useful for high-availability setups, where redundancy is crucial. Ensuring that critical services are distributed across multiple hosts reduces the risk of a single point of failure. In general, there're two ways to manage anti-affinity rules:

 To define an anti-affinity rule that ensures to not move systems within this group to the same node, users assign a tag with the prefix:

@@ -365,10 +437,23 @@ plb_anti_affinity_ntp

 As a result, ProxLB will try to place the VMs with the `plb_anti_affinity_ntp` tag on different hosts (see also the attached screenshot with the different nodes).

+#### Anti-Affinity Rules by Pools
+Antoher approach is by using pools in Proxmox. This way, it can easily also combined with other resources like backup jobs. However, in this approach you need to modify the ProxLB config file to your needs. Within the `balancing` section you can create a dict of pools, including the pool name and the affinity type. Please see the example for further details:
+
+**Example Config**
+```
+balancing:
+  [...]
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    de-nbg01-db:                      # Pool name: de-nbg01-db
+      type: anti-affinity                  # Type: anti-affinity (spreading VMs apart)
+```
+
+
 **Note:** While this ensures that ProxLB tries distribute these VMs across different physical hosts within the Proxmox cluster this may not always work. If you have more guests attached to the group than nodes in the cluster, we still need to run them anywhere. If this case occurs, the next one with the most free resources will be selected.

 ### Ignore VMs
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-ignore-vm-movement.jpg"/> Guests, such as VMs or CTs, can also be completely ignored. This means, they won't be affected by any migration (even when (anti-)affinity rules are enforced). To ensure a proper resource evaluation, these guests are still collected and evaluated but simply skipped for balancing actions. Another thing is the implementation. While ProxLB might have a very restricted configuration file including the file permissions, this file is only read- and writeable by the Proxmox administrators. However, we might have user and groups who want to define on their own that their systems shouldn't be moved. Therefore, these users can simpy set a specific tag to the guest object - just like the (anti)affinity rules.
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-ignore-vm-movement.jpg"/> Guests, such as VMs or CTs, can also be completely ignored. This means, they won't be affected by any migration (even when (anti-)affinity rules are enforced). To ensure a proper resource evaluation, these guests are still collected and evaluated but simply skipped for balancing actions. Another thing is the implementation. While ProxLB might have a very restricted configuration file including the file permissions, this file is only read- and writeable by the Proxmox administrators. However, we might have user and groups who want to define on their own that their systems shouldn't be moved. Therefore, these users can simpy set a specific tag to the guest object - just like the (anti)affinity rules.

 To define a guest to be ignored from the balancing, users assign a tag with the prefix `plb_ignore_$TAG`:

@@ -382,8 +467,9 @@ As a result, ProxLB will not migrate this guest with the `plb_ignore_dev` tag to
 **Note:** Ignored guests are really ignored. Even by enforcing affinity rules this guest will be ignored.

 ### Pin VMs to Specific Hypervisor Nodes
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-tag-node-pinning.jpg"/> Guests, such as VMs or CTs, can also be pinned to specific nodes in the cluster. This might be usefull when running applications with some special licensing requirements that are only fulfilled on certain nodes. It might also be interesting, when some physical hardware is attached to a node, that is not available in general within the cluster.
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-tag-node-pinning.jpg"/> Guests, such as VMs or CTs, can also be pinned to specific (and multiple) nodes in the cluster. This might be usefull when running applications with some special licensing requirements that are only fulfilled on certain nodes. It might also be interesting, when some physical hardware is attached to a node, that is not available in general within the cluster.

+#### Pinning VMs to (a) specific Hypervisor Node(s) by Tag
 To pin a guest to a specific cluster node, users assign a tag with the prefix `plb_pin_$nodename` to the desired guest:

 #### Example for Screenshot
@@ -393,11 +479,27 @@ plb_pin_node03

 As a result, ProxLB will pin the guest `dev-vm01` to the node `virt03`.

+
+#### Pinning VMs to (a) specific Hypervisor Node(s) by Pools
+Beside the tag approach, you can also pin a resource group to a specific hypervisor or groups of hypervisors by defining a `pin` key of type list.
+
+**Example Config**
+```
+balancing:
+  [...]
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    dev:                              # Pool name: dev
+      type: affinity                  # Type: affinity (keeping VMs together)
+      pin:                            # Pin VMs to Nodes
+        - virt77                        # Pinning to 'virt77' which is maybe an older system for dev labs
+```
+
+
+You can also repeat this step multiple times for different node names to create a potential group of allowed hosts where a the guest may be served on. In this case, ProxLB takes the node with the lowest used resources according to the defined balancing values from this group.
+
 **Note:** The given node names from the tag are validated. This means, ProxLB validated if the given node name is really part of the cluster. In case of a wrongly defined or unavailable node name it continous to use the regular processes to make sure the guest keeps running.

 ## Maintenance
-<img src="https://cdn.gyptazy.com/images/proxlb-rebalancing-demo.gif"/>
-
 The `maintenance_nodes` option allows operators to designate one or more Proxmox nodes for maintenance mode. When a node is set to maintenance, no new guest workloads will be assigned to it, and all existing workloads will be migrated to other available nodes within the cluster. This process ensures that (anti)-affinity rules and resource availability are respected, preventing disruptions while maintaining optimal performance across the infrastructure.

 ### Adding / Removing Nodes from Maintenance
@@ -415,9 +517,6 @@ Bugs can be reported via the GitHub issue tracker [here](https://github.com/gypt
 ### Contributing
 Feel free to add further documentation, to adjust already existing one or to contribute with code. Please take care about the style guide and naming conventions. You can find more in our [CONTRIBUTING.md](https://github.com/gyptazy/ProxLB/blob/main/CONTRIBUTING.md) file.

-### Documentation
-You can also find additional and more detailed documentation within the [docs/](https://github.com/gyptazy/ProxLB/tree/main/docs) directory.
-
 ### Support
 If you need assistance or have any questions, we offer support through our dedicated [chat room](https://matrix.to/#/#proxlb:gyptazy.com) in Matrix or [Discord](https://discord.gg/JemGu7WbfQ). Join our community for real-time help, advice, and discussions. The Matrix and Discord room are bridged to ensure that the communication is not splitted - so simply feel free to join which fits most to you!

@@ -432,5 +531,18 @@ Connect with us in our dedicated chat room for immediate support and live intera

 **Note:** Please always keep in mind that this is a one-man show project without any further help. This includes coding, testing, packaging and all the infrastructure around it to keep this project up and running.

+### Enterprise-Support
+Running critical infrastructure in an enterprise environment often comes with requirements that go far beyond functionality alone. Enterprises typically expect predictable service levels, defined escalation paths, and guaranteed response times. In many cases, organizations also demand 24x7 support availability to ensure that their systems remain stable and resilient, even under unexpected circumstances.
+
+As the creator and maintainer of ProxLB, I operate as a one-man project. While I am continuously working to improve the software, I cannot provide the type of enterprise-grade support that large organizations may require. To address this need, several companies have stepped in to offer professional services around ProxLB in Proxmox VE clusters.
+
+Below is a list of organizations currently known to provide enterprise-level support for ProxLB. If your business relies on ProxLB in production and you require more than community-based support, these providers may be a good fit for your needs:
+
+| Company| Country | Web |
+|------|:------:|:------:|
+| credativ | DE | [credativ.de](https://www.credativ.de/en/portfolio/support/proxmox-virtualization/) |
+
+*Note: If you provide support for ProxLB, feel free to create PR with your addition.*
+
 ### Author(s)
 * Florian Paul Azim Hoberg @gyptazy (https://gyptazy.com)
--- a/config/proxlb_example.yaml
+++ b/config/proxlb_example.yaml
@@ -1,5 +1,5 @@
 proxmox_api:
-  hosts: ['virt01.example.com', '10.10.10.10', 'fe01::bad:code::cafe']
+  hosts: ['virt01.example.com', '10.10.10.10', 'fe01:bad:code::cafe']
  user: root@pam
  pass: crazyPassw0rd!
  # API Token method
@@ -20,24 +20,62 @@ balancing:
  enable: True
  enforce_affinity: False
  parallel: False
+  # If running parallel job, you can define
+  # the amount of prallel jobs (default: 5)
+  parallel_jobs: 1
  live: True
  with_local_disks: True
-  balance_types: ['vm', 'ct']
-  max_job_validation: 1800
-  balanciness: 5
-  method: memory
-  mode: used
-
-dpm:
-  enable: True
-  method: memory
-  mode: static
-  cluster_min_free_resources: 60
-  cluster_min_nodes: 1
+  with_conntrack_state: True
+  balance_types: ['vm', 'ct']         # 'vm' | 'ct'
+  max_job_validation: 1800            # Maximum time (in seconds) a job validation may take
+  memory_threshold: 75                # Optional: Maximum threshold (in percent) to trigger balancing actions
+  balanciness: 5                      # Maximum delta of resource usage between highest and lowest usage node
+  method: memory                      # 'memory' | 'cpu' | 'disk'
+  mode: used                          # 'assigned' | 'used' | 'psi'
+  # # PSI thresholds only apply when using mode 'psi'
+  # psi:
+  #   nodes:
+  #     memory:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  #     cpu:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  #     disk:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  #   guests:
+  #     memory:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  #     cpu:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  #     disk:
+  #       pressure_full: 0.20
+  #       pressure_some: 0.20
+  #       pressure_spikes: 1.00
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    dev:                              # Pool name: dev
+      type: affinity                  # Type: affinity (keeping VMs together)
+    de-nbg01-db:                      # Pool name: de-nbg01-db
+      type: anti-affinity             # Type: anti-affinity (spreading VMs apart)
+      pin:                            # Define a pinning og guests to specific node(s)
+        - virt66
+        - virt77

 service:
  daemon: True
  schedule:
    interval: 12
    format: hours
+  delay:
+    enable: False
+    time: 1
+    format: hours
  log_level: INFO
--- a/debian/changelog
+++ b/debian/changelog
@@ -1,8 +1,82 @@
-proxlb (1.1.2~b1) stable; urgency=medium
+proxlb (1.1.9.1) stable; urgency=medium

-  * Auto-created 1.1.2 beta 1 release.
+  * Fix quoting in f-strings which may cause issues on PVE 8 / Debian Bookworm systems (Closes: #352)

- -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Mon, 17 Mar 2025 18:55:02 +0000
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 30 Oct 2025 17:41:02 +0001
+
+proxlb (1.1.9) stable; urgency=medium
+
+  * Add pressure (PSI) based balancing for memory, cpu, disk (req. PVE9 or greater). (Closes: #339)
+  * Add (memory) threshold for nodes before running balancing. (Closes: #342)
+  * Add affinity/anti-affinity support by pools. (Closes: #343)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 30 Oct 2025 06:58:43 +0001
+
+proxlb (1.1.8) stable; urgency=medium
+
+  * Fix API errors when using conntrack aware migration with older PVE version. (Closes: #318)
+  * Add a static ProxLB prefix to the log output when used by journal handler. (Closes: #329)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 09 Oct 2025 09:04:13 +0002
+
+proxlb (1.1.7) stable; urgency=medium
+
+  * Add conntrack state aware migrations of VMs. (Closes: #305)
+  * Add graceful shutdown for SIGINT command. (Closes: #304)
+  * Fix crash when validating absent migration job ids. (Closes: #308)
+  * Fix guest object names are not being evaluated in debug log. (Closes: #310)
+  * Note: Have a great Dutch Proxmox Day 2025!
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 04 Sep 2025 19:23:51 +0000
+
+proxlb (1.1.6.1) stable; urgency=medium
+
+  * Validate for node presence when pinning VMs to avoid crashing. (Closes: #296)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 04 Sep 2025 19:23:51 +0000
+
+proxlb (1.1.6) stable; urgency=medium
+
+  * Add validation for provided API user token id to avoid confusions. (Closes: #291)
+  * Fix stacktrace output when validating permissions on non existing users in Proxmox. (Closes: #291)
+  * Fix Overprovisioning first node if anti_affinity_group has only one member. (Closes: #295)
+  * Validate for node presence when pinning guests to avoid crashing. (Closes: #296)
+  * Fix balancing evaluation of guest types (e.g., VM or CT). (Closes: #268)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 04 Sep 2025 05:12:19 +0000
+
+proxlb (1.1.5) stable; urgency=medium
+
+  * Allow custom API ports instead of fixed tcp/8006. (Closes: #260)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Mon, 14 Jul 2025 11:07:34 +0000
+
+proxlb (1.1.4) stable; urgency=medium
+
+  * Allow pinning of guests to a group of nodes. (Closes: #245)
+  * Modified log levels to make output lighter at INFO level. (Closes: #255)
+  * ixed an issue where balancing was performed in combination of deactivated balancing and dry-run mode. (Closes: #248)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Fri, 27 Jun 2025 16:22:58 +0000
+
+proxlb (1.1.3) stable; urgency=medium
+
+  * Add relaod (SIGHUP) function to ProxLB to reload the configuration. (Closes: #189)
+  * Add optional wait time parameter to delay execution until the service takes action. (Closes: #239)
+  * Make the amount of parallel migrations configurable. (Closes: #241)
+  * Use the average CPU consumption of a guest within the last 60 minutes instead of the current CPU usage. (Closes: #94)
+  * Align maintenance mode with Proxmox HA maintenance mode. (Closes: #232)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Thu, 19 Jun 2025 09:10:43 +0000
+
+proxlb (1.1.2) stable; urgency=medium
+
+  * Add a configurable retry mechanism when connecting to the Proxmox API. (Closed: #157)
+  * Add 1-to-1 relationships between guest and hypervisor node to ping a guest on a node. (Closes #218)
+  * Force type cast cpu count of guests to int for some corner cases where a str got returned. (Closed #222)
+  * Fix systemd unit file to run after network on non PVE nodes. (Closes #137)
+
+ -- Florian Paul Azim Hoberg <gyptazy@gyptazy.com>  Mon, 13 May 2025 18:12:04 +0000

 proxlb (1.1.1) stable; urgency=medium

--- a/debian/control
+++ b/debian/control
@@ -8,5 +8,5 @@ Build-Depends: debhelper-compat (= 13), dh-python, python3-all, python3-setuptoo
 Package: proxlb
 Architecture: all
 Depends: ${python3:Depends}, ${misc:Depends}, python3-requests, python3-urllib3, python3-proxmoxer, python3-yaml 
-Description: A DRS alike Load Balancer for Proxmox Clusters
- An advanced DRS alike loadbalancer for Proxmox clusters that also supports maintenance modes and affinity/anti-affinity rules.
+Description: An advanced resource scheduler and load balancer for Proxmox clusters
+ An advanced resource scheduler and load balancer for Proxmox clusters that also supports maintenance mode and affinity/anti-affinity rules.
--- a/docs/03_configuration.md
+++ b/docs/03_configuration.md
@@ -19,7 +19,11 @@
    6. [Parallel Migrations](#parallel-migrations)
    7. [Run as a Systemd-Service](#run-as-a-systemd-service)
    8. [SSL Self-Signed Certificates](#ssl-self-signed-certificates)
-    9. [Dynamic Power Management (DPM)](#dynamic-power-management)
+    9. [Node Maintenances](#node-maintenances)
+    10. [Balancing Methods](#balancing-methods)
+        1. [Used Resources](#used-resources)
+        2. [Assigned Resources](#assigned-resources)
+        3. [Pressure (PSI) based Resources](#pressure-psi-based-resources)

 ## Authentication / User Accounts / Permissions
 ### Authentication
@@ -76,8 +80,8 @@ ProxLB provides an advanced mechanism to define affinity and anti-affinity rules

 ProxLB implements affinity and anti-affinity rules through a tag-based system within the Proxmox web interface. Each guest (virtual machine or container) can be assigned specific tags, which then dictate its placement behavior. This method maintains a streamlined and secure approach to managing VM relationships while preserving Proxmox’s inherent permission model.

-#### Affinity Rules
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-affinity-rules.jpg"/> Affinity rules are used to group certain VMs together, ensuring that they run on the same host whenever possible. This can be beneficial for workloads requiring low-latency communication, such as clustered databases or application servers that frequently exchange data.
+#### Affinity Rules by Tags
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-affinity-rules.jpg"/> Affinity rules are used to group certain VMs together, ensuring that they run on the same host whenever possible. This can be beneficial for workloads requiring low-latency communication, such as clustered databases or application servers that frequently exchange data.

 To define an affinity rule which keeps all guests assigned to this tag together on a node, users assign a tag with the prefix `plb_affinity_$TAG`:

@@ -88,8 +92,20 @@ plb_affinity_talos

 As a result, ProxLB will attempt to place all VMs with the `plb_affinity_web` tag on the same host (see also the attached screenshot with the same node).

-#### Anti-Affinity Rules
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-anti-affinity-rules.jpg"/> Conversely, anti-affinity rules ensure that designated VMs do not run on the same physical host. This is particularly useful for high-availability setups, where redundancy is crucial. Ensuring that critical services are distributed across multiple hosts reduces the risk of a single point of failure.
+#### Affinity Rules by Pools
+Antoher approach is by using pools in Proxmox. This way, it can easily also combined with other resources like backup jobs. However, in this approach you need to modify the ProxLB config file to your needs. Within the `balancing` section you can create a dict of pools, including the pool name and the affinity type. Please see the example for further details:
+
+**Example Config**
+```
+balancing:
+  [...]
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    dev:                              # Pool name: dev
+      type: affinity                  # Type: affinity (keeping VMs together)
+```
+
+#### Anti-Affinity Rules by Tags
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-anti-affinity-rules.jpg"/> Conversely, anti-affinity rules ensure that designated VMs do not run on the same physical host. This is particularly useful for high-availability setups, where redundancy is crucial. Ensuring that critical services are distributed across multiple hosts reduces the risk of a single point of failure.

 To define an anti-affinity rule that ensures to not move systems within this group to the same node, users assign a tag with the prefix:

@@ -102,6 +118,18 @@ As a result, ProxLB will try to place the VMs with the `plb_anti_affinity_ntp` t

 **Note:** While this ensures that ProxLB tries distribute these VMs across different physical hosts within the Proxmox cluster this may not always work. If you have more guests attached to the group than nodes in the cluster, we still need to run them anywhere. If this case occurs, the next one with the most free resources will be selected.

+#### Anti-Affinity Rules by Pools
+Antoher approach is by using pools in Proxmox. This way, it can easily also combined with other resources like backup jobs. However, in this approach you need to modify the ProxLB config file to your needs. Within the `balancing` section you can create a dict of pools, including the pool name and the affinity type. Please see the example for further details:
+
+**Example Config**
+```
+balancing:
+  [...]
+  pools:                              # Optional: Define affinity/anti-affinity rules per pool
+    de-nbg01-db:                      # Pool name: de-nbg01-db
+      type: anti-affinity                  # Type: anti-affinity (spreading VMs apart)
+````
+
 ### Affinity / Anti-Affinity Enforcing
 When a cluster is already balanced and does not require further adjustments, enabling the enforce_affinity parameter ensures that affinity and anti-affinity rules are still respected. This parameter prioritizes the placement of guest objects according to these rules, even if it leads to slight resource imbalances or increased migration overhead. Regularly reviewing and updating these rules, along with monitoring cluster performance, helps maintain optimal performance and reliability. By carefully managing these aspects, you can create a cluster environment that meets your specific needs and maintains a good balance of resources.

@@ -113,7 +141,7 @@ balancing:
 *Note: This may have impacts to the cluster. Depending on the created group matrix, the result may also be an unbalanced cluster.*

 ### Ignore VMs / CTs
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-ignore-vm-movement.jpg"/> Guests, such as VMs or CTs, can also be completely ignored. This means, they won't be affected by any migration (even when (anti-)affinity rules are enforced). To ensure a proper resource evaluation, these guests are still collected and evaluated but simply skipped for balancing actions. Another thing is the implementation. While ProxLB might have a very restricted configuration file including the file permissions, this file is only read- and writeable by the Proxmox administrators. However, we might have user and groups who want to define on their own that their systems shouldn't be moved. Therefore, these users can simpy set a specific tag to the guest object - just like the (anti)affinity rules.
+<img align="left" src="https://cdn.gyptazy.com/img/proxlb-ignore-vm-movement.jpg"/> Guests, such as VMs or CTs, can also be completely ignored. This means, they won't be affected by any migration (even when (anti-)affinity rules are enforced). To ensure a proper resource evaluation, these guests are still collected and evaluated but simply skipped for balancing actions. Another thing is the implementation. While ProxLB might have a very restricted configuration file including the file permissions, this file is only read- and writeable by the Proxmox administrators. However, we might have user and groups who want to define on their own that their systems shouldn't be moved. Therefore, these users can simpy set a specific tag to the guest object - just like the (anti)affinity rules.

 To define a guest to be ignored from the balancing, users assign a tag with the prefix `plb_ignore_$TAG`:

@@ -138,14 +166,20 @@ plb_pin_node03

 As a result, ProxLB will pin the guest `dev-vm01` to the node `virt03`.

+You can also repeat this step multiple times for different node names to create a potential group of allowed hosts where a the guest may be served on. In this case, ProxLB takes the node with the lowest used resources according to the defined balancing values from this group.
+
 **Note:** The given node names from the tag are validated. This means, ProxLB validated if the given node name is really part of the cluster. In case of a wrongly defined or unavailable node name it continous to use the regular processes to make sure the guest keeps running.

 ### API Loadbalancing
-ProxLB supports API loadbalancing, where one or more host objects can be defined as a list. This ensures, that you can even operator ProxLB without further changes when one or more nodes are offline or in a maintenance. When defining multiple hosts, the first reachable one will be picked.
+ProxLB supports API loadbalancing, where one or more host objects can be defined as a list. This ensures, that you can even operator ProxLB without further changes when one or more nodes are offline or in a maintenance. When defining multiple hosts, the first reachable one will be picked. You can speficy custom ports in the list. There are 4 ways of defining hosts with ports:
+1. Hostname of IPv4 without port (in this case the default 8006 will be used)
+2. Hostname or IPv4 with port
+3. IPv6 in brackets with optional port
+4. IPv6 without brackets, in this case the port is assumed after last colon

 ```
 proxmox_api:
-  hosts: ['virt01.example.com', '10.10.10.10', 'fe01::bad:code::cafe']
+  hosts: ['virt01.example.com', '10.10.10.10', 'fe01::bad:code::cafe', 'virt01.example.com:443', '[fc00::1]', '[fc00::1]:443', 'fc00::1:8006']
 ```

 ### Ignore Host-Nodes or Guests
@@ -210,32 +244,134 @@ proxmox_api:

 *Note: Disabling SSL certificate validation is not recommended.*

-### Dynamic Power Management (DPM)
-<img align="left" src="https://cdn.gyptazy.com/images/proxlb-proxmox-node-wakeonlan-wol-mac-dpm.jpg"/> Configuring Dynamic Power Management (DPM) in ProxLB within a Proxmox cluster involves a few critical steps to ensure proper operation. The first consideration is that any node intended for automatic shutdown and startup must support Wake-on-LAN (WOL). This is essential because DPM relies on the ability to power nodes back on remotely. For this to work, the ProxLB instance must be able to reach the target node’s MAC address directly over the network.
+### Node Maintenances
+To exclude specific nodes from receiving any new workloads during the balancing process, the `maintenance_nodes` configuration option can be used. This option allows administrators to define a list of nodes that are currently undergoing maintenance or should otherwise not be used for running virtual machines or containers.

-To make this possible, you must configure the correct MAC address for WOL within the Proxmox web interface. This is done by selecting the node, going to the “System” section, then “Options,” and finally setting the “MAC address for Wake-on-LAN.” Alternatively, this value can also be submitted using the Proxmox API. Without this MAC address in place, ProxLB will not allow the node to be shut down. This restriction is in place to prevent nodes from being turned off without a way to bring them back online, which could lead to service disruption. By ensuring that each node has a valid WOL MAC address configured, DPM can operate safely and effectively, allowing ProxLB to manage the cluster’s power consumption dynamically.
+```yaml
+maintenance_nodes:
+  - virt66.example.com
+```

-#### Requirements
-Using the powermanagement feature within clusters comes along with several requirements:
-* ProxLB needs to reach the WOL-Mac address of the node (plain network)
-* WOL must be enabled of the node in general (BIOS/UEFI)
-* The related WOL network interface must be defined
-* The related WOL network interface MAC address must be defined in Proxmox for the node
+which can also be written as:

-#### Options
-| Section | Option | Sub Option | Example | Type | Description |
-|---------|:------:|:----------:|:-------:|:----:|:-----------:|
-| `dpm` |  |  |  |  |  |
-|  | enable |  | True | `Bool` | Enables the Dynamic Power Management functions.|
-|  | method |  | memory | `Str` | The balancing method that should be used.  [values: `memory` (default), `cpu`, `disk`]|
-|  | mode |  | static | `Str` | The balancing mode that should be used. [values: `static` (default), `auto`] |
-|  | cluster_min_free_resources |  | 60 | `Int` | Representing the minimum required free resouzrces in percent within the cluster. [values: `60`% (default)] |
-|  | cluster_min_nodes |  | 3 | `Int` | The minimum of required nodes that should remain in a cluster. [values: `3` (default)] |
+```yaml
+maintenance_nodes: ['virt66.example.com']
+```

-#### DPM Modes
-##### Static
-Static mode in DPM lets you set a fixed number of nodes that should always stay powered on in a Proxmox cluster. This is important to keep the cluster working properly, since you need at least three nodes to maintain quorum. The system won’t let you go below that limit to avoid breaking cluster functionality.
+The maintenance_nodes key must be defined as a list, even if it only includes a single node. Each entry in the list must exactly match the node name as it is known within the Proxmox VE cluster. Do not use IP addresses, alternative DNS names, or aliases—only the actual cluster node names are valid. Once a node is marked as being in maintenance mode:

-Besides the minimum number of active nodes, you can also define a baseline for how many free resources—like CPU or RAM—should always be available when the virtual machines are running. If the available resources drop below that level, ProxLB will try to power on more nodes, as long as they're available and can be started. On the other hand, if the cluster has more than enough resources, ProxLB will begin to shut down nodes again, but only until the free resource threshold is reached.
+* No new workloads will be balanced or migrated onto it.
+* Any existing workloads currently running on the node will be migrated away in accordance with the configured balancing strategies, assuming resources on other nodes allow. 

-This mode gives you a more stable setup by always keeping a minimum number of nodes ready while still adjusting the rest of the cluster based on resource usage, but in a controlled and predictable way.
+This feature is particularly useful during planned maintenance, upgrades, or troubleshooting, ensuring that services continue to run with minimal disruption while the specified node is being worked on.
+
+## 10. Balancing Methods
+ProxLB provides multiple balancing modes that define *how* resources are evaluated and compared during cluster balancing.
+Each mode reflects a different strategy for determining load and distributing guests (VMs or containers) between nodes.
+
+Depending on your environment, provisioning strategy, and performance goals, you can choose between:
+
+| Mode | Description | Typical Use Case |
+|------|--------------|------------------|
+| `used` | Uses the *actual runtime resource usage* (e.g. CPU, memory, disk). | Dynamic or lab environments with frequent workload changes and tolerance for overprovisioning. |
+| `assigned` | Uses the *statically defined resource allocations* from guest configurations. | Production or SLA-driven clusters that require guaranteed resources and predictable performance. |
+| `psi` | Uses Linux *Pressure Stall Information (PSI)* metrics to evaluate real system contention and pressure. | Advanced clusters that require pressure-aware decisions for proactive rebalancing. |
+
+### 10.1 Used Resources
+When **mode: `used`** is configured, ProxLB evaluates the *real usage metrics* of guest objects (VMs and CTs).
+It collects the current CPU, memory, and disk usage directly from the Proxmox API to determine the *actual consumption* of each guest and node.
+
+This mode is ideal for **dynamic environments** where workloads frequently change and **overprovisioning is acceptable**. It provides the most reactive balancing behavior, since decisions are based on live usage instead of static assignment.
+
+Typical scenarios include:
+- Production environments to distribute workloads across the nodes.
+- Test or development clusters with frequent VM changes.
+- Clusters where resource spikes are short-lived.
+- Environments where slight resource contention is tolerable.
+
+#### Example Configuration
+```yaml
+balancing:
+  mode: used
+```
+
+### 10.2 Assigned Resources
+When **mode: `assigned`** is configured, ProxLB evaluates the *provisioned or allocated resources* of each guest (VM or CT) instead of their runtime usage.
+It uses data such as **CPU cores**, **memory limits**, and **disk allocations** defined in Proxmox to calculate how much of each node’s capacity is reserved.
+
+This mode is ideal for **production clusters** where:
+- Overcommitment is *not allowed or only minimally tolerated*.
+- Each node’s workload is planned based on the assigned capacities.
+- Administrators want predictable resource distribution aligned with provisioning policies.
+
+Unlike the `used` mode, `assigned` focuses purely on the *declared configuration* of guests and remains stable even if actual usage varies temporarily.
+
+Typical scenarios include:
+- Enterprise environments with SLA or QoS requirements.
+- Clusters where workloads are sized deterministically.
+- Situations where consistent node utilization and capacity awareness are crucial.
+
+#### Example Configuration
+```yaml
+balancing:
+  mode: assigned
+```
+
+### 10.3 Pressure (PSI) based Resources
+> [!IMPORTANT]
+> PSI based balancing is still in beta! If you find any bugs, please raise an issue including metrics of all nodes and affected guests. You can provide metrics directly from PVE or Grafana (via node_exporter or pve_exporter).
+
+When **mode: `psi`** is configured, ProxLB uses the **Linux Pressure Stall Information (PSI)** interface to measure the *real-time pressure* on system resources such as **CPU**, **memory**, and **disk I/O**.
+Unlike the `used` or `assigned` modes, which rely on static or average metrics, PSI provides *direct insight into how often and how long tasks are stalled* because of insufficient resources.
+
+This enables ProxLB to make **proactive balancing decisions** — moving workloads *before* performance degradation becomes visible to the user.
+
+**IMPORTANT**: Predicting distributing workloads is dangerous and might not result into the expected state. Therefore, ProxLB migrates only a single instance each 60 minutes to obtain new real-metrics and to validate if further changes are required. Keep in mind, that migrations are also costly and should be avoided as much as possible.
+
+PSI metrics are available for both **nodes** and **guest objects**, allowing fine-grained balancing decisions:
+- **Node-level PSI:** Detects cluster nodes under systemic load or contention.
+- **Guest-level PSI:** Identifies individual guests suffering from memory, CPU, or I/O stalls.
+
+### PSI Metrics Explained
+Each monitored resource defines three pressure thresholds:
+| Key | Description |
+|-----|--------------|
+| `pressure_some` | Indicates partial stall conditions where some tasks are waiting for a resource. |
+| `pressure_full` | Represents complete stall conditions where *all* tasks are blocked waiting for a resource. |
+| `pressure_spikes` | Defines short-term burst conditions that may signal saturation spikes. |
+
+These thresholds are expressed in **percentages** and represent how much time the kernel reports stalls over specific averaging windows (e.g. 5s, 10s, 60s).
+
+### Example Configuration
+
+```yaml
+balancing:
+  mode: psi
+  psi:
+    nodes:
+      memory:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+      cpu:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+      disk:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+    guests:
+      memory:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+      cpu:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+      disk:
+        pressure_full: 0.20
+        pressure_some: 0.20
+        pressure_spikes: 1.00
+```
--- a/helm/proxlb/Chart.yaml
+++ b/helm/proxlb/Chart.yaml
@@ -0,0 +1,6 @@
+apiVersion: v3
+name: proxlb
+description: A Helm chart for self-hosted ProxLB
+type: application
+version: "1.1.9.1"
+appVersion: "v1.1.9.1"
--- a/helm/proxlb/templates/_helpers.yaml
+++ b/helm/proxlb/templates/_helpers.yaml
@@ -0,0 +1,13 @@
+{{- define "proxlb.fullname" -}}
+{{- printf "%s-%s" .Release.Name .Chart.Name | trunc 63 | trimSuffix "-" -}}
+{{- end -}}
+
+{{ define "proxlb.labels" }}
+app.kubernetes.io/name: {{ .Release.Name }}
+app.kubernetes.io/managed-by: Helm
+app.kubernetes.io/version: {{ .Chart.AppVersion }}
+app.kubernetes.io/component: proxlb
+{{- if .Values.labels }}
+{{ toYaml .Values.labels }}
+{{- end }}
+{{ end }}
--- a/helm/proxlb/templates/configmap.yaml
+++ b/helm/proxlb/templates/configmap.yaml
@@ -0,0 +1,11 @@
+{{- if .Values.configmap.create }}
+apiVersion: v1
+kind: ConfigMap
+metadata:
+  name: proxlb-config
+  labels:
+    {{- include "proxlb.labels" . | nindent 4 }}
+data:
+  proxlb.yaml: |
+{{ toYaml .Values.configmap.config | indent 4 }}
+{{ end }}
--- a/helm/proxlb/templates/deployment.yaml
+++ b/helm/proxlb/templates/deployment.yaml
@@ -0,0 +1,44 @@
+apiVersion: apps/v1
+kind: Deployment
+metadata:
+  name: {{ .Release.Name }}
+  labels:
+    {{- include "proxlb.labels" . | nindent 4 }}
+spec:
+  replicas: 1 # Number of replicas cannot be more than 1
+  selector:
+    matchLabels:
+      {{- include "proxlb.labels" . | nindent 6 }}
+  template:
+    metadata:
+      labels:
+        {{- include "proxlb.labels" . | nindent 8 }}
+    spec:
+      {{- with .Values.image.imagePullSecrets }}
+      imagePullSecrets:
+        {{- toYaml . | nindent 8 }}
+      {{- end }}
+      # not interacting with the k8s cluster
+      automountServiceAccountToken: False
+      containers:
+      - name: proxlb
+        image: "{{ .Values.image.registry }}/{{ .Values.image.repository }}:{{ .Values.image.tag }}"
+        imagePullPolicy: {{ .Values.image.pullPolicy }}
+        args:
+        {{- if .Values.extraArgs.dryRun }}
+          - --dry-run
+        {{- end }}
+        volumeMounts:
+          - name: config
+            mountPath: /etc/proxlb/proxlb.yaml
+            subPath: proxlb.yaml
+        {{ if .Values.resources }}
+        resources:
+          {{ with .Values.resources }}
+            {{ toYaml . | nindent 10 }}
+          {{ end }}
+        {{ end }}
+      volumes:
+        - name: config
+          configMap:
+            name: proxlb-config
--- a/helm/proxlb/values.yaml
+++ b/helm/proxlb/values.yaml
@@ -0,0 +1,61 @@
+image:
+  registry: cr.gyptazy.com
+  repository: proxlb/proxlb
+  tag: v1.1.9.1
+  pullPolicy: IfNotPresent
+  imagePullSecrets: [ ]
+
+resources:
+  limits:
+    cpu: "1000m"
+    memory: "2Gi"
+  requests:
+    cpu: "100m"
+    memory: "100Mi"
+
+labels: {}
+
+extraArgs:
+  dryRun: false
+
+configmap:
+  create: true
+  config:
+    proxmox_api:
+      hosts: []
+#Can be either a user or a token
+#      user: ""
+#      pass: ""
+#      token_id: ""
+#      token_secret: ""
+      ssl_verification: True
+      timeout: 10
+    proxmox_cluster:
+      maintenance_nodes: [ ]
+      ignore_nodes: [ ]
+      overprovisioning: True
+    balancing:
+      enable: True
+      enforce_affinity: False
+      parallel: False
+      # If running parallel job, you can define
+      # the amount of prallel jobs (default: 5)
+      parallel_jobs: 1
+      live: True
+      with_local_disks: True
+      with_conntrack_state: True
+      balance_types: [ 'vm', 'ct' ]
+      max_job_validation: 1800
+      balanciness: 5
+      method: memory
+      mode: used
+    service:
+      daemon: True
+      schedule:
+        interval: 12
+        format: "hours"
+      delay:
+        enable: False
+        time: 1
+        format: "hours"
+      log_level: INFO
--- a/misc/01-replace-version.sh
+++ b/misc/01-replace-version.sh
@@ -1,6 +1,13 @@
 #!/usr/bin/env bash
-VERSION="1.1.2b"
+VERSION="1.1.9.1"

+# ProxLB
 sed -i "s/^__version__ = .*/__version__ = \"$VERSION\"/" "proxlb/utils/version.py"
 sed -i "s/version=\"[0-9]*\.[0-9]*\.[0-9]*\"/version=\"$VERSION\"/" setup.py
+
+# Helm Chart
+sed -i "s/^version: .*/version: \"$VERSION\"/" helm/proxlb/Chart.yaml
+sed -i "s/^appVersion: .*/appVersion: \"v$VERSION\"/" helm/proxlb/Chart.yaml
+sed -i "s/^tag: .*/tag: \"v$VERSION\"/" helm/proxlb/values.yaml
+
 echo "OK: Versions have been sucessfully set to $VERSION"
--- a/proxlb/main.py
+++ b/proxlb/main.py
@@ -13,16 +13,17 @@ __license__ = "GPL-3.0"


 import logging
+import signal
 from utils.logger import SystemdLogger
 from utils.cli_parser import CliParser
 from utils.config_parser import ConfigParser
 from utils.proxmox_api import ProxmoxApi
-from models.dpm import DPM
 from models.nodes import Nodes
 from models.guests import Guests
 from models.groups import Groups
 from models.calculations import Calculations
 from models.balancing import Balancing
+from models.pools import Pools
 from utils.helper import Helper


@@ -33,6 +34,10 @@ def main():
    # Initialize logging handler
    logger = SystemdLogger(level=logging.INFO)

+    # Initialize handlers
+    signal.signal(signal.SIGHUP, Helper.handler_sighup)
+    signal.signal(signal.SIGINT, Helper.handler_sigint)
+
    # Parses arguments passed from the CLI
    cli_parser = CliParser()
    cli_args = cli_parser.parse_args()
@@ -45,6 +50,9 @@ def main():
    # Update log level from config and fallback to INFO if not defined
    logger.set_log_level(proxlb_config.get('service', {}).get('log_level', 'INFO'))

+    # Validate of an optional service delay
+    Helper.get_service_delay(proxlb_config)
+
    # Connect to Proxmox API & create API object
    proxmox_api = ProxmoxApi(proxlb_config)

@@ -52,35 +60,41 @@ def main():
    proxlb_config["proxmox_api"]["pass"] = "********"

    while True:
+
+        # Validate if reload signal was sent during runtime
+        # and reload the ProxLB configuration and adjust log level
+        if Helper.proxlb_reload:
+            logger.info("Reloading ProxLB configuration.")
+            proxlb_config = config_parser.get_config()
+            logger.set_log_level(proxlb_config.get('service', {}).get('log_level', 'INFO'))
+            Helper.proxlb_reload = False
+
        # Get all required objects from the Proxmox cluster
        meta = {"meta": proxlb_config}
-        nodes, cluster = Nodes.get_nodes(proxmox_api, proxlb_config)
-        guests = Guests.get_guests(proxmox_api, nodes, meta)
+        nodes = Nodes.get_nodes(proxmox_api, proxlb_config)
+        pools = Pools.get_pools(proxmox_api)
+        guests = Guests.get_guests(proxmox_api, pools, nodes, meta, proxlb_config)
        groups = Groups.get_groups(guests, nodes)

        # Merge obtained objects from the Proxmox cluster for further usage
-        proxlb_data = {**meta, **cluster, **nodes, **guests, **groups}
+        proxlb_data = {**meta, **nodes, **guests, **pools, **groups}
        Helper.log_node_metrics(proxlb_data)

-        # Evaluate the dynamic power management for nodes in the clustet
-        DPM(proxlb_data)
-
        # Update the initial node resource assignments
        # by the previously created groups.
        Calculations.set_node_assignments(proxlb_data)
+        Calculations.set_node_hot(proxlb_data)
+        Calculations.set_guest_hot(proxlb_data)
        Calculations.get_most_free_node(proxlb_data, cli_args.best_node)
        Calculations.relocate_guests_on_maintenance_nodes(proxlb_data)
        Calculations.get_balanciness(proxlb_data)
        Calculations.relocate_guests(proxlb_data)
        Helper.log_node_metrics(proxlb_data, init=False)

-        # Perform balancing
-        if not cli_args.dry_run or not proxlb_data["meta"]["balancing"].get("enable", False):
-            Balancing(proxmox_api, proxlb_data)
-
-        # Perform DPM
-        if not cli_args.dry_run:
-            DPM.dpm_shutdown_nodes(proxmox_api, proxlb_data)
+        # Perform balancing actions via Proxmox API
+        if proxlb_data["meta"]["balancing"].get("enable", False):
+            if not cli_args.dry_run:
+                Balancing(proxmox_api, proxlb_data)

        # Validate if the JSON output should be
        # printed to stdout
--- a/proxlb/models/balancing.py
+++ b/proxlb/models/balancing.py
@@ -12,6 +12,7 @@ __license__ = "GPL-3.0"

 import proxmoxer
 import time
+from itertools import islice
 from utils.logger import SystemdLogger
 from typing import Dict, Any

@@ -48,34 +49,81 @@ class Balancing:
        Initializes the Balancing class with the provided ProxLB data.

        Args:
-            proxlb_data (dict): The data required for balancing VMs and CTs.
+            proxmox_api (object): The Proxmox API client instance used to interact with the Proxmox cluster.
+            proxlb_data (dict): A dictionary containing data related to the ProxLB load balancing configuration.
        """
-        for guest_name, guest_meta in proxlb_data["guests"].items():
+        def chunk_dict(data, size):
+            """
+            Splits a dictionary into chunks of a specified size.
+            Args:
+                data (dict): The dictionary to be split into chunks.
+                size (int): The size of each chunk.
+            Yields:
+                dict: A chunk of the original dictionary with the specified size.
+            """
+            logger.debug("Starting: chunk_dict.")
+            it = iter(data.items())
+            for chunk in range(0, len(data), size):
+                yield dict(islice(it, size))

-            # Check if the guest's target is not the same as the current node
-            if guest_meta["node_current"] != guest_meta["node_target"]:
-                # Check if the guest is not ignored and perform the balancing
-                # operation based on the guest type
-                if not guest_meta["ignore"]:
-                    guest_id = guest_meta["id"]
-                    guest_node_current = guest_meta["node_current"]
-                    guest_node_target = guest_meta["node_target"]
+        # Validate if balancing should be performed in parallel or sequentially.
+        # If parallel balancing is enabled, set the number of parallel jobs.
+        parallel_jobs = proxlb_data["meta"]["balancing"].get("parallel_jobs", 5)
+        if not proxlb_data["meta"]["balancing"].get("parallel", False):
+            parallel_jobs = 1
+            logger.debug("Balancing: Parallel balancing is disabled. Running sequentially.")
+        else:
+            logger.debug(f"Balancing: Parallel balancing is enabled. Running with {parallel_jobs} parallel jobs.")

-                    # VM Balancing
-                    if guest_meta["type"] == "vm":
-                        self.exec_rebalancing_vm(proxmox_api, proxlb_data, guest_name)
+        for chunk in chunk_dict(proxlb_data["guests"], parallel_jobs):
+            jobs_to_wait = []

-                    # CT Balancing
-                    elif guest_meta["type"] == "ct":
-                        self.exec_rebalancing_ct(proxmox_api, proxlb_data, guest_name)
+            for guest_name, guest_meta in chunk.items():
+
+                # Check if the guest's target is not the same as the current node
+                if guest_meta["node_current"] != guest_meta["node_target"]:
+
+                    # Check if the guest is not ignored and perform the balancing
+                    # operation based on the guest type
+                    if not guest_meta["ignore"]:
+                        job_id = None
+
+                        # VM Balancing
+                        if guest_meta["type"] == "vm":
+                            if 'vm' in proxlb_data["meta"]["balancing"].get("balance_types", []):
+                                logger.debug(f"Balancing: Balancing for guest {guest_name} of type VM started.")
+                                job_id = self.exec_rebalancing_vm(proxmox_api, proxlb_data, guest_name)
+                            else:
+                                logger.debug(
+                                    f"Balancing: Balancing for guest {guest_name} will not be performed. "
+                                    "Guest is of type VM which is not included in allowed balancing types.")
+
+                        # CT Balancing
+                        elif guest_meta["type"] == "ct":
+                            if 'ct' in proxlb_data["meta"]["balancing"].get("balance_types", []):
+                                logger.debug(f"Balancing: Balancing for guest {guest_name} of type CT started.")
+                                job_id = self.exec_rebalancing_ct(proxmox_api, proxlb_data, guest_name)
+                            else:
+                                logger.debug(
+                                    f"Balancing: Balancing for guest {guest_name} will not be performed. "
+                                    "Guest is of type CT which is not included in allowed balancing types.")
+
+                        # Just in case we get a new type of guest in the future
+                        else:
+                            logger.critical(f"Balancing: Got unexpected guest type: {guest_meta['type']}. Cannot proceed guest: {guest_meta['name']}.")
+
+                        if job_id:
+                            jobs_to_wait.append((guest_name, guest_meta["node_current"], job_id))

-                    # Just in case we get a new type of guest in the future
                    else:
-                        logger.critical(f"Balancing: Got unexpected guest type: {guest_meta['type']}. Cannot proceed guest: {guest_meta['name']}.")
+                        logger.debug(f"Balancing: Guest {guest_name} is ignored and will not be rebalanced.")
                else:
-                    logger.debug(f"Balancing: Guest {guest_name} is ignored and will not be rebalanced.")
-            else:
-                logger.debug(f"Balancing: Guest {guest_name} is already on the target node {guest_meta['node_target']} and will not be rebalanced.")
+                    logger.debug(f"Balancing: Guest {guest_name} is already on the target node {guest_meta['node_target']} and will not be rebalanced.")
+
+            # Wait for all jobs in the current chunk to complete
+            for guest_name, node, job_id in jobs_to_wait:
+                if job_id:
+                    self.get_rebalancing_job_status(proxmox_api, proxlb_data, guest_name, node, job_id)

    def exec_rebalancing_vm(self, proxmox_api: any, proxlb_data: Dict[str, Any], guest_name: str) -> None:
        """
@@ -96,6 +144,7 @@ class Balancing:
        guest_id = proxlb_data["guests"][guest_name]["id"]
        guest_node_current = proxlb_data["guests"][guest_name]["node_current"]
        guest_node_target = proxlb_data["guests"][guest_name]["node_target"]
+        job_id = None

        if proxlb_data["meta"]["balancing"].get("live", True):
            online_migration = 1
@@ -108,19 +157,25 @@ class Balancing:
            with_local_disks = 0

        migration_options = {
-            'target': {guest_node_target},
+            'target': guest_node_target,
            'online': online_migration,
-            'with-local-disks': with_local_disks
+            'with-local-disks': with_local_disks,
        }

+        # Conntrack state aware migrations are not supported in older
+        # PVE versions, so we should not add it by default.
+        if proxlb_data["meta"]["balancing"].get("with_conntrack_state", True):
+            migration_options['with-conntrack-state'] = 1
+
        try:
-            logger.debug(f"Balancing: Starting to migrate guest {guest_name} of type VM.")
+            logger.info(f"Balancing: Starting to migrate VM guest {guest_name} from {guest_node_current} to {guest_node_target}.")
            job_id = proxmox_api.nodes(guest_node_current).qemu(guest_id).migrate().post(**migration_options)
-            self.get_rebalancing_job_status(proxmox_api, proxlb_data, guest_name, guest_node_current, job_id)
        except proxmoxer.core.ResourceException as proxmox_api_error:
            logger.critical(f"Balancing: Failed to migrate guest {guest_name} of type VM due to some Proxmox errors. Please check if resource is locked or similar.")
            logger.debug(f"Balancing: Failed to migrate guest {guest_name} of type VM due to some Proxmox errors: {proxmox_api_error}")
+
        logger.debug("Finished: exec_rebalancing_vm.")
+        return job_id

    def exec_rebalancing_ct(self, proxmox_api: any, proxlb_data: Dict[str, Any], guest_name: str) -> None:
        """
@@ -141,15 +196,17 @@ class Balancing:
        guest_id = proxlb_data["guests"][guest_name]["id"]
        guest_node_current = proxlb_data["guests"][guest_name]["node_current"]
        guest_node_target = proxlb_data["guests"][guest_name]["node_target"]
+        job_id = None

        try:
-            logger.debug(f"Balancing: Starting to migrate guest {guest_name} of type CT.")
+            logger.info(f"Balancing: Starting to migrate CT guest {guest_name} from {guest_node_current} to {guest_node_target}.")
            job_id = proxmox_api.nodes(guest_node_current).lxc(guest_id).migrate().post(target=guest_node_target, restart=1)
-            self.get_rebalancing_job_status(proxmox_api, proxlb_data, guest_name, guest_node_current, job_id)
        except proxmoxer.core.ResourceException as proxmox_api_error:
            logger.critical(f"Balancing: Failed to migrate guest {guest_name} of type CT due to some Proxmox errors. Please check if resource is locked or similar.")
            logger.debug(f"Balancing: Failed to migrate guest {guest_name} of type CT due to some Proxmox errors: {proxmox_api_error}")
+
        logger.debug("Finished: exec_rebalancing_ct.")
+        return job_id

    def get_rebalancing_job_status(self, proxmox_api: any, proxlb_data: Dict[str, Any], guest_name: str, guest_current_node: str, job_id: int, retry_counter: int = 1) -> bool:
        """
@@ -167,35 +224,32 @@ class Balancing:
            bool: True if the job completed successfully, False otherwise.
        """
        logger.debug("Starting: get_rebalancing_job_status.")
-        # Parallel migrations can take a huge time and create a higher load, if not defined by an
-        # operator we will use a sequential mode by default
-        if not proxlb_data["meta"]["balancing"].get("parallel", False):
-            job = proxmox_api.nodes(guest_current_node).tasks(job_id).status().get()
+        job = proxmox_api.nodes(guest_current_node).tasks(job_id).status().get()

-            # Watch job id until it finalizes
-            if job["status"] == "running":
-                # Do not hammer the API while
-                # watching the job status
-                time.sleep(10)
-                retry_counter += 1
+        # Watch job id until it finalizes
+        if job["status"] == "running":
+            # Do not hammer the API while
+            # watching the job status
+            time.sleep(10)
+            retry_counter += 1

-                # Run recursion until we hit the soft-limit of maximum migration time for a guest
-                if retry_counter < proxlb_data["meta"]["balancing"].get("max_job_validation", 1800):
-                    logger.debug(f"Balancing: Job ID {job_id} (guest: {guest_name}) for migration is still running... (Run: {retry_counter})")
-                    self.get_rebalancing_job_status(proxmox_api, proxlb_data, guest_name, guest_current_node, job_id, retry_counter)
-                else:
-                    logger.warning(f"Balancing: Job ID {job_id} (guest: {guest_name}) for migration took too long. Please check manually.")
-                    logger.debug("Finished: get_rebalancing_job_status.")
-                    return False
+            # Run recursion until we hit the soft-limit of maximum migration time for a guest
+            if retry_counter < proxlb_data["meta"]["balancing"].get("max_job_validation", 1800):
+                logger.debug(f"Balancing: Job ID {job_id} (guest: {guest_name}) for migration is still running... (Run: {retry_counter})")
+                self.get_rebalancing_job_status(proxmox_api, proxlb_data, guest_name, guest_current_node, job_id, retry_counter)
+            else:
+                logger.warning(f"Balancing: Job ID {job_id} (guest: {guest_name}) for migration took too long. Please check manually.")
+                logger.debug("Finished: get_rebalancing_job_status.")
+                return False

-            # Validate job output for errors when finished
-            if job["status"] == "stopped":
+        # Validate job output for errors when finished
+        if job["status"] == "stopped":

-                if job["exitstatus"] == "OK":
-                    logger.debug(f"Balancing: Job ID {job_id} (guest: {guest_name}) was successfully.")
-                    logger.debug("Finished: get_rebalancing_job_status.")
-                    return True
-                else:
-                    logger.critical(f"Balancing: Job ID {job_id} (guest: {guest_name}) went into an error! Please check manually.")
-                    logger.debug("Finished: get_rebalancing_job_status.")
-                    return False
+            if job["exitstatus"] == "OK":
+                logger.debug(f"Balancing: Job ID {job_id} (guest: {guest_name}) was successfully.")
+                logger.debug("Finished: get_rebalancing_job_status.")
+                return True
+            else:
+                logger.critical(f"Balancing: Job ID {job_id} (guest: {guest_name}) went into an error! Please check manually.")
+                logger.debug("Finished: get_rebalancing_job_status.")
+                return False
--- a/proxlb/models/calculations.py
+++ b/proxlb/models/calculations.py
@@ -80,8 +80,9 @@ class Calculations:

            for guest_name in group_meta["guests"]:
                guest_node_current = proxlb_data["guests"][guest_name]["node_current"]
-                # Update Hardware assignments
+                # Update resource assignments
                # Update assigned values for the current node
+                logger.debug(f"set_node_assignment of guest {guest_name} on node {guest_node_current} with cpu_total: {proxlb_data['guests'][guest_name]['cpu_total']}, memory_total: {proxlb_data['guests'][guest_name]['memory_total']}, disk_total: {proxlb_data['guests'][guest_name]['disk_total']}.")
                proxlb_data["nodes"][guest_node_current]["cpu_assigned"] += proxlb_data["guests"][guest_name]["cpu_total"]
                proxlb_data["nodes"][guest_node_current]["memory_assigned"] += proxlb_data["guests"][guest_name]["memory_total"]
                proxlb_data["nodes"][guest_node_current]["disk_assigned"] += proxlb_data["guests"][guest_name]["disk_total"]
@@ -92,6 +93,83 @@ class Calculations:

        logger.debug("Finished: set_node_assignments.")

+    def set_node_hot(proxlb_data: Dict[str, Any]) -> Dict[str, Any]:
+        """
+        Evaluates node 'full' pressure metrics for memory, cpu, and io
+        against defined thresholds and sets <metric>_pressure_hot = True
+        when a node is considered HOT.
+
+        Returns the modified proxlb_data dict.
+        """
+        logger.debug("Starting: set_node_hot.")
+        balancing_cfg = proxlb_data.get("meta", {}).get("balancing", {})
+        thresholds = balancing_cfg.get("psi_thresholds", balancing_cfg.get("psi", {}).get("nodes", {}))
+        nodes = proxlb_data.get("nodes", {})
+
+        for node_name, node in nodes.items():
+
+            if node.get("maintenance"):
+                continue
+
+            if node.get("ignore"):
+                continue
+
+            # PSI metrics are only availavble on Proxmox VE 9.0 and higher.
+            if proxlb_data["meta"]["balancing"].get("mode", "used") == "psi":
+
+                if tuple(map(int, proxlb_data["nodes"][node["name"]]["pve_version"].split('.'))) < tuple(map(int, "9.0".split('.'))):
+                    logger.critical(f"Proxmox node {node['name']} runs Proxmox VE version {proxlb_data['nodes'][node['name']]['pve_version']}."
+                                    " PSI metrics require Proxmox VE 9.0 or higher. Balancing deactivated!")
+
+            for metric, threshold in thresholds.items():
+                pressure_full = node.get(f"{metric}_pressure_full_percent", 0.0)
+                pressure_some = node.get(f"{metric}_pressure_some_percent", 0.0)
+                pressure_spikes = node.get(f"{metric}_pressure_full_spikes_percent", 0.0)
+                is_hot = (pressure_full >= threshold["pressure_full"] and pressure_some >= threshold["pressure_some"]) or (pressure_spikes >= threshold["pressure_spikes"])
+
+                if is_hot:
+                    logger.debug(f"Set node {node['name']} as hot based on {metric} pressure metrics.")
+                    proxlb_data["nodes"][node["name"]][f"{metric}_pressure_hot"] = True
+                    proxlb_data["nodes"][node["name"]][f"pressure_hot"] = True
+                else:
+                    logger.debug(f"Node {node['name']} is not hot based on {metric} pressure metrics.")
+
+        logger.debug("Finished: set_node_hot.")
+        return proxlb_data
+
+    def set_guest_hot(proxlb_data: Dict[str, Any]) -> Dict[str, Any]:
+        """
+        Evaluates guest 'full' pressure metrics for memory, cpu, and io
+        against defined thresholds and sets <metric>_pressure_hot = True
+        when a guest is considered HOT.
+
+        Returns the modified proxlb_data dict.
+        """
+        logger.debug("Starting: set_guest_hot.")
+        balancing_cfg = proxlb_data.get("meta", {}).get("balancing", {})
+        thresholds = balancing_cfg.get("psi_thresholds", balancing_cfg.get("psi", {}).get("guests", {}))
+        guests = proxlb_data.get("guests", {})
+
+        for guest_name, guest in guests.items():
+            if guest.get("ignore"):
+                continue
+
+            for metric, threshold in thresholds.items():
+                pressure_full = guest.get(f"{metric}_pressure_full_percent", 0.0)
+                pressure_some = guest.get(f"{metric}_pressure_some_percent", 0.0)
+                pressure_spikes = guest.get(f"{metric}_pressure_full_spikes_percent", 0.0)
+                is_hot = (pressure_full >= threshold["pressure_full"] and pressure_some >= threshold["pressure_some"]) or (pressure_spikes >= threshold["pressure_spikes"])
+
+                if is_hot:
+                    logger.debug(f"Set guest {guest['name']} as hot based on {metric} pressure metrics.")
+                    proxlb_data["guests"][guest["name"]][f"{metric}_pressure_hot"] = True
+                    proxlb_data["guests"][guest["name"]][f"pressure_hot"] = True
+                else:
+                    logger.debug(f"guest {guest['name']} is not hot based on {metric} pressure metrics.")
+
+        logger.debug("Finished: set_guest_hot.")
+        return proxlb_data
+
    @staticmethod
    def get_balanciness(proxlb_data: Dict[str, Any]) -> Dict[str, Any]:
        """
@@ -112,7 +190,66 @@ class Calculations:
            method = proxlb_data["meta"]["balancing"].get("method", "memory")
            mode = proxlb_data["meta"]["balancing"].get("mode", "used")
            balanciness = proxlb_data["meta"]["balancing"].get("balanciness", 10)
-            method_value = [node_meta[f"{method}_{mode}_percent"] for node_meta in proxlb_data["nodes"].values()]
+
+            if mode == "assigned":
+                method_value = [node_meta[f"{method}_{mode}_percent"] for node_meta in proxlb_data["nodes"].values()]
+
+                if proxlb_data["meta"]["balancing"].get(f"{method}_threshold", None):
+                    threshold = proxlb_data["meta"]["balancing"].get(f"{method}_threshold")
+                    highest_usage_node = max(proxlb_data["nodes"].values(), key=lambda x: x[f"{method}_{mode}_percent"])
+                    highest_node_value = highest_usage_node[f"{method}_{mode}_percent"]
+
+                    if highest_node_value >= threshold:
+                        logger.debug(f"Guest balancing is required. Highest {method} usage node {highest_usage_node['name']} is above the defined threshold of {threshold}% with a value of {highest_node_value}%.")
+                        proxlb_data["meta"]["balancing"]["balance"] = True
+                    else:
+                        logger.debug(f"Guest balancing is ok. Highest {method} usage node {highest_usage_node['name']} is below the defined threshold of {threshold}% with a value of {highest_node_value}%.")
+                        proxlb_data["meta"]["balancing"]["balance"] = False
+
+                else:
+                    logger.debug(f"No {method} threshold defined for balancing. Skipping threshold check.")
+
+            elif mode == "used":
+                method_value = [node_meta[f"{method}_{mode}_percent"] for node_meta in proxlb_data["nodes"].values()]
+
+                if proxlb_data["meta"]["balancing"].get(f"{method}_threshold", None):
+                    threshold = proxlb_data["meta"]["balancing"].get(f"{method}_threshold")
+                    highest_usage_node = max(proxlb_data["nodes"].values(), key=lambda x: x[f"{method}_{mode}_percent"])
+                    highest_node_value = highest_usage_node[f"{method}_{mode}_percent"]
+
+                    if highest_node_value >= threshold:
+                        logger.debug(f"Guest balancing is required. Highest {method} usage node {highest_usage_node['name']} is above the defined threshold of {threshold}% with a value of {highest_node_value}%.")
+                        proxlb_data["meta"]["balancing"]["balance"] = True
+                    else:
+                        logger.debug(f"Guest balancing is ok. Highest {method} usage node {highest_usage_node['name']} is below the defined threshold of {threshold}% with a value of {highest_node_value}%.")
+                        proxlb_data["meta"]["balancing"]["balance"] = False
+
+                else:
+                    logger.debug(f"No {method} threshold defined for balancing. Skipping threshold check.")
+
+            elif mode == "psi":
+                method_value = [node_meta[f"{method}_pressure_full_spikes_percent"] for node_meta in proxlb_data["nodes"].values()]
+                any_node_hot = any(node.get(f"{method}_pressure_hot", False) for node in proxlb_data["nodes"].values())
+                any_guest_hot = any(node.get(f"{method}_pressure_hot", False) for node in proxlb_data["guests"].values())
+
+                if any_node_hot:
+                    logger.debug(f"Guest balancing is required. A node is marked as HOT based on {method} pressure metrics.")
+                    proxlb_data["meta"]["balancing"]["balance"] = True
+                else:
+                    logger.debug(f"Guest balancing is ok. No node is marked as HOT based on {method} pressure metrics.")
+
+                if any_guest_hot:
+                    logger.debug(f"Guest balancing is required. A guest is marked as HOT based on {method} pressure metrics.")
+                    proxlb_data["meta"]["balancing"]["balance"] = True
+                else:
+                    logger.debug(f"Guest balancing is ok. No guest is marked as HOT based on {method} pressure metrics.")
+
+                return proxlb_data
+
+            else:
+                logger.critical(f"Unknown balancing mode: {mode} provided. Cannot get balanciness.")
+                sys.exit(1)
+
            method_value_highest = max(method_value)
            method_value_lowest = min(method_value)

@@ -128,7 +265,7 @@ class Calculations:
        logger.debug("Finished: get_balanciness.")

    @staticmethod
-    def get_most_free_node(proxlb_data: Dict[str, Any], return_node: bool = False) -> Dict[str, Any]:
+    def get_most_free_node(proxlb_data: Dict[str, Any], return_node: bool = False, guest_node_relation_list: list = []) -> Dict[str, Any]:
        """
        Get the name of the Proxmox node in the cluster with the most free resources based on
        the user defined method (e.g.: memory) and mode (e.g.: used).
@@ -137,6 +274,8 @@ class Calculations:
            proxlb_data (Dict[str, Any]): The data holding all content of all objects.
            return_node (bool): The indicator to simply return the best node for further
                                assignments.
+            guest_node_relation_list (list): A list of nodes that have a tag on the given
+                                             guest relationship for pinning.

        Returns:
            Dict[str, Any]: Updated meta data section of the node with the most free resources that should
@@ -145,11 +284,34 @@ class Calculations:
        logger.debug("Starting: get_most_free_node.")
        proxlb_data["meta"]["balancing"]["balance_next_node"] = ""

-        # Do not include nodes that are marked in 'maintenance'
+        # Filter and exclude nodes that are in maintenance mode
        filtered_nodes = [node for node in proxlb_data["nodes"].values() if not node["maintenance"]]
+
+        # Filter and include nodes that given by a relationship between guest and node. This is only
+        # used if the guest has a relationship to a node defined by "pin" tags.
+        if len(guest_node_relation_list) > 0:
+            filtered_nodes = [node for node in proxlb_data["nodes"].values() if node["name"] in guest_node_relation_list]
+
+        # Filter by the defined methods and modes for balancing
        method = proxlb_data["meta"]["balancing"].get("method", "memory")
        mode = proxlb_data["meta"]["balancing"].get("mode", "used")
-        lowest_usage_node = min(filtered_nodes, key=lambda x: x[f"{method}_{mode}_percent"])
+
+        if mode == "assigned":
+            logger.debug(f"Get best node for balancing by assigned {method} resources.")
+            lowest_usage_node = min(filtered_nodes, key=lambda x: x[f"{method}_{mode}_percent"])
+
+        elif mode == "used":
+            logger.debug(f"Get best node for balancing by used {method} resources.")
+            lowest_usage_node = min(filtered_nodes, key=lambda x: x[f"{method}_{mode}_percent"])
+
+        elif mode == "psi":
+            logger.debug(f"Get best node for balancing by pressure of {method} resources.")
+            lowest_usage_node = min(filtered_nodes, key=lambda x: x[f"{method}_pressure_full_spikes_percent"])
+
+        else:
+            logger.critical(f"Unknown balancing mode: {mode} provided. Cannot get best node.")
+            sys.exit(1)
+
        proxlb_data["meta"]["balancing"]["balance_reason"] = 'resources'
        proxlb_data["meta"]["balancing"]["balance_next_node"] = lowest_usage_node["name"]

@@ -162,7 +324,7 @@ class Calculations:
        logger.debug("Finished: get_most_free_node.")

    @staticmethod
-    def relocate_guests_on_maintenance_nodes(proxlb_data: Dict[str, Any]) -> None:
+    def relocate_guests_on_maintenance_nodes(proxlb_data: Dict[str, Any]):
        """
        Relocates guests that are currently on nodes marked for maintenance to
        nodes with the most available resources.
@@ -178,7 +340,7 @@ class Calculations:
        Returns:
        None
        """
-        logger.debug("Starting: get_most_free_node.")
+        logger.debug("Starting: relocate_guests_on_maintenance_nodes.")
        proxlb_data["meta"]["balancing"]["balance_next_guest"] = ""

        for guest_name in proxlb_data["groups"]["maintenance"]:
@@ -189,10 +351,10 @@ class Calculations:
            Calculations.update_node_resources(proxlb_data)
            logger.warning(f"Warning: Balancing may not be perfect because guest {guest_name} was located on a node which is in maintenance mode.")

-        logger.debug("Finished: get_most_free_node.")
+        logger.debug("Finished: relocate_guests_on_maintenance_nodes.")

    @staticmethod
-    def relocate_guests(proxlb_data: Dict[str, Any]) -> None:
+    def relocate_guests(proxlb_data: Dict[str, Any]):
        """
        Relocates guests within the provided data structure to ensure affinity groups are
        placed on nodes with the most free resources.
@@ -223,15 +385,34 @@ class Calculations:
                Calculations.get_most_free_node(proxlb_data)

                for guest_name in proxlb_data["groups"]["affinity"][group_name]["guests"]:
-                    proxlb_data["meta"]["balancing"]["balance_next_guest"] = guest_name
+                    mode = proxlb_data["meta"]["balancing"].get("mode", "used")
+
+                    if mode == 'psi':
+                        logger.debug(f"Evaluating guest relocation based on {mode} mode.")
+                        method = proxlb_data["meta"]["balancing"].get("method", "memory")
+                        processed_guests_psi = proxlb_data["meta"]["balancing"].setdefault("processed_guests_psi", [])
+                        unprocessed_guests_psi = [guest for guest in proxlb_data["guests"].values() if guest["name"] not in processed_guests_psi]
+
+                        # Filter by the defined methods and modes for balancing
+                        highest_usage_guest = max(unprocessed_guests_psi, key=lambda x: x[f"{method}_pressure_full_spikes_percent"])
+
+                        # Append guest to the psi based processed list of guests
+                        if highest_usage_guest["name"] == guest_name and guest_name not in proxlb_data["meta"]["balancing"]["processed_guests_psi"]:
+                            proxlb_data["meta"]["balancing"]["processed_guests_psi"].append(guest_name)
+                            proxlb_data["meta"]["balancing"]["balance_next_guest"] = guest_name
+
+                    else:
+                        logger.debug(f"Evaluating guest relocation based on {mode} mode.")
+                        proxlb_data["meta"]["balancing"]["balance_next_guest"] = guest_name
+
                    Calculations.val_anti_affinity(proxlb_data, guest_name)
-                    Calculations.val_node_relationship(proxlb_data, guest_name)
+                    Calculations.val_node_relationships(proxlb_data, guest_name)
                    Calculations.update_node_resources(proxlb_data)

        logger.debug("Finished: relocate_guests.")

    @staticmethod
-    def val_anti_affinity(proxlb_data: Dict[str, Any], guest_name: str) -> None:
+    def val_anti_affinity(proxlb_data: Dict[str, Any], guest_name: str):
        """
        Validates and assigns nodes to guests based on anti-affinity rules.

@@ -256,23 +437,28 @@ class Calculations:
            if guest_name in proxlb_data["groups"]["anti_affinity"][group_name]['guests'] and not proxlb_data["guests"][guest_name]["processed"]:
                logger.debug(f"Anti-Affinity: Guest: {guest_name} is included in anti-affinity group: {group_name}.")

-                # Iterate over all available nodes
-                for node_name in proxlb_data["nodes"].keys():
+                # Check if the group has only one member. If so skip new guest node assignment.
+                if proxlb_data["groups"]["anti_affinity"][group_name]["counter"] > 1:
+                    logger.debug(f"Anti-Affinity: Group has more than 1 member.")
+                    # Iterate over all available nodes
+                    for node_name in proxlb_data["nodes"].keys():

-                    # Only select node if it was not used before and is not in a
-                    # maintenance mode. Afterwards, add it to the list of already
-                    # used nodes for the current anti-affinity group
-                    if node_name not in proxlb_data["groups"]["anti_affinity"][group_name]["used_nodes"]:
+                        # Only select node if it was not used before and is not in a
+                        # maintenance mode. Afterwards, add it to the list of already
+                        # used nodes for the current anti-affinity group
+                        if node_name not in proxlb_data["groups"]["anti_affinity"][group_name]["used_nodes"]:

-                        if not proxlb_data["nodes"][node_name]["maintenance"]:
-                            # If the node has not been used yet, we assign this node to the guest
-                            proxlb_data["meta"]["balancing"]["balance_next_node"] = node_name
-                            proxlb_data["groups"]["anti_affinity"][group_name]["used_nodes"].append(node_name)
-                            logger.debug(f"Node: {node_name} marked as used for anti-affinity group: {group_name} with guest {guest_name}")
-                            break
+                            if not proxlb_data["nodes"][node_name]["maintenance"]:
+                                # If the node has not been used yet, we assign this node to the guest
+                                proxlb_data["meta"]["balancing"]["balance_next_node"] = node_name
+                                proxlb_data["groups"]["anti_affinity"][group_name]["used_nodes"].append(node_name)
+                                logger.debug(f"Node: {node_name} marked as used for anti-affinity group: {group_name} with guest {guest_name}")
+                                break

-                    else:
-                        logger.critical(f"Node: {node_name} already got used for anti-affinity group:: {group_name}. (Tried for guest: {guest_name})")
+                        else:
+                            logger.critical(f"Node: {node_name} already got used for anti-affinity group:: {group_name}. (Tried for guest: {guest_name})")
+                else:
+                    logger.debug(f"Anti-Affinity: Group has less than 2 members. Skipping node calculation for the group.")

            else:
                logger.debug(f"Guest: {guest_name} is not included in anti-affinity group: {group_name}. Skipping.")
@@ -280,7 +466,7 @@ class Calculations:
        logger.debug("Finished: val_anti_affinity.")

    @staticmethod
-    def val_node_relationship(proxlb_data: Dict[str, Any], guest_name: str) -> None:
+    def val_node_relationships(proxlb_data: Dict[str, Any], guest_name: str):
        """
        Validates and assigns guests to nodes based on defined relationships based on tags.

@@ -291,27 +477,29 @@ class Calculations:
        Returns:
        None
        """
-        logger.debug("Starting: val_node_relationship.")
+        logger.debug("Starting: val_node_relationships.")
        proxlb_data["guests"][guest_name]["processed"] = True

-        if proxlb_data["guests"][guest_name]["node_relationship"]:
-            logger.info(f"Guest '{guest_name}' has a specific relationship defined to node: {proxlb_data['guests'][guest_name]['node_relationship']}. Pinning to node.")
+        if len(proxlb_data["guests"][guest_name]["node_relationships"]) > 0:
+            logger.debug(f"Guest '{guest_name}' has relationships defined to node(s): {','.join(proxlb_data['guests'][guest_name]['node_relationships'])}. Pinning to node.")
+
+            # Get the node with the most free resources of the group
+            guest_node_relation_list = proxlb_data["guests"][guest_name]["node_relationships"]
+            Calculations.get_most_free_node(proxlb_data, False, guest_node_relation_list)

            # Validate if the specified node name is really part of the cluster
-            if proxlb_data['guests'][guest_name]['node_relationship'] in proxlb_data["nodes"].keys():
-                logger.info(f"Guest '{guest_name}' has a specific relationship defined to node: {proxlb_data['guests'][guest_name]['node_relationship']} is a known hypervisor node in the cluster.")
-                # Pin the guest to the specified hypervisor node.
-                proxlb_data["meta"]["balancing"]["balance_next_node"] = proxlb_data['guests'][guest_name]['node_relationship']
+            if proxlb_data["meta"]["balancing"]["balance_next_node"] in proxlb_data["nodes"].keys():
+                logger.debug(f"Guest '{guest_name}' has a specific relationship defined to node: {proxlb_data['meta']['balancing']['balance_next_node']} is a known hypervisor node in the cluster.")
            else:
-                logger.warning(f"Guest '{guest_name}' has a specific relationship defined to node: {proxlb_data['guests'][guest_name]['node_relationship']} but this node name is not known in the cluster!")
+                logger.warning(f"Guest '{guest_name}' has a specific relationship defined to node: {proxlb_data['meta']['balancing']['balance_next_node']} but this node name is not known in the cluster!")

        else:
-            logger.info(f"Guest '{guest_name}' does not have any specific node relationships.")
+            logger.debug(f"Guest '{guest_name}' does not have any specific node relationships.")

-        logger.debug("Finished: val_node_relationship.")
+        logger.debug("Finished: val_node_relationships.")

    @staticmethod
-    def update_node_resources(proxlb_data: Dict[str, Any]) -> None:
+    def update_node_resources(proxlb_data):
        """
        Updates the resource allocation and usage statistics for nodes when a guest
        is moved from one node to another.
@@ -331,6 +519,11 @@ class Calculations:
        """
        logger.debug("Starting: update_node_resources.")
        guest_name = proxlb_data["meta"]["balancing"]["balance_next_guest"]
+
+        if guest_name == "":
+            logger.debug("No guest defined to update node resources for.")
+            return
+
        node_current = proxlb_data["guests"][guest_name]["node_current"]
        node_target = proxlb_data["meta"]["balancing"]["balance_next_node"]

@@ -375,68 +568,3 @@ class Calculations:
        logger.debug(f"Set guest {guest_name} from node {node_current} to node {node_target}.")

        logger.debug("Finished: update_node_resources.")
-
-    @staticmethod
-    def update_cluster_resources(proxlb_data: Dict[str, Any], node: str, action: str) -> None:
-        """
-        Updates the cluster resource statistics based on the specified action and node.
-
-        This method modifies the cluster-level resource data (such as CPU, memory, disk usage,
-        and node counts) based on the action performed ('add' or 'remove') for the specified node.
-        It calculates the updated statistics after adding or removing a node and logs the results.
-
-        Parameters:
-            proxlb_data (Dict[str, Any]): The data representing the current state of the cluster,
-                                        including node-level statistics for CPU, memory, and disk.
-            node (str): The identifier of the node whose resources are being added or removed from the cluster.
-            action (str): The action to perform, either 'add' or 'remove'. 'add' will include the node's
-                        resources in the cluster, while 'remove' will exclude the node's resources.
-
-        Returns:
-            None: The function modifies the `proxlb_data` dictionary in place to update the cluster resources.
-        """
-        logger.debug("Starting: update_cluster_resources.")
-        logger.debug(f"DPM: Updating cluster statistics by online node {node}. Action: {action}")
-        logger.debug(f"DPM: update_cluster_resources - Before {action}: {proxlb_data['cluster']['memory_free_percent']}")
-
-        if action == "add":
-            proxlb_data["cluster"]["node_count"] = proxlb_data["cluster"].get("node_count", 0) + 1
-            proxlb_data["cluster"]["cpu_total"] = proxlb_data["cluster"].get("cpu_total", 0) + proxlb_data["nodes"][node]["cpu_total"]
-            proxlb_data["cluster"]["cpu_used"] = proxlb_data["cluster"].get("cpu_used", 0) + proxlb_data["nodes"][node]["cpu_used"]
-            proxlb_data["cluster"]["cpu_free"] = proxlb_data["cluster"].get("cpu_free", 0) + proxlb_data["nodes"][node]["cpu_free"]
-            proxlb_data["cluster"]["cpu_free_percent"] = proxlb_data["cluster"].get("cpu_free", 0) / proxlb_data["cluster"].get("cpu_total", 0) * 100
-            proxlb_data["cluster"]["cpu_used_percent"] = proxlb_data["cluster"].get("cpu_used", 0) / proxlb_data["cluster"].get("cpu_total", 0) * 100
-            proxlb_data["cluster"]["memory_total"] = proxlb_data["cluster"].get("memory_total", 0) + proxlb_data["nodes"][node]["memory_total"]
-            proxlb_data["cluster"]["memory_used"] = proxlb_data["cluster"].get("memory_used", 0) + proxlb_data["nodes"][node]["memory_used"]
-            proxlb_data["cluster"]["memory_free"] = proxlb_data["cluster"].get("memory_free", 0) + proxlb_data["nodes"][node]["memory_free"]
-            proxlb_data["cluster"]["memory_free_percent"] = proxlb_data["cluster"].get("memory_free", 0) / proxlb_data["cluster"].get("memory_total", 0) * 100
-            proxlb_data["cluster"]["memory_used_percent"] = proxlb_data["cluster"].get("memory_used", 0) / proxlb_data["cluster"].get("memory_total", 0) * 100
-            proxlb_data["cluster"]["disk_total"] = proxlb_data["cluster"].get("disk_total", 0) + proxlb_data["nodes"][node]["disk_total"]
-            proxlb_data["cluster"]["disk_used"] = proxlb_data["cluster"].get("disk_used", 0) + proxlb_data["nodes"][node]["disk_used"]
-            proxlb_data["cluster"]["disk_free"] = proxlb_data["cluster"].get("disk_free", 0) + proxlb_data["nodes"][node]["disk_free"]
-            proxlb_data["cluster"]["disk_free_percent"] = proxlb_data["cluster"].get("disk_free", 0) / proxlb_data["cluster"].get("disk_total", 0) * 100
-            proxlb_data["cluster"]["disk_used_percent"] = proxlb_data["cluster"].get("disk_used", 0) / proxlb_data["cluster"].get("disk_total", 0) * 100
-            proxlb_data["cluster"]["node_count_available"] = proxlb_data["cluster"].get("node_count_available", 0) + 1
-            proxlb_data["cluster"]["node_count_overall"] = proxlb_data["cluster"].get("node_count_overall", 0) + 1
-
-        if action == "remove":
-            proxlb_data["cluster"]["node_count"] = proxlb_data["cluster"].get("node_count", 0) - 1
-            proxlb_data["cluster"]["cpu_total"] = proxlb_data["cluster"].get("cpu_total", 0) - proxlb_data["nodes"][node]["cpu_total"]
-            proxlb_data["cluster"]["cpu_used"] = proxlb_data["cluster"].get("cpu_used", 0) - proxlb_data["nodes"][node]["cpu_used"]
-            proxlb_data["cluster"]["cpu_free"] = proxlb_data["cluster"].get("cpu_free", 0) - proxlb_data["nodes"][node]["cpu_free"]
-            proxlb_data["cluster"]["cpu_free_percent"] = proxlb_data["cluster"].get("cpu_free", 0) / proxlb_data["cluster"].get("cpu_total", 0) * 100
-            proxlb_data["cluster"]["cpu_used_percent"] = proxlb_data["cluster"].get("cpu_used", 0) / proxlb_data["cluster"].get("cpu_total", 0) * 100
-            proxlb_data["cluster"]["memory_total"] = proxlb_data["cluster"].get("memory_total", 0) - proxlb_data["nodes"][node]["memory_total"]
-            proxlb_data["cluster"]["memory_used"] = proxlb_data["cluster"].get("memory_used") - proxlb_data["nodes"][node]["memory_used"]
-            proxlb_data["cluster"]["memory_free"] = proxlb_data["cluster"].get("memory_free") - proxlb_data["nodes"][node]["memory_free"]
-            proxlb_data["cluster"]["memory_free_percent"] = proxlb_data["cluster"].get("memory_free") / proxlb_data["cluster"].get("memory_total", 0) * 100
-            proxlb_data["cluster"]["memory_used_percent"] = proxlb_data["cluster"].get("memory_used") / proxlb_data["cluster"].get("memory_total", 0) * 100
-            proxlb_data["cluster"]["disk_total"] = proxlb_data["cluster"].get("disk_total", 0) - proxlb_data["nodes"][node]["disk_total"]
-            proxlb_data["cluster"]["disk_used"] = proxlb_data["cluster"].get("disk_used", 0) - proxlb_data["nodes"][node]["disk_used"]
-            proxlb_data["cluster"]["disk_free"] = proxlb_data["cluster"].get("disk_free", 0) - proxlb_data["nodes"][node]["disk_free"]
-            proxlb_data["cluster"]["disk_free_percent"] = proxlb_data["cluster"].get("disk_free", 0) / proxlb_data["cluster"].get("disk_total", 0) * 100
-            proxlb_data["cluster"]["disk_used_percent"] = proxlb_data["cluster"].get("disk_used", 0) / proxlb_data["cluster"].get("disk_total", 0) * 100
-            proxlb_data["cluster"]["node_count_available"] = proxlb_data["cluster"].get("node_count_available", 0) - 1
-
-        logger.debug(f"DPM: update_cluster_resources - After {action}: {proxlb_data['cluster']['memory_free_percent']}")
-        logger.debug("Finished: update_cluster_resources.")
--- a/proxlb/models/dpm.py
+++ b/proxlb/models/dpm.py
@@ -1,255 +0,0 @@
-"""
-The DPM (Dynamic Power Management) class is responsible for the dynamic management
-of nodes within a Proxmox cluster, optimizing resource utilization by controlling
-node power states based on specified schedules and conditions.
-
-This class provides functionality for:
- Tracking and validating schedules for dynamic power management.
- Shutting down nodes that are underutilized or not needed.
- Starting up nodes using Wake-on-LAN (WOL) based on certain conditions.
- Ensuring that nodes are properly flagged for maintenance and startup/shutdown actions.
-
-The DPM class can operate in different modes, such as static and automatic,
-to either perform predefined actions or dynamically adjust based on real-time resource usage.
-"""
-
-__author__ = "Florian Paul Azim Hoberg <gyptazy>"
-__copyright__ = "Copyright (C) 2025 Florian Paul Azim Hoberg (@gyptazy)"
-__license__ = "GPL-3.0"
-
-
-import proxmoxer
-from typing import Dict, Any
-from models.calculations import Calculations
-from utils.logger import SystemdLogger
-
-logger = SystemdLogger()
-
-
-class DPM:
-    """
-    The DPM (Dynamic Power Management) class is responsible for the dynamic management
-    of nodes within a Proxmox cluster, optimizing resource utilization by controlling
-    node power states based on specified schedules and conditions.
-
-    This class provides functionality for:
-    - Tracking and validating schedules for dynamic power management.
-    - Shutting down nodes that are underutilized or not needed.
-    - Starting up nodes using Wake-on-LAN (WOL) based on certain conditions.
-    - Ensuring that nodes are properly flagged for maintenance and startup/shutdown actions.
-
-    The DPM class can operate in different modes, such as static and automatic,
-    to either perform predefined actions or dynamically adjust based on real-time resource usage.
-
-    Attributes:
-        None directly defined for the class; instead, all actions are based on input data
-        and interactions with the Proxmox API and other helper functions.
-
-    Methods:
-        __init__(proxlb_data: Dict[str, Any]):
-            Initializes the DPM class, checking whether DPM is enabled and operating in the
-            appropriate mode (static or auto).
-
-        dpm_static(proxlb_data: Dict[str, Any]) -> None:
-            Evaluates the cluster's resource availability and performs static power management
-            actions by removing nodes that are not required.
-
-        dpm_shutdown_nodes(proxmox_api, proxlb_data) -> None:
-            Shuts down nodes flagged for DPM shutdown by using the Proxmox API, ensuring
-            that Wake-on-LAN (WOL) is available for proper node recovery.
-
-        dpm_startup_nodes(proxmox_api, proxlb_data) -> None:
-            Powers on nodes that are flagged for startup and are not in maintenance mode,
-            leveraging Wake-on-LAN (WOL) functionality.
-
-        dpm_validate_wol_mac(proxmox_api, node) -> None:
-            Validates and retrieves the Wake-on-LAN (WOL) MAC address for a given node,
-            ensuring that a valid address is set for powering on the node remotely.
-    """
-
-    def __init__(self, proxlb_data: Dict[str, Any]):
-        """
-        Initializes the DPM class with the provided ProxLB data.
-
-        Args:
-            proxlb_data (dict): The data required for balancing VMs and CTs.
-        """
-        logger.debug("Starting: dpm class.")
-
-        if proxlb_data["meta"].get("dpm", {}).get("enable", False):
-            logger.debug("DPM function is enabled.")
-            mode = proxlb_data["meta"].get("dpm", {}).get("mode", None)
-
-            if mode == "static":
-                self.dpm_static(proxlb_data)
-
-            if mode == "auto":
-                self.dpm_auto(proxlb_data)
-
-        else:
-            logger.debug("DPM function is not enabled.")
-
-        logger.debug("Finished: dpm class.")
-
-    def dpm_static(self, proxlb_data: Dict[str, Any]) -> None:
-        """
-        Evaluates and performs static Distributed Power Management (DPM) actions based on current cluster state.
-
-        This method monitors cluster resource availability and attempts to reduce the number of active nodes
-        when sufficient free resources are available. It ensures a minimum number of nodes remains active
-        and prioritizes shutting down nodes with the least utilized resources to minimize impact. Nodes selected
-        for shutdown are marked for maintenance and flagged for DPM shutdown.
-
-        Parameters:
-            proxlb_data (Dict[str, Any]): A dictionary containing metadata, cluster status, and node-level information
-                including resource utilization, configuration settings, and DPM thresholds.
-
-        Returns:
-            None: Modifies the input dictionary in-place to reflect updated cluster state and node flags.
-        """
-        logger.debug("Starting: dpm_static.")
-
-        method = proxlb_data["meta"].get("dpm", {}).get("method", "memory")
-        cluster_nodes_overall = proxlb_data["cluster"]["node_count_overall"]
-        cluster_nodes_available = proxlb_data["cluster"]["node_count_available"]
-        cluster_free_resources_percent = int(proxlb_data["cluster"][f"{method}_free_percent"])
-        cluster_free_resources_req_min = proxlb_data["meta"].get("dpm", {}).get("cluster_min_free_resources", 0)
-        cluster_mind_nodes = proxlb_data["meta"].get("dpm", {}).get("cluster_min_nodes", 3)
-        logger.debug(f"DPM: Cluster Nodes: {cluster_nodes_overall} | Nodes available: {cluster_nodes_available} | Nodes offline: {cluster_nodes_overall - cluster_nodes_available}")
-
-        # Only proceed removing nodes if the cluster has enough resources
-        while cluster_free_resources_percent > cluster_free_resources_req_min:
-            logger.debug(f"DPM: More free resources {cluster_free_resources_percent}% available than required: {cluster_free_resources_req_min}%. DPM evaluation starting...")
-
-            # Ensure that we have at least a defined minimum of nodes left
-            if cluster_nodes_available > cluster_mind_nodes:
-                logger.debug(f"DPM: A minimum of {cluster_mind_nodes} nodes is required. {cluster_nodes_available} are available. Proceeding...")
-
-                # Get the node with the fewest used resources to keep migrations low
-                Calculations.get_most_free_node(proxlb_data, False)
-                dpm_node = proxlb_data["meta"]["balancing"]["balance_next_node"]
-
-                # Perform cluster calculation for evaluating how many nodes can safely leave
-                # the cluster. Further object calculations are being processed afterwards by
-                # the calculation class
-                logger.debug(f"DPM: Removing node {dpm_node} from cluster. Node will be turned off later.")
-                Calculations.update_cluster_resources(proxlb_data, dpm_node, "remove")
-                cluster_free_resources_percent = int(proxlb_data["cluster"][f"{method}_free_percent"])
-                logger.debug(f"DPM: Free cluster resources changed to: {int(proxlb_data['cluster'][f'{method}_free_percent'])}%.")
-
-                # Set node to maintenance and DPM shutdown
-                proxlb_data["nodes"][dpm_node]["maintenance"] = True
-                proxlb_data["nodes"][dpm_node]["dpm_shutdown"] = True
-            else:
-                logger.warning(f"DPM: A minimum of {cluster_mind_nodes} nodes is required. {cluster_nodes_available} are available. Cannot proceed!")
-
-        logger.debug(f"DPM: Not enough free resources {cluster_free_resources_percent}% available than required: {cluster_free_resources_req_min}%. DPM evaluation stopped.")
-        logger.debug("Finished: dpm_static.")
-        return proxlb_data
-
-    @staticmethod
-    def dpm_shutdown_nodes(proxmox_api, proxlb_data: Dict[str, Any]) -> None:
-        """
-        Shuts down cluster nodes that are marked for maintenance and flagged for DPM shutdown.
-
-        This method iterates through the cluster nodes in the provided data and attempts to
-        power off any node that has both the 'maintenance' and 'dpm_shutdown' flags set.
-        It communicates with the Proxmox API to issue shutdown commands and logs any failures.
-
-        Parameters:
-            proxmox_api: An instance of the Proxmox API client used to issue node shutdown commands.
-            proxlb_data: A dictionary containing node status information, including flags for
-                maintenance and DPM shutdown readiness.
-
-        Returns:
-            None: Performs shutdown operations and logs outcomes; modifies no data directly.
-        """
-        logger.debug("Starting: dpm_shutdown_nodes.")
-        for node, node_info in proxlb_data["nodes"].items():
-
-            if node_info["maintenance"] and node_info["dpm_shutdown"]:
-                logger.debug(f"DPM: Node: {node} is flagged as maintenance mode and to be powered off.")
-
-                # Ensure that the node has a valid WOL MAC defined. If not
-                # we would be unable to power on that system again
-                valid_wol_mac = DPM.dpm_validate_wol_mac(proxmox_api, node)
-
-                if valid_wol_mac:
-                    try:
-                        logger.debug(f"DPM: Shutting down node: {node}.")
-                        job_id = proxmox_api.nodes(node).status.post(command="shutdown")
-                    except proxmoxer.core.ResourceException as proxmox_api_error:
-                        logger.critical(f"DPM: Error while powering off node {node}. Please check job-id: {job_id}")
-                        logger.debug(f"DPM: Error while powering off node {node}. Please check job-id: {job_id}")
-                else:
-                    logger.critical(f"DPM: Node {node} cannot be powered off due to missing WOL MAC. Please define a valid WOL MAC for this node.")
-
-        logger.debug("Finished: dpm_shutdown_nodes.")
-
-    @staticmethod
-    def dpm_startup_nodes(proxmox_api, proxlb_data: Dict[str, Any]) -> None:
-        """
-        Starts uo cluster nodes that are marked for DPM start up.
-
-        This method iterates through the cluster nodes in the provided data and attempts to
-        power on any node that is not flagged as 'maintenance' but flagged as 'dpm_startup'.
-        It communicates with the Proxmox API to issue poweron commands and logs any failures.
-
-        Parameters:
-            proxmox_api: An instance of the Proxmox API client used to issue node startup commands.
-            proxlb_data: A dictionary containing node status information, including flags for
-                maintenance and DPM shutdown readiness.
-
-        Returns:
-            None: Performs poweron operations and logs outcomes; modifies no data directly.
-        """
-        logger.debug("Starting: dpm_startup_nodes.")
-        for node, node_info in proxlb_data["nodes"].items():
-
-            if not node_info["maintenance"]:
-                logger.debug(f"DPM: Node: {node} is not in maintenance mode.")
-
-                if node_info["dpm_startup"]:
-                    logger.debug(f"DPM: Node: {node} is flagged as to be started.")
-
-                    try:
-                        logger.debug(f"DPM: Powering on node: {node}.")
-                        # Important: This requires Proxmox Operators to define the
-                        # WOL address for each node within the Proxmox webinterface
-                        job_id = proxmox_api.nodes().wakeonlan.post(node=node)
-                    except proxmoxer.core.ResourceException as proxmox_api_error:
-                        logger.critical(f"DPM: Error while powering on node {node}. Please check job-id: {job_id}")
-                        logger.debug(f"DPM: Error while powering on node {node}. Please check job-id: {job_id}")
-
-        logger.debug("Finished: dpm_startup_nodes.")
-
-    @staticmethod
-    def dpm_validate_wol_mac(proxmox_api, node: Dict[str, Any]) -> str:
-        """
-        Retrieves and validates the Wake-on-LAN (WOL) MAC address for a specified node.
-
-        This method fetches the MAC address configured for Wake-on-LAN (WOL) from the Proxmox API.
-        If the MAC address is found, it is logged. In case of failure to retrieve the address,
-        a critical log is generated indicating the absence of a WOL MAC address for the node.
-
-        Parameters:
-            proxmox_api: An instance of the Proxmox API client used to query node configurations.
-            node: The identifier (name or ID) of the node for which the WOL MAC address is to be validated.
-
-        Returns:
-            node_wol_mac_address: The WOL MAC address for the specified node if found, otherwise `None`.
-        """
-        logger.debug("Starting: dpm_validate_wol_mac.")
-
-        try:
-            logger.debug(f"DPM: Getting WOL MAC address for node {node} from API.")
-            node_wol_mac_address = proxmox_api.nodes(node).config.get(property="wakeonlan")
-            node_wol_mac_address = node_wol_mac_address.get("wakeonlan")
-            logger.debug(f"DPM: Node {node} has MAC address: {node_wol_mac_address} for WOL.")
-        except proxmoxer.core.ResourceException as proxmox_api_error:
-            logger.debug(f"DPM: Failed to get WOL MAC address for node {node} from API.")
-            node_wol_mac_address = None
-            logger.critical(f"DPM: Node {node} has no MAC address defined for WOL.")
-
-        logger.debug("Finished: dpm_validate_wol_mac.")
-        return node_wol_mac_address
--- a/proxlb/models/guests.py
+++ b/proxlb/models/guests.py
@@ -10,6 +10,7 @@ __license__ = "GPL-3.0"

 from typing import Dict, Any
 from utils.logger import SystemdLogger
+from models.pools import Pools
 from models.tags import Tags
 import time

@@ -35,7 +36,7 @@ class Guests:
        """

    @staticmethod
-    def get_guests(proxmox_api: any, nodes: Dict[str, Any], meta: Dict[str, Any]) -> Dict[str, Any]:
+    def get_guests(proxmox_api: any, pools: Dict[str, Any], nodes: Dict[str, Any], meta: Dict[str, Any], proxlb_config: Dict[str, Any]) -> Dict[str, Any]:
        """
        Get metrics of all guests in a Proxmox cluster.

@@ -46,6 +47,8 @@ class Guests:
        Args:
            proxmox_api (any): The Proxmox API client instance.
            nodes (Dict[str, Any]): A dictionary containing information about the nodes in the Proxmox cluster.
+            meta (Dict[str, Any]): A dictionary containing metadata information.
+            proxmox_config (Dict[str, Any]): A dictionary containing the ProxLB configuration.

        Returns:
            Dict[str, Any]: A dictionary containing metrics and information for all running guests.
@@ -62,36 +65,43 @@ class Guests:
            # resource metrics for rebalancing to ensure that we do not overprovisiong the node.
            for guest in proxmox_api.nodes(node).qemu.get():
                if guest['status'] == 'running':
-
-                    # If the balancing method is set to cpu, we need to wait for the guest to report
-                    # cpu usage. This is important for the balancing process to ensure that we do not
-                    # wait for a guest for an infinite time.
-                    if meta["meta"]["balancing"]["method"] == "cpu":
-                        retry_counter = 0
-                        while guest['cpu'] == 0 and retry_counter < 10:
-                            guest = proxmox_api.nodes(node).qemu(guest['vmid']).status.current.get()
-                            logger.debug(f"Guest {guest['name']} (type VM) is reporting {guest['cpu']} cpu usage on retry {retry_counter}.")
-                            retry_counter += 1
-                            time.sleep(1)
-
                    guests['guests'][guest['name']] = {}
                    guests['guests'][guest['name']]['name'] = guest['name']
-                    guests['guests'][guest['name']]['cpu_total'] = guest['cpus']
-                    guests['guests'][guest['name']]['cpu_used'] = guest['cpu'] * guest['cpus']
+                    guests['guests'][guest['name']]['cpu_total'] = int(guest['cpus'])
+                    guests['guests'][guest['name']]['cpu_used'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', None)
+                    guests['guests'][guest['name']]['cpu_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'some')
+                    guests['guests'][guest['name']]['cpu_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'full')
+                    guests['guests'][guest['name']]['cpu_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'some', spikes=True)
+                    guests['guests'][guest['name']]['cpu_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'full', spikes=True)
+                    guests['guests'][guest['name']]['cpu_pressure_hot'] = False
                    guests['guests'][guest['name']]['memory_total'] = guest['maxmem']
                    guests['guests'][guest['name']]['memory_used'] = guest['mem']
+                    guests['guests'][guest['name']]['memory_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'some')
+                    guests['guests'][guest['name']]['memory_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'full')
+                    guests['guests'][guest['name']]['memory_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'some', spikes=True)
+                    guests['guests'][guest['name']]['memory_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'full', spikes=True)
+                    guests['guests'][guest['name']]['memory_pressure_hot'] = False
                    guests['guests'][guest['name']]['disk_total'] = guest['maxdisk']
                    guests['guests'][guest['name']]['disk_used'] = guest['disk']
+                    guests['guests'][guest['name']]['disk_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'some')
+                    guests['guests'][guest['name']]['disk_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'full')
+                    guests['guests'][guest['name']]['disk_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'some', spikes=True)
+                    guests['guests'][guest['name']]['disk_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'full', spikes=True)
+                    guests['guests'][guest['name']]['disk_pressure_hot'] = False
                    guests['guests'][guest['name']]['id'] = guest['vmid']
                    guests['guests'][guest['name']]['node_current'] = node
                    guests['guests'][guest['name']]['node_target'] = node
                    guests['guests'][guest['name']]['processed'] = False
+                    guests['guests'][guest['name']]['pressure_hot'] = False
                    guests['guests'][guest['name']]['tags'] = Tags.get_tags_from_guests(proxmox_api, node, guest['vmid'], 'vm')
-                    guests['guests'][guest['name']]['affinity_groups'] = Tags.get_affinity_groups(guests['guests'][guest['name']]['tags'])
-                    guests['guests'][guest['name']]['anti_affinity_groups'] = Tags.get_anti_affinity_groups(guests['guests'][guest['name']]['tags'])
+                    guests['guests'][guest['name']]['pools'] = Pools.get_pools_for_guest(guest['name'], pools)
+                    guests['guests'][guest['name']]['affinity_groups'] = Tags.get_affinity_groups(guests['guests'][guest['name']]['tags'], guests['guests'][guest['name']]['pools'], proxlb_config)
+                    guests['guests'][guest['name']]['anti_affinity_groups'] = Tags.get_anti_affinity_groups(guests['guests'][guest['name']]['tags'], guests['guests'][guest['name']]['pools'], proxlb_config)
                    guests['guests'][guest['name']]['ignore'] = Tags.get_ignore(guests['guests'][guest['name']]['tags'])
-                    guests['guests'][guest['name']]['node_relationship'] = Tags.get_node_relationship(guests['guests'][guest['name']]['tags'])
+                    guests['guests'][guest['name']]['node_relationships'] = Tags.get_node_relationships(guests['guests'][guest['name']]['tags'], nodes, guests['guests'][guest['name']]['pools'], proxlb_config)
                    guests['guests'][guest['name']]['type'] = 'vm'
+
+                    logger.debug(f"Resources of Guest {guest['name']} (type VM) added: {guests['guests'][guest['name']]}")
                else:
                    logger.debug(f'Metric for VM {guest["name"]} ignored because VM is not running.')

@@ -102,24 +112,97 @@ class Guests:
                if guest['status'] == 'running':
                    guests['guests'][guest['name']] = {}
                    guests['guests'][guest['name']]['name'] = guest['name']
-                    guests['guests'][guest['name']]['cpu_total'] = guest['cpus']
-                    guests['guests'][guest['name']]['cpu_used'] = guest['cpu']
+                    guests['guests'][guest['name']]['cpu_total'] = int(guest['cpus'])
+                    guests['guests'][guest['name']]['cpu_used'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', None)
+                    guests['guests'][guest['name']]['cpu_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'some')
+                    guests['guests'][guest['name']]['cpu_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'full')
+                    guests['guests'][guest['name']]['cpu_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'some', spikes=True)
+                    guests['guests'][guest['name']]['cpu_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'cpu', 'full', spikes=True)
+                    guests['guests'][guest['name']]['cpu_pressure_hot'] = False
                    guests['guests'][guest['name']]['memory_total'] = guest['maxmem']
                    guests['guests'][guest['name']]['memory_used'] = guest['mem']
+                    guests['guests'][guest['name']]['memory_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'some')
+                    guests['guests'][guest['name']]['memory_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'full')
+                    guests['guests'][guest['name']]['memory_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'some', spikes=True)
+                    guests['guests'][guest['name']]['memory_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'memory', 'full', spikes=True)
+                    guests['guests'][guest['name']]['memory_pressure_hot'] = False
                    guests['guests'][guest['name']]['disk_total'] = guest['maxdisk']
                    guests['guests'][guest['name']]['disk_used'] = guest['disk']
+                    guests['guests'][guest['name']]['disk_pressure_some_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'some')
+                    guests['guests'][guest['name']]['disk_pressure_full_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'full')
+                    guests['guests'][guest['name']]['disk_pressure_some_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'some', spikes=True)
+                    guests['guests'][guest['name']]['disk_pressure_full_spikes_percent'] = Guests.get_guest_rrd_data(proxmox_api, node, guest['vmid'], guest['name'], 'disk', 'full', spikes=True)
+                    guests['guests'][guest['name']]['disk_pressure_hot'] = False
                    guests['guests'][guest['name']]['id'] = guest['vmid']
                    guests['guests'][guest['name']]['node_current'] = node
                    guests['guests'][guest['name']]['node_target'] = node
                    guests['guests'][guest['name']]['processed'] = False
+                    guests['guests'][guest['name']]['pressure_hot'] = False
                    guests['guests'][guest['name']]['tags'] = Tags.get_tags_from_guests(proxmox_api, node, guest['vmid'], 'ct')
-                    guests['guests'][guest['name']]['affinity_groups'] = Tags.get_affinity_groups(guests['guests'][guest['name']]['tags'])
-                    guests['guests'][guest['name']]['anti_affinity_groups'] = Tags.get_anti_affinity_groups(guests['guests'][guest['name']]['tags'])
+                    guests['guests'][guest['name']]['pools'] = Pools.get_pools_for_guest(guest['name'], pools)
+                    guests['guests'][guest['name']]['affinity_groups'] = Tags.get_affinity_groups(guests['guests'][guest['name']]['tags'], guests['guests'][guest['name']]['pools'], proxlb_config)
+                    guests['guests'][guest['name']]['anti_affinity_groups'] = Tags.get_anti_affinity_groups(guests['guests'][guest['name']]['tags'], guests['guests'][guest['name']]['pools'], proxlb_config)
                    guests['guests'][guest['name']]['ignore'] = Tags.get_ignore(guests['guests'][guest['name']]['tags'])
-                    guests['guests'][guest['name']]['node_relationship'] = Tags.get_node_relationship(guests['guests'][guest['name']]['tags'])
+                    guests['guests'][guest['name']]['node_relationships'] = Tags.get_node_relationships(guests['guests'][guest['name']]['tags'], nodes, guests['guests'][guest['name']]['pools'], proxlb_config)
                    guests['guests'][guest['name']]['type'] = 'ct'
+
+                    logger.debug(f"Resources of Guest {guest['name']} (type CT) added: {guests['guests'][guest['name']]}")
                else:
                    logger.debug(f'Metric for CT {guest["name"]} ignored because CT is not running.')

        logger.debug("Finished: get_guests.")
        return guests
+
+    @staticmethod
+    def get_guest_rrd_data(proxmox_api, node_name: str, vm_id: int, vm_name: str, object_name: str, object_type: str, spikes=False) -> float:
+        """
+        Retrieves the rrd data metrics for a specific resource (CPU, memory, disk) of a guest VM or CT.
+
+        Args:
+            proxmox_api (Any): The Proxmox API client instance.
+            node_name (str): The name of the node hosting the guest.
+            vm_id (int): The ID of the guest VM or CT.
+            vm_name (str): The name of the guest VM or CT.
+            object_name (str): The resource type to query (e.g., 'cpu', 'memory', 'disk').
+            object_type (str, optional): The pressure type ('some', 'full') or None for average usage.
+            spikes (bool, optional): Whether to consider spikes in the calculation. Defaults to False.
+
+        Returns:
+            float: The calculated average usage value for the specified resource.
+        """
+        logger.debug("Starting: get_guest_rrd_data.")
+        time.sleep(0.1)
+
+        try:
+            if spikes:
+                logger.debug(f"Getting spike RRD data for {object_name} from guest: {vm_name}.")
+                guest_data_rrd = proxmox_api.nodes(node_name).qemu(vm_id).rrddata.get(timeframe="hour", cf="MAX")
+            else:
+                logger.debug(f"Getting average RRD data for {object_name} from guest: {vm_name}.")
+                guest_data_rrd = proxmox_api.nodes(node_name).qemu(vm_id).rrddata.get(timeframe="hour", cf="AVERAGE")
+        except Exception:
+            logger.error(f"Failed to retrieve RRD data for guest: {vm_name} (ID: {vm_id}) on node: {node_name}. Using 0.0 as value.")
+            logger.debug("Finished: get_guest_rrd_data.")
+            return float(0.0)
+
+        if object_type:
+
+            lookup_key = f"pressure{object_name}{object_type}"
+            if spikes:
+                # RRD data is collected every minute, so we look at the last 6 entries
+                # and take the maximum value to represent the spike
+                logger.debug(f"Getting RRD data (spike: {spikes}) of pressure for {object_name} {object_type} from guest: {vm_name}.")
+                rrd_data_value = [row.get(lookup_key) for row in guest_data_rrd if row.get(lookup_key) is not None]
+                rrd_data_value = max(rrd_data_value[-6:], default=0.0)
+            else:
+                # Calculate the average value from the RRD data entries
+                logger.debug(f"Getting RRD data (spike: {spikes}) of pressure for {object_name} {object_type} from guest: {vm_name}.")
+                rrd_data_value = sum(entry.get(lookup_key, 0.0) for entry in guest_data_rrd) / len(guest_data_rrd)
+
+        else:
+            logger.debug(f"Getting RRD data of cpu usage from guest: {vm_name}.")
+            rrd_data_value = sum(entry.get("cpu", 0.0) for entry in guest_data_rrd) / len(guest_data_rrd)
+
+        logger.debug(f"RRD data (spike: {spikes}) for {object_name} from guest: {vm_name}: {rrd_data_value}")
+        logger.debug("Finished: get_guest_rrd_data.")
+        return rrd_data_value
--- a/proxlb/models/nodes.py
+++ b/proxlb/models/nodes.py
@@ -21,6 +21,7 @@ __copyright__ = "Copyright (C) 2025 Florian Paul Azim Hoberg (@gyptazy)"
 __license__ = "GPL-3.0"


+import time
 from typing import Dict, Any
 from utils.logger import SystemdLogger

@@ -47,6 +48,7 @@ class Nodes:

        Args:
            proxmox_api (any): The Proxmox API client instance.
+            proxmox_config (Dict[str, Any]): A dictionary containing the ProxLB configuration.
            nodes (Dict[str, Any]): A dictionary containing information about the nodes in the Proxmox cluster.

        Returns:
@@ -54,16 +56,15 @@ class Nodes:
        """
        logger.debug("Starting: get_nodes.")
        nodes = {"nodes": {}}
-        cluster = {"cluster": {}}

        for node in proxmox_api.nodes.get():
            # Ignoring a node results into ignoring all placed guests on the ignored node!
            if node["status"] == "online" and not Nodes.set_node_ignore(proxlb_config, node["node"]):
                nodes["nodes"][node["node"]] = {}
                nodes["nodes"][node["node"]]["name"] = node["node"]
+                nodes["nodes"][node["node"]]["pve_version"] = Nodes.get_node_pve_version(proxmox_api, node["node"])
+                nodes["nodes"][node["node"]]["pressure_hot"] = False
                nodes["nodes"][node["node"]]["maintenance"] = False
-                nodes["nodes"][node["node"]]["dpm_shutdown"] = False
-                nodes["nodes"][node["node"]]["dpm_startup"] = False
                nodes["nodes"][node["node"]]["cpu_total"] = node["maxcpu"]
                nodes["nodes"][node["node"]]["cpu_assigned"] = 0
                nodes["nodes"][node["node"]]["cpu_used"] = node["cpu"] * node["maxcpu"]
@@ -71,6 +72,11 @@ class Nodes:
                nodes["nodes"][node["node"]]["cpu_assigned_percent"] = nodes["nodes"][node["node"]]["cpu_assigned"] / nodes["nodes"][node["node"]]["cpu_total"] * 100
                nodes["nodes"][node["node"]]["cpu_free_percent"] = nodes["nodes"][node["node"]]["cpu_free"] / node["maxcpu"] * 100
                nodes["nodes"][node["node"]]["cpu_used_percent"] = nodes["nodes"][node["node"]]["cpu_used"] / node["maxcpu"] * 100
+                nodes["nodes"][node["node"]]["cpu_pressure_some_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "cpu", "some")
+                nodes["nodes"][node["node"]]["cpu_pressure_full_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "cpu", "full")
+                nodes["nodes"][node["node"]]["cpu_pressure_some_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "cpu", "some", spikes=True)
+                nodes["nodes"][node["node"]]["cpu_pressure_full_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "cpu", "full", spikes=True)
+                nodes["nodes"][node["node"]]["cpu_pressure_hot"] = False
                nodes["nodes"][node["node"]]["memory_total"] = node["maxmem"]
                nodes["nodes"][node["node"]]["memory_assigned"] = 0
                nodes["nodes"][node["node"]]["memory_used"] = node["mem"]
@@ -78,6 +84,11 @@ class Nodes:
                nodes["nodes"][node["node"]]["memory_assigned_percent"] = nodes["nodes"][node["node"]]["memory_assigned"] / nodes["nodes"][node["node"]]["memory_total"] * 100
                nodes["nodes"][node["node"]]["memory_free_percent"] = nodes["nodes"][node["node"]]["memory_free"] / node["maxmem"] * 100
                nodes["nodes"][node["node"]]["memory_used_percent"] = nodes["nodes"][node["node"]]["memory_used"] / node["maxmem"] * 100
+                nodes["nodes"][node["node"]]["memory_pressure_some_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "memory", "some")
+                nodes["nodes"][node["node"]]["memory_pressure_full_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "memory", "full")
+                nodes["nodes"][node["node"]]["memory_pressure_some_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "memory", "some", spikes=True)
+                nodes["nodes"][node["node"]]["memory_pressure_full_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "memory", "full", spikes=True)
+                nodes["nodes"][node["node"]]["memory_pressure_hot"] = False
                nodes["nodes"][node["node"]]["disk_total"] = node["maxdisk"]
                nodes["nodes"][node["node"]]["disk_assigned"] = 0
                nodes["nodes"][node["node"]]["disk_used"] = node["disk"]
@@ -85,43 +96,22 @@ class Nodes:
                nodes["nodes"][node["node"]]["disk_assigned_percent"] = nodes["nodes"][node["node"]]["disk_assigned"] / nodes["nodes"][node["node"]]["disk_total"] * 100
                nodes["nodes"][node["node"]]["disk_free_percent"] = nodes["nodes"][node["node"]]["disk_free"] / node["maxdisk"] * 100
                nodes["nodes"][node["node"]]["disk_used_percent"] = nodes["nodes"][node["node"]]["disk_used"] / node["maxdisk"] * 100
+                nodes["nodes"][node["node"]]["disk_pressure_some_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "disk", "some")
+                nodes["nodes"][node["node"]]["disk_pressure_full_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "disk", "full")
+                nodes["nodes"][node["node"]]["disk_pressure_some_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "disk", "some", spikes=True)
+                nodes["nodes"][node["node"]]["disk_pressure_full_spikes_percent"] = Nodes.get_node_rrd_data(proxmox_api, node["node"], "disk", "full", spikes=True)
+                nodes["nodes"][node["node"]]["disk_pressure_hot"] = False

                # Evaluate if node should be set to maintenance mode
-                if Nodes.set_node_maintenance(proxlb_config, node["node"]):
+                if Nodes.set_node_maintenance(proxmox_api, proxlb_config, node["node"]):
                    nodes["nodes"][node["node"]]["maintenance"] = True

-                # Generate the intial cluster statistics within the same loop to avoid a further one.
-                logger.debug(f"Updating cluster statistics by online node {node['node']}.")
-                cluster["cluster"]["node_count"] = cluster["cluster"].get("node_count", 0) + 1
-                cluster["cluster"]["cpu_total"] = cluster["cluster"].get("cpu_total", 0) + nodes["nodes"][node["node"]]["cpu_total"]
-                cluster["cluster"]["cpu_used"] = cluster["cluster"].get("cpu_used", 0) + nodes["nodes"][node["node"]]["cpu_used"]
-                cluster["cluster"]["cpu_free"] = cluster["cluster"].get("cpu_free", 0) + nodes["nodes"][node["node"]]["cpu_free"]
-                cluster["cluster"]["cpu_free_percent"] = cluster["cluster"].get("cpu_free", 0) / cluster["cluster"].get("cpu_total", 0) * 100
-                cluster["cluster"]["cpu_used_percent"] = cluster["cluster"].get("cpu_used", 0) / cluster["cluster"].get("cpu_total", 0) * 100
-                cluster["cluster"]["memory_total"] = cluster["cluster"].get("memory_total", 0) + nodes["nodes"][node["node"]]["memory_total"]
-                cluster["cluster"]["memory_used"] = cluster["cluster"].get("memory_used", 0) + nodes["nodes"][node["node"]]["memory_used"]
-                cluster["cluster"]["memory_free"] = cluster["cluster"].get("memory_free", 0) + nodes["nodes"][node["node"]]["memory_free"]
-                cluster["cluster"]["memory_free_percent"] = cluster["cluster"].get("memory_free", 0) / cluster["cluster"].get("memory_total", 0) * 100
-                cluster["cluster"]["memory_used_percent"] = cluster["cluster"].get("memory_used", 0) / cluster["cluster"].get("memory_total", 0) * 100
-                cluster["cluster"]["disk_total"] = cluster["cluster"].get("disk_total", 0) + nodes["nodes"][node["node"]]["disk_total"]
-                cluster["cluster"]["disk_used"] = cluster["cluster"].get("disk_used", 0) + nodes["nodes"][node["node"]]["disk_used"]
-                cluster["cluster"]["disk_free"] = cluster["cluster"].get("disk_free", 0) + nodes["nodes"][node["node"]]["disk_free"]
-                cluster["cluster"]["disk_free_percent"] = cluster["cluster"].get("disk_free", 0) / cluster["cluster"].get("disk_total", 0) * 100
-                cluster["cluster"]["disk_used_percent"] = cluster["cluster"].get("disk_used", 0) / cluster["cluster"].get("disk_total", 0) * 100
-
-                cluster["cluster"]["node_count_available"] = cluster["cluster"].get("node_count_available", 0) + 1
-                cluster["cluster"]["node_count_overall"] = cluster["cluster"].get("node_count_overall", 0) + 1
-
-            # Update the cluster statistics by offline nodes to have the overall count of nodes in the cluster
-            else:
-                logger.debug(f"Updating cluster statistics by offline node {node['node']}.")
-                cluster["cluster"]["node_count_overall"] = cluster["cluster"].get("node_count_overall", 0) + 1
-
+        logger.debug(f"Node metrics collected: {nodes}")
        logger.debug("Finished: get_nodes.")
-        return nodes, cluster
+        return nodes

    @staticmethod
-    def set_node_maintenance(proxlb_config: Dict[str, Any], node_name: str) -> Dict[str, Any]:
+    def set_node_maintenance(proxmox_api, proxlb_config: Dict[str, Any], node_name: str) -> Dict[str, Any]:
        """
        Set nodes to maintenance mode based on the provided configuration.

@@ -129,6 +119,7 @@ class Nodes:
        based on the configuration provided in proxlb_config.

        Args:
+            proxmox_api (any): The Proxmox API client instance.
            proxlb_config (Dict[str, Any]): A dictionary containing the ProxLB configuration, including maintenance nodes.
            node_name: (str): The current node name within the outer iteration.

@@ -137,11 +128,24 @@ class Nodes:
        """
        logger.debug("Starting: set_node_maintenance.")

+        # Evaluate maintenance mode by config
        if proxlb_config.get("proxmox_cluster", None).get("maintenance_nodes", None) is not None:
            if len(proxlb_config.get("proxmox_cluster", {}).get("maintenance_nodes", [])) > 0:
                if node_name in proxlb_config.get("proxmox_cluster", {}).get("maintenance_nodes", []):
-                    logger.warning(f"Node: {node_name} has been set to maintenance mode.")
+                    logger.info(f"Node: {node_name} has been set to maintenance mode (by ProxLB config).")
                    return True
+                else:
+                    logger.debug(f"Node: {node_name} is not in maintenance mode by ProxLB config.")
+
+        # Evaluate maintenance mode by Proxmox HA
+        for ha_element in proxmox_api.cluster.ha.status.current.get():
+            if ha_element.get("status"):
+                if "maintenance mode" in ha_element.get("status"):
+                    if ha_element.get("node") == node_name:
+                        logger.info(f"Node: {node_name} has been set to maintenance mode (by Proxmox HA API).")
+                        return True
+                    else:
+                        logger.debug(f"Node: {node_name} is not in maintenance mode by Proxmox HA API.")

        logger.debug("Finished: set_node_maintenance.")

@@ -165,7 +169,87 @@ class Nodes:
        if proxlb_config.get("proxmox_cluster", None).get("ignore_nodes", None) is not None:
            if len(proxlb_config.get("proxmox_cluster", {}).get("ignore_nodes", [])) > 0:
                if node_name in proxlb_config.get("proxmox_cluster", {}).get("ignore_nodes", []):
-                    logger.warning(f"Node: {node_name} has been set to be ignored. Not adding node!")
+                    logger.info(f"Node: {node_name} has been set to be ignored. Not adding node!")
                    return True

        logger.debug("Finished: set_node_ignore.")
+
+    @staticmethod
+    def get_node_rrd_data(proxmox_api, node_name: str, object_name: str, object_type: str, spikes=False) -> float:
+        """
+        Retrieves the rrd data metrics for a specific resource (CPU, memory, disk) of a node.
+
+        Args:
+            proxmox_api (Any): The Proxmox API client instance.
+            node_name (str): The name of the node hosting the guest.
+            object_name (str): The resource type to query (e.g., 'cpu', 'memory', 'disk').
+            object_type (str, optional): The pressure type ('some', 'full') or None for average usage.
+            spikes (bool, optional): Whether to consider spikes in the calculation. Defaults to False.
+
+        Returns:
+            float: The calculated average usage value for the specified resource.
+        """
+        logger.debug("Starting: get_node_rrd_data.")
+        time.sleep(0.1)
+
+        try:
+            if spikes:
+                logger.debug(f"Getting spike RRD data for {object_name} from node: {node_name}.")
+                node_data_rrd = proxmox_api.nodes(node_name).rrddata.get(timeframe="hour", cf="MAX")
+            else:
+                logger.debug(f"Getting average RRD data for {object_name} from node: {node_name}.")
+                node_data_rrd = proxmox_api.nodes(node_name).rrddata.get(timeframe="hour", cf="AVERAGE")
+
+        except Exception:
+            logger.error(f"Failed to retrieve RRD data for guest: {node_name}. Using 0.0 as value.")
+            logger.debug("Finished: get_node_rrd_data.")
+            return 0.0
+
+        lookup_key = f"pressure{object_name}{object_type}"
+
+        if spikes:
+            # RRD data is collected every minute, so we look at the last 6 entries
+            # and take the maximum value to represent the spike
+            rrd_data_value = [row.get(lookup_key) for row in node_data_rrd if row.get(lookup_key) is not None]
+            rrd_data_value = max(rrd_data_value[-6:], default=0.0)
+        else:
+            # Calculate the average value from the RRD data entries
+            rrd_data_value = sum(entry.get(lookup_key, 0.0) for entry in node_data_rrd) / len(node_data_rrd)
+
+        logger.debug(f"RRD data (spike: {spikes}) for {object_name} from node: {node_name}: {rrd_data_value}")
+        logger.debug("Finished: get_node_rrd_data.")
+        return rrd_data_value
+
+    @staticmethod
+    def get_node_pve_version(proxmox_api, node_name: str) -> float:
+        """
+        Return the Proxmox VE (PVE) version for a given node by querying the Proxmox API.
+
+        This function calls proxmox_api.nodes(node_name).version.get() and extracts the
+        'version' field from the returned mapping. The value is expected to be numeric
+        (or convertible to float) and is returned as a float.
+
+        Args:
+            proxmox_api (Any): The Proxmox API client instance.
+            node_name (str): The name of the node hosting the guest.
+
+        Returns:
+            float: The PVE version for the specified node as a floating point number.
+
+        Raises:
+        Exception: If the proxmox_api call fails, returns an unexpected structure, or the
+                   'version' field is missing or cannot be converted to float. Callers should
+                    handle or propagate exceptions as appropriate.
+        """
+        logger.debug("Starting: get_node_pve_version.")
+        time.sleep(0.1)
+
+        try:
+            logger.debug(f"Trying to get PVE version for node: {node_name}.")
+            version = proxmox_api.nodes(node_name).version.get()
+        except Exception:
+            logger.error(f"Failed to get PVE version for node: {node_name}.")
+
+        logger.debug(f"Got version {version['version']} for node {node_name}.")
+        logger.debug("Finished: get_node_pve_version.")
+        return version["version"]
--- a/proxlb/models/pools.py
+++ b/proxlb/models/pools.py
@@ -0,0 +1,111 @@
+"""
+The Pools class retrieves all present pools defined on a Proxmox cluster
+including the chield objects.
+"""
+
+__author__ = "Florian Paul Azim Hoberg <gyptazy>"
+__copyright__ = "Copyright (C) 2025 Florian Paul Azim Hoberg (@gyptazy)"
+__license__ = "GPL-3.0"
+
+
+from typing import Dict, Any
+from utils.logger import SystemdLogger
+from models.tags import Tags
+import time
+
+logger = SystemdLogger()
+
+
+class Pools:
+    """
+    The Pools class retrieves all present pools defined on a Proxmox cluster
+    including the chield objects.
+
+    Methods:
+        __init__:
+            Initializes the Pools class.
+
+        get_pools(proxmox_api: any) -> Dict[str, Any]:
+            Retrieve pool definitions and membership from the Proxmox cluster.
+            Returns a dict with a top-level "pools" mapping each poolid to
+            {"name": <poolid>, "members": [<member_names>...]}.
+            This method does not collect per-member metrics or perform node filtering.
+    """
+    def __init__(self):
+        """
+        Initializes the Pools class with the provided ProxLB data.
+        """
+
+    @staticmethod
+    def get_pools(proxmox_api: any) -> Dict[str, Any]:
+        """
+        Retrieve all pools and their members from a Proxmox cluster.
+
+        Queries the Proxmox API for pool definitions and returns a dictionary
+        containing each pool's id/name and a list of its member VM/CT names.
+        This function does not perform per-member metric collection or node
+        filtering — it only gathers pool membership information.
+
+        Args:
+            proxmox_api (any): Proxmox API client instance.
+
+        Returns:
+            Dict[str, Any]: Dictionary with a top-level "pools" key mapping poolid
+                    to {"name": <poolid>, "members": [<member_names>...]}.
+        """
+        logger.debug("Starting: get_pools.")
+        pools = {"pools": {}}
+
+        # Pool objects: iterate over all pools in the cluster.
+        # We keep pool members even if their nodes are ignored so resource accounting
+        # for rebalancing remains correct and we avoid overprovisioning nodes.
+        for pool in proxmox_api.pools.get():
+            logger.debug(f"Got pool: {pool['poolid']}")
+            pools['pools'][pool['poolid']] = {}
+            pools['pools'][pool['poolid']]['name'] = pool['poolid']
+            pools['pools'][pool['poolid']]['members'] = []
+
+            # Fetch pool details and collect member names
+            pool_details = proxmox_api.pools(pool['poolid']).get()
+            for member in pool_details.get("members", []):
+                logger.debug(f"Got member: {member['name']} for pool: {pool['poolid']}")
+                pools['pools'][pool['poolid']]['members'].append(member["name"])
+
+        logger.debug("Finished: get_pools.")
+        return pools
+
+    @staticmethod
+    def get_pools_for_guest(guest_name: str, pools: Dict[str, Any]) -> Dict[str, Any]:
+        """
+        Return the list of pool names that include the given guest.
+
+        Args:
+            guest_name (str): Name of the VM or CT to look up.
+            pools (Dict[str, Any]): Pools structure as returned by get_pools(),
+            expected to contain a top-level "pools" mapping each poolid to
+            {"name": <poolid>, "members": [<member_names>...]}.
+
+        Returns:
+            list[str]: Names of pools the guest is a member of (empty list if none).
+        """
+        logger.debug("Starting: get_pools_for_guests.")
+        guest_pools = []
+
+        for pool in pools.items():
+            for pool_id, pool_data in pool[1].items():
+
+                if type(pool_data) is dict:
+                    pool_name = pool_data.get("name", "")
+                    pool_name_members = pool_data.get("members", [])
+
+                    if guest_name in pool_name_members:
+                        logger.debug(f"Guest: {guest_name} is member of Pool: {pool_name}.")
+                        guest_pools.append(pool_name)
+                    else:
+                        logger.debug(f"Guest: {guest_name} is NOT member of Pool: {pool_name}.")
+
+                else:
+                    logger.debug(f"Pool data for pool_id {pool_id} is not a dict: {pool_data}")
+
+        logger.debug("Finished: get_pools_for_guests.")
+        return guest_pools
--- a/proxlb/models/tags.py
+++ b/proxlb/models/tags.py
@@ -12,7 +12,9 @@ __license__ = "GPL-3.0"

 import time
 from typing import List
+from typing import Dict, Any
 from utils.logger import SystemdLogger
+from utils.helper import Helper

 logger = SystemdLogger()

@@ -78,15 +80,18 @@ class Tags:
        return tags

    @staticmethod
-    def get_affinity_groups(tags: List[str]) -> List[str]:
+    def get_affinity_groups(tags: List[str], pools: List[str], proxlb_config: Dict[str, Any]) -> List[str]:
        """
        Get affinity tags for a guest from the Proxmox cluster by the API.

-        This method retrieves all tags for a given guest and evaluates the
-        affinity tags which are required during the balancing calculations.
+        This method retrieves all tags for a given guest or based on a
+        membership of a pool and evaluates the affinity groups which are
+        required during the balancing calculations.

        Args:
            tags (List): A list holding all defined tags for a given guest.
+            pools (List): A list holding all defined pools for a given guest.
+            proxlb_config (Dict): A dict holding the ProxLB configuration.

        Returns:
            List: A list including all affinity tags for the given guest.
@@ -97,21 +102,36 @@ class Tags:
        if len(tags) > 0:
            for tag in tags:
                if tag.startswith("plb_affinity"):
+                    logger.debug(f"Adding affinity group for tag {tag}.")
                    affinity_tags.append(tag)
+                else:
+                    logger.debug(f"Skipping affinity group for tag {tag}.")
+
+        if len(pools) > 0:
+            for pool in pools:
+                if pool in (proxlb_config['balancing'].get('pools') or {}):
+                    if proxlb_config['balancing']['pools'][pool].get('type', None) == 'affinity':
+                        logger.debug(f"Adding affinity group for pool {pool}.")
+                        affinity_tags.append(pool)
+                else:
+                    logger.debug(f"Skipping affinity group for pool {pool}.")

        logger.debug("Finished: get_affinity_groups.")
        return affinity_tags

    @staticmethod
-    def get_anti_affinity_groups(tags: List[str]) -> List[str]:
+    def get_anti_affinity_groups(tags: List[str], pools: List[str], proxlb_config: Dict[str, Any]) -> List[str]:
        """
        Get anti-affinity tags for a guest from the Proxmox cluster by the API.

-        This method retrieves all tags for a given guest and evaluates the
-        anti-affinity tags which are required during the balancing calculations.
+        This method retrieves all tags for a given guest or based on a
+        membership of a pool and evaluates the anti-affinity groups which
+        are required during the balancing calculations.

        Args:
            tags (List): A list holding all defined tags for a given guest.
+            pools (List): A list holding all defined pools for a given guest.
+            proxlb_config (Dict): A dict holding the ProxLB configuration.

        Returns:
            List: A list including all anti-affinity tags for the given guest..
@@ -122,7 +142,19 @@ class Tags:
        if len(tags) > 0:
            for tag in tags:
                if tag.startswith("plb_anti_affinity"):
+                    logger.debug(f"Adding anti-affinity group for tag {tag}.")
                    anti_affinity_tags.append(tag)
+                else:
+                    logger.debug(f"Skipping anti-affinity group for tag {tag}.")
+
+        if len(pools) > 0:
+            for pool in pools:
+                if pool in (proxlb_config['balancing'].get('pools') or {}):
+                    if proxlb_config['balancing']['pools'][pool].get('type', None) == 'anti-affinity':
+                        logger.debug(f"Adding anti-affinity group for pool {pool}.")
+                        anti_affinity_tags.append(pool)
+                else:
+                    logger.debug(f"Skipping anti-affinity group for pool {pool}.")

        logger.debug("Finished: get_anti_affinity_groups.")
        return anti_affinity_tags
@@ -153,27 +185,55 @@ class Tags:
        return ignore_tag

    @staticmethod
-    def get_node_relationship(tags: List[str]) -> str:
+    def get_node_relationships(tags: List[str], nodes: Dict[str, Any], pools: List[str], proxlb_config: Dict[str, Any]) -> str:
        """
        Get a node relationship tag for a guest from the Proxmox cluster by the API to pin
-        a guest to a node.
+        a guest to a node or by defined pools from ProxLB configuration.

        This method retrieves a relationship tag between a guest and a specific
        hypervisor node to pin the guest to a specific node (e.g., for licensing reason).

        Args:
            tags (List): A list holding all defined tags for a given guest.
+            nodes (Dict): A dictionary holding all available nodes in the cluster.
+            pools (List): A list holding all defined pools for a given guest.
+            proxlb_config (Dict): A dict holding the ProxLB configuration.

        Returns:
-            Str: The related hypervisor node name.
+            Str: The related hypervisor node name(s).
        """
-        logger.debug("Starting: get_node_relationship.")
-        node_relationship_tag = False
+        logger.debug("Starting: get_node_relationships.")
+        node_relationship_tags = []

        if len(tags) > 0:
+            logger.debug("Validating node pinning by tags.")
            for tag in tags:
                if tag.startswith("plb_pin"):
                    node_relationship_tag = tag.replace("plb_pin_", "")

-        logger.debug("Finished: get_node_relationship.")
-        return node_relationship_tag
+                    # Validate if the node to pin is present in the cluster
+                    if Helper.validate_node_presence(node_relationship_tag, nodes):
+                        logger.debug(f"Tag {node_relationship_tag} is valid! Defined node exists in the cluster.")
+                        logger.debug(f"Setting node relationship because of tag {tag} to {node_relationship_tag}.")
+                        node_relationship_tags.append(node_relationship_tag)
+                    else:
+                        logger.warning(f"Tag {node_relationship_tag} is invalid! Defined node does not exist in the cluster. Not applying pinning.")
+
+        if len(pools) > 0:
+            logger.debug("Validating node pinning by pools.")
+            for pool in pools:
+                if pool in (proxlb_config['balancing'].get('pools') or {}):
+
+                    node = proxlb_config['balancing']['pools'][pool].get('pin', None)
+                    # Validate if the node to pin is present in the cluster
+                    if Helper.validate_node_presence(node, nodes):
+                        logger.debug(f"Pool pinning tag {node} is valid! Defined node exists in the cluster.")
+                        logger.debug(f"Setting node relationship because of pool {pool} to {node}.")
+                        node_relationship_tags.append(node)
+                    else:
+                        logger.warning(f"Pool pinning tag {node} is invalid! Defined node does not exist in the cluster. Not applying pinning.")
+                else:
+                    logger.debug(f"Skipping pinning for pool {pool}. Pool is not defined in ProxLB configuration.")
+
+        logger.debug("Finished: get_node_relationships.")
+        return node_relationship_tags
--- a/proxlb/utils/helper.py
+++ b/proxlb/utils/helper.py
@@ -10,11 +10,13 @@ __license__ = "GPL-3.0"

 import json
 import uuid
+import re
 import sys
 import time
 import utils.version
 from utils.logger import SystemdLogger
 from typing import Dict, Any
+from types import FrameType

 logger = SystemdLogger()

@@ -40,6 +42,8 @@ class Helper:
        get_daemon_mode(proxlb_config: Dict[str, Any]) -> None:
            Checks if the daemon mode is active and handles the scheduling accordingly.
    """
+    proxlb_reload = False
+
    def __init__(self):
        """
        Initializes the general Helper clas.
@@ -143,6 +147,39 @@ class Helper:

        logger.debug("Finished: get_daemon_mode.")

+    @staticmethod
+    def get_service_delay(proxlb_config: Dict[str, Any]) -> None:
+        """
+        Checks if a start up delay for the service is defined and waits to proceed until
+        the time is up.
+
+        Parameters:
+            proxlb_config (Dict[str, Any]): A dictionary containing the ProxLB configuration.
+
+        Returns:
+            None
+        """
+        logger.debug("Starting: get_service_delay.")
+        if proxlb_config.get("service", {}).get("delay", {}).get("enable", False):
+
+            # Convert hours to seconds
+            if proxlb_config["service"]["delay"].get("format", "hours") == "hours":
+                sleep_seconds = proxlb_config.get("service", {}).get("delay", {}).get("time", 1) * 3600
+            # Convert minutes to seconds
+            elif proxlb_config["service"]["delay"].get("format", "hours") == "minutes":
+                sleep_seconds = proxlb_config.get("service", {}).get("delay", {}).get("time", 60) * 60
+            else:
+                logger.error("Invalid format for service delay. Please use 'hours' or 'minutes'.")
+                sys.exit(1)
+
+            logger.info(f"Service delay active: First run in: {proxlb_config.get('service', {}).get('delay', {}).get('time', 1)} {proxlb_config['service']['delay'].get('format', 'hours')}.")
+            time.sleep(sleep_seconds)
+
+        else:
+            logger.debug("Service delay not active. Proceeding without delay.")
+
+        logger.debug("Finished: get_service_delay.")
+
    @staticmethod
    def print_json(proxlb_config: Dict[str, Any], print_json: bool = False) -> None:
        """
@@ -162,3 +199,111 @@ class Helper:
            print(json.dumps(filtered_data, indent=4))

        logger.debug("Finished: print_json.")
+
+    @staticmethod
+    def handler_sighup(signum: int, frame: FrameType) -> None:
+        """
+        Signal handler for SIGHUP.
+
+        This method is triggered when the process receives a SIGHUP signal.
+        It sets the `proxlb_reload` class variable to True to indicate that
+        configuration should be reloaded in the main loop.
+
+        Args:
+            signum (int): The signal number (expected to be signal.SIGHUP).
+            frame (frame object): Current stack frame (unused but required by signal handler signature).
+        """
+        logger.debug("Starting: handle_sighup.")
+        logger.debug("Got SIGHUP signal. Reloading...")
+        Helper.proxlb_reload = True
+        logger.debug("Finished: handle_sighup.")
+
+    @staticmethod
+    def handler_sigint(signum: int, frame: FrameType) -> None:
+        """
+        Signal handler for SIGINT. (triggered by CTRL+C).
+
+        Args:
+            signum (int): The signal number (e.g., SIGINT).
+            frame (FrameType): The current stack frame when the signal was received.
+
+        Returns:
+            None
+        """
+        exit_message = "ProxLB has been successfully terminated by user."
+        logger.debug(exit_message)
+        print(f"\n {exit_message}")
+        sys.exit(0)
+
+    @staticmethod
+    def get_host_port_from_string(host_object):
+        """
+        Parses a string containing a host (IPv4, IPv6, or hostname) and an optional port, and returns a tuple of (host, port).
+
+        Supported formats:
+        - Hostname or IPv4 without port: "example.com" or "192.168.0.1"
+        - Hostname or IPv4 with port: "example.com:8006" or "192.168.0.1:8006"
+        - IPv6 in brackets with optional port: "[fc00::1]" or "[fc00::1]:8006"
+        - IPv6 without brackets, port is assumed after last colon: "fc00::1:8006"
+
+        If no port is specified, port 8006 is used as the default.
+
+        Args:
+            host_object (str): A string representing a host with or without a port.
+
+        Returns:
+            tuple: A tuple (host: str, port: int)
+        """
+        logger.debug("Starting: get_host_port_from_string.")
+
+        # IPv6 (with or without port, written in brackets)
+        match = re.match(r'^\[(.+)\](?::(\d+))?$', host_object)
+        if match:
+            host = match.group(1)
+            port = int(match.group(2)) if match.group(2) else 8006
+            return host, port
+
+        # Count colons to identify IPv6 addresses without brackets
+        colon_count = host_object.count(':')
+
+        # IPv4 or hostname without port
+        if colon_count == 0:
+            return host_object, 8006
+
+        # IPv4 or hostname with port
+        elif colon_count == 1:
+            host, port = host_object.split(':')
+            return host, int(port)
+
+        # IPv6 (with or without port, assume last colon is port)
+        else:
+            parts = host_object.rsplit(':', 1)
+            try:
+                port = int(parts[1])
+                return parts[0], port
+            except ValueError:
+                return host_object, 8006
+
+    @staticmethod
+    def validate_node_presence(node: str, nodes: Dict[str, Any]) -> bool:
+        """
+        Validates whether a given node exists in the provided cluster nodes dictionary.
+
+        Args:
+            node (str): The name of the node to validate.
+            nodes (Dict[str, Any]): A dictionary containing cluster information.
+                                    Must include a "nodes" key mapping to a dict of available nodes.
+
+        Returns:
+            bool: True if the node exists in the cluster, False otherwise.
+        """
+        logger.debug("Starting: validate_node_presence.")
+
+        if node in nodes["nodes"].keys():
+            logger.info(f"Node {node} found in cluster. Applying pinning.")
+            logger.debug("Finished: validate_node_presence.")
+            return True
+        else:
+            logger.warning(f"Node {node} not found in cluster. Not applying pinning!")
+            logger.debug("Finished: validate_node_presence.")
+            return False
--- a/proxlb/utils/logger.py
+++ b/proxlb/utils/logger.py
@@ -88,7 +88,7 @@ class SystemdLogger:
        # logging is preferred.
        if SYSTEMD_PRESENT:
            # Add a JournalHandler for systemd integration
-            handler = JournalHandler()
+            handler = JournalHandler(SYSLOG_IDENTIFIER="ProxLB")
        else:
            # Add a stdout handler as a fallback
            handler = logging.StreamHandler(sys.stdout)
--- a/proxlb/utils/proxmox_api.py
+++ b/proxlb/utils/proxmox_api.py
@@ -33,6 +33,7 @@ try:
 except ImportError:
    URLLIB3_PRESENT = False
 from typing import Dict, Any
+from utils.helper import Helper
 from utils.logger import SystemdLogger


@@ -134,6 +135,14 @@ class ProxmoxApi:
        proxlb_credentials = proxlb_config["proxmox_api"]
        present_auth_pass = "pass" in proxlb_credentials
        present_auth_secret = "token_secret" in proxlb_credentials
+        token_id = proxlb_credentials.get("token_id", None)
+
+        if token_id:
+            non_allowed_chars = ["@", "!"]
+            for char in non_allowed_chars:
+                if char in token_id:
+                    logger.error(f"Wrong user/token format defined. User and token id must be splitted! Please see: https://github.com/gyptazy/ProxLB/blob/main/docs/03_configuration.md#required-permissions-for-a-user")
+                    sys.exit(1)

        if present_auth_pass and present_auth_secret:
            logger.critical(f"Username/password and API token authentication are mutal exclusive. Please use only one!")
@@ -189,9 +198,9 @@ class ProxmoxApi:
            api_connection_wait_time = proxlb_config["proxmox_api"].get("wait_time", 1)

            for api_connection_attempt in range(api_connection_retries):
-                validated = self.test_api_proxmox_host(host)
-                if validated:
-                    validated_api_hosts.append(validated)
+                validated_api_host, api_port = self.test_api_proxmox_host(host)
+                if validated_api_host:
+                    validated_api_hosts.append(validated_api_host)
                    break
                else:
                    logger.warning(f"Attempt {api_connection_attempt + 1}/{api_connection_retries} failed for host {host}. Retrying in {api_connection_wait_time} seconds...")
@@ -200,7 +209,7 @@ class ProxmoxApi:
        if len(validated_api_hosts) > 0:
            # Choose a random host to distribute the load across the cluster
            # as a simple load balancing mechanism.
-            return random.choice(validated_api_hosts)
+            return random.choice(validated_api_hosts), api_port

        logger.critical("No valid Proxmox API hosts found.")
        print("No valid Proxmox API hosts found.")
@@ -228,6 +237,10 @@ class ProxmoxApi:
        """
        logger.debug("Starting: test_api_proxmox_host.")

+        # Validate for custom ports in API hosts which might indicate
+        # that an external loadbalancer will be used.
+        host, port = Helper.get_host_port_from_string(host)
+
        # Try resolving DNS to IP and log non-resolvable ones
        try:
            ip = socket.getaddrinfo(host, None, socket.AF_UNSPEC)
@@ -239,12 +252,12 @@ class ProxmoxApi:
        for address_type in ip:
            if address_type[0] == socket.AF_INET:
                logger.debug(f"{host} is type ipv4.")
-                if self.test_api_proxmox_host_ipv4(host):
-                    return host
+                if self.test_api_proxmox_host_ipv4(host, port):
+                    return host, port
            elif address_type[0] == socket.AF_INET6:
                logger.debug(f"{host} is type ipv6.")
-                if self.test_api_proxmox_host_ipv6(host):
-                    return host
+                if self.test_api_proxmox_host_ipv6(host, port):
+                    return host, port
            else:
                return False

@@ -331,7 +344,15 @@ class ProxmoxApi:
        permissions_available = []

        # Get the permissions for the current user/token from API
-        permissions = proxmox_api.access.permissions.get()
+        try:
+            permissions = proxmox_api.access.permissions.get()
+        except proxmoxer.core.ResourceException as api_error:
+            if "no such user" in str(api_error):
+                logger.error("Authentication to Proxmox API not possible: User not known - please check your username and config file.")
+                sys.exit(1)
+            else:
+                logger.error(f"Proxmox API error: {api_error}")
+                sys.exit(1)

        # Get all available permissions of the current user/token
        for path, permission in permissions.items():
@@ -378,7 +399,7 @@ class ProxmoxApi:
        self.validate_config(proxlb_config)

        # Get a valid Proxmox API endpoint
-        proxmox_api_endpoint = self.api_connect_get_hosts(proxlb_config, proxlb_config.get("proxmox_api", {}).get("hosts", []))
+        proxmox_api_endpoint, proxmox_api_port = self.api_connect_get_hosts(proxlb_config, proxlb_config.get("proxmox_api", {}).get("hosts", []))

        # Disable warnings for SSL certificate validation
        if not proxlb_config.get("proxmox_api").get("ssl_verification", True):
@@ -392,6 +413,7 @@ class ProxmoxApi:
            if proxlb_config.get("proxmox_api").get("token_secret", False):
                proxmox_api = proxmoxer.ProxmoxAPI(
                    proxmox_api_endpoint,
+                    port=proxmox_api_port,
                    user=proxlb_config.get("proxmox_api").get("user", True),
                    token_name=proxlb_config.get("proxmox_api").get("token_id", True),
                    token_value=proxlb_config.get("proxmox_api").get("token_secret", True),
@@ -401,6 +423,7 @@ class ProxmoxApi:
            else:
                proxmox_api = proxmoxer.ProxmoxAPI(
                    proxmox_api_endpoint,
+                    port=proxmox_api_port,
                    user=proxlb_config.get("proxmox_api").get("user", True),
                    password=proxlb_config.get("proxmox_api").get("pass", True),
                    verify_ssl=proxlb_config.get("proxmox_api").get("ssl_verification", True),
@@ -420,6 +443,5 @@ class ProxmoxApi:
            sys.exit(2)

        logger.info(f"API connection to host {proxmox_api_endpoint} succeeded.")
-
        logger.debug("Finished: api_connect.")
        return proxmox_api
--- a/proxlb/utils/version.py
+++ b/proxlb/utils/version.py
@@ -1,7 +1,7 @@
 __app_name__ = "ProxLB"
-__app_desc__ = "A DRS alike loadbalancer for Proxmox clusters."
+__app_desc__ = "An advanced resource scheduler and load balancer for Proxmox clusters."
 __author__ = "Florian Paul Azim Hoberg <gyptazy>"
 __copyright__ = "Copyright (C) 2025 Florian Paul Azim Hoberg (@gyptazy)"
 __license__ = "GPL-3.0"
-__version__ = "1.1.2b"
+__version__ = "1.1.9.1"
 __url__ = "https://github.com/gyptazy/ProxLB"
--- a/service/proxlb.service
+++ b/service/proxlb.service
@@ -6,6 +6,8 @@ Wants=network-online.target pveproxy.service
 [Service]
 ExecStart=python3 /usr/lib/python3/dist-packages/proxlb/main.py -c /etc/proxlb/proxlb.yaml
 User=plb
+ExecReload=/bin/kill -HUP $MAINPID
+KillMode=process

 [Install]
 WantedBy=multi-user.target
--- a/setup.py
+++ b/setup.py
@@ -2,9 +2,9 @@ from setuptools import setup

 setup(
    name="proxlb",
-    version="1.1.2b",
-    description="A DRS alike loadbalancer for Proxmox clusters.",
-    long_description="An advanced DRS alike loadbalancer for Proxmox clusters that also supports maintenance modes and affinity/anti-affinity rules.",
+    version="1.1.9.1",
+    description="An advanced resource scheduler and load balancer for Proxmox clusters.",
+    long_description="An advanced resource scheduler and load balancer for Proxmox clusters that also supports maintenance modes and affinity/anti-affinity rules.",
    author="Florian Paul Azim Hoberg",
    author_email="gyptazy@gyptazy.com",
    maintainer="Florian Paul Azim Hoberg",
Author	SHA1	Message	Date
gyptazy	15436c431f	Merge pull request #355 from gyptazy/release/1.1.9.1 release: Create hotfix release 1.1.9.1	2025-10-30 17:53:14 +01:00
gyptazy	33f6ff8db0	release: Create hotfix release 1.1.9.1 Fixes: #352	2025-10-30 17:46:54 +01:00
gyptazy	84628f232e	Merge pull request #353 from gyptazy/fix/352-syntax-fix fix: Adjust quoting of f-strings	2025-10-30 17:39:42 +01:00
gyptazy	6a91afd405	fix: Adjust quoting of f-strings Fixes: #352	2025-10-30 17:18:17 +01:00
gyptazy	909643a09f	Merge pull request #351 from gyptazy/release/1.1.9final release: Create final 1.1.9 Release	2025-10-30 07:55:45 +01:00
Florian Paul Azim Hoberg	7de1ba366b	release: Create final 1.1.9 Release Fixes: #350 Sponsored-by: credativ GmbH <https://credativ.de>	2025-10-30 07:50:25 +01:00
gyptazy	0cb19fab34	Merge pull request #344 from gyptazy/release/1.1.9b Release/1.1.9b	2025-10-27 16:47:48 +01:00
gyptazy	972b10b7e5	Merge pull request #349 from gyptazy/fix/343-config-validation fix: Make pool based configuration more robust	2025-10-27 14:53:35 +01:00
Florian Paul Azim Hoberg	7fa110e465	fix: Make pool based configuration more robust Fixes: #343	2025-10-27 14:49:40 +01:00
gyptazy	948df0316b	Merge pull request #347 from gyptazy/feature/343-affinity-rules-by-pools feature(Pools): Add affinity/anti-affinity support by pools	2025-10-27 09:50:50 +01:00
gyptazy	016378e37c	feature(Pools): Add affinity/anti-affinity support by pools Fixes: #343	2025-10-27 09:37:12 +01:00
gyptazy	8a193b9891	Merge pull request #346 from gyptazy/feature/memory-balancing-threshold feature(balancing): Add an optional threshold in percent for balancing	2025-10-23 12:13:49 +02:00
gyptazy	30e3b66be9	feature(balancing): Add an optional threshold in percent for balancing Fixes: #342	2025-10-23 12:10:01 +02:00
gyptazy	b9be405194	Merge pull request #345 from gyptazy/fix/version release: Set release version 1.1.9	2025-10-23 11:47:12 +02:00
gyptazy	ac108f2abe	release: Set release version 1.1.9	2025-10-23 11:45:59 +02:00
gyptazy	02b43d3ef7	Merge pull request #339 from gyptazy/feature/337-add-pressure-based-balancing feature(balancing): Add pressure (memory, cpu, disk) based balancing	2025-10-23 11:41:04 +02:00
Florian Paul Azim Hoberg	581d6d480b	feature(balancing): Add pressure (memory, cpu, disk) based balancing Fixes: #337	2025-10-23 11:32:25 +02:00
gyptazy	5b395b7f15	Merge pull request #334 from gyptazy/docs/changelog-docker-nonroot docs: Add missing changelog fragment for container changes	2025-10-09 09:26:30 +02:00
gyptazy	7d94c52883	docs: Add missing changelog fragment for container changes Fixes: #317	2025-10-09 09:25:07 +02:00
gyptazy	7d19788be1	Merge pull request #333 from gyptazy/release/prepare-1.1.8 release: Prepare release 1.1.8	2025-10-09 09:13:17 +02:00
gyptazy	0bbc5992ca	release: Prepare release 1.1.8 Fixes: #332	2025-10-09 09:05:17 +02:00
gyptazy	a4a5d9e68a	Merge pull request #331 from gyptazy/fix/329-add-proxlb-prefix-log fix(logging): Add a static ProxLB prefix to the log output when used by journal handler	2025-10-08 18:43:11 +02:00
gyptazy	af98ee8d5b	fix(logging): Add a static ProxLB prefix to the log output when used by journal handler Fixes: #329	2025-10-08 18:39:02 +02:00
gyptazy	afc93f7b21	docs: Fix typo in readme	2025-09-26 12:14:48 +02:00
gyptazy	bc6d8c8509	Merge pull request #325 from gyptazy/doc/324-adjust-readme-docs-chapter docs: Adjust the order and presence of the documentation in the readme.	2025-09-26 12:13:17 +02:00
gyptazy	6d50f32486	docs: Adjust the order and presence of the documentation in the readme. Fixes: #324	2025-09-26 12:12:28 +02:00
gyptazy	5fe49a9dc1	Merge pull request #323 from gyptazy/docs/fix-img-links docs: Fix broken images	2025-09-25 16:44:55 +02:00
gyptazy	fca1d1211c	docs: Fix broken images	2025-09-25 16:43:52 +02:00
no ack hosting ab	36388d9429	Secure Dockerfile and application (#317 ) fix: Run ProxLB in container as plb user instead of root * Run ProxLB as plb user * Run within a venv	2025-09-25 14:41:47 +02:00
gyptazy	3f424e9e6d	Merge pull request #320 from gyptazy/docs/enterprise-support docs: Add enterprise support list	2025-09-25 10:37:42 +02:00
Florian Paul Azim Hoberg	44a733aed3	docs: Add enterprise support list	2025-09-25 10:35:21 +02:00
gyptazy	2f44ff48a0	Merge pull request #319 from gyptazy/fix/318-conntrack-state-migrations-pve8 fix(balancing): Fix API errors when using conntrack aware migration with older PVE versions	2025-09-25 08:40:57 +02:00
Florian Paul Azim Hoberg	7b6db9cfdd	fix(balancing): Fix API errors when using conntrack aware migration with older PVE versions. Fixes: #318	2025-09-25 06:59:54 +02:00
gyptazy	8c473b416c	docs: Adjust description	2025-09-23 20:34:04 +02:00
gyptazy	51c8afe5c5	docs: Update description	2025-09-23 20:30:20 +02:00
gyptazy	a8a154abde	Merge pull request #316 from gyptazy/fix/missing-1.1.7-version release: Replace missing 1.1.7 version	2025-09-19 10:12:53 +02:00
Florian Paul Azim Hoberg	554a3eaf72	release: Replace missing 1.1.7 version	2025-09-19 10:11:54 +02:00
gyptazy	0b35987403	Merge pull request #315 from gyptazy/release/create-release-1.7 release: Add release 1.1.7 information	2025-09-19 10:08:42 +02:00
Florian Paul Azim Hoberg	d93048db69	release: Add release 1.1.7 information Fixes: #314	2025-09-19 07:54:01 +02:00
gyptazy	2aba7dbe23	Merge pull request #313 from gyptazy/adjust-style fix: Add missing new lines	2025-09-19 07:41:30 +02:00
Florian Paul Azim Hoberg	ba388dfd7c	fix: Add missing new lines	2025-09-19 07:39:42 +02:00
gyptazy	5aa8257d40	Merge pull request #311 from gyptazy/fix/310-guest-object-names-not-being-evaluated-in-log fix: Fix guest object names are not being evaluated in debug log.	2025-09-15 09:53:26 +02:00
Florian Paul Azim Hoberg	99fefe20bf	fix: Fix guest object names are not being evaluated in debug log. Fixes: #310	2025-09-15 09:48:59 +02:00
gyptazy	b9fb3a60e1	Merge pull request #309 from gyptazy/fix/308-fix-validating-migration-job-ids fix: Fix crash when validating absent migration job ids	2025-09-15 09:29:00 +02:00
Florian Paul Azim Hoberg	88b3288eb7	fix: Fix crash when validating absent migration job ids Fixes: #308	2025-09-15 09:26:01 +02:00
gyptazy	fa0113f112	Merge pull request #307 from gyptazy/feature/304-catch-keyboard-interrupt Add graceful shutdown for SIGINT (e.g., CTRL + C abort).	2025-09-15 07:30:30 +02:00
Florian Paul Azim Hoberg	0039ae9093	Add graceful shutdown for SIGINT (e.g., CTRL + C abort). Fixes: #304	2025-09-15 07:26:03 +02:00
gyptazy	e3bbf31fdd	Merge pull request #306 from gyptazy/feature/305-add-conntrack-options-for-migrations feature: Add conntrack state aware migrations of VMs.	2025-09-15 07:10:30 +02:00
Florian Paul Azim Hoberg	bf393c6bbf	feature: Add conntrack state aware migrations of VMs. Fixes: #305	2025-09-15 06:59:41 +02:00
gyptazy	7e5b72cfc7	docs: Adjust container version to 1.1.6.1	2025-09-04 17:39:43 +02:00
gyptazy	0ba76f80f3	Merge pull request #302 from gyptazy/fix/296-vm fix: Validate for node presence when pinning VMs to avoid crashing	2025-09-04 17:33:47 +02:00
gyptazy	b48ff9d677	fix: Validate for node presence when pinning VMs to avoid crashing Fixes: #296	2025-09-04 17:31:24 +02:00
gyptazy	b5c11af474	Merge pull request #301 from gyptazy/gyptazy-patch-1 Set image version to 1.1.6	2025-09-04 08:34:49 +02:00
gyptazy	af2992747d	Set image version to 1.1.6	2025-09-04 08:34:23 +02:00
gyptazy	fb8dc40c16	Merge pull request #300 from gyptazy/release/1.1.6 release: Create release 1.1.6	2025-09-04 07:55:38 +02:00
Florian Paul Azim Hoberg	34f1de8367	release: Create release 1.1.6 Fixes: #297	2025-09-04 07:53:29 +02:00
gyptazy	0e992e99de	Merge pull request #298 from gyptazy/fix/296-validate-node-presence-when-pinning fix: Validate for node presence when pinning guests to avoid crashing	2025-08-29 11:33:29 +02:00
gyptazy	f5d073dc02	fix: Validate for node presence when pinning guests to avoid crashing Fixes: #296 Fixes: #271	2025-08-29 11:26:15 +02:00
gyptazy	70ba1f2dfc	Merge pull request #295 from MiBUl-eu/main fix: overprovisioning first node if anti_affinity group has only one member	2025-08-28 17:16:28 +02:00
MiBUl-eu	c9855f1991	Added changeloge and fixed typo	2025-08-26 10:34:48 +00:00
MiBUl-eu	9bd29158b9	fix: overprovisioning first node if anti_affinity group has only one member	2025-08-25 10:11:45 +02:00
gyptazy	1ff0c5d96e	Merge pull request #293 from gyptazy/feature/290-validate-token-input-from-config-file feature: Add validation for provided API user token id to avoid confusions	2025-08-25 08:11:56 +02:00
Florian Paul Azim Hoberg	3eb4038723	feature: Add validation for provided API user token id to avoid confusions Fixes: #290	2025-08-25 08:07:44 +02:00
gyptazy	47e7dd3c56	Merge pull request #292 from gyptazy/fix/291-catch-stack-trace-when-user-account-is-not-given-or-wrong fix(proxmox-api): Fix stacktrace output when validating permissions on non existing users in Proxmox	2025-08-25 07:58:32 +02:00
Florian Paul Azim Hoberg	bb8cf9033d	fix(proxmox-api): Fix stacktrace output when validating permissions on non existing users in Proxmox Fixes: #291	2025-08-25 07:55:02 +02:00
gyptazy	756b4efcbd	Merge pull request #288 from gyptazy/feature/281-helm-chart-versioning feature: Add Helm chart support for ProxLB	2025-08-19 06:28:31 +02:00
gyptazy	8630333e4b	feature: Add Helm chart support for ProxLB Fixes: #281	2025-08-19 06:27:38 +02:00
gyptazy	7bd9a9b038	Merge pull request #282 from MaoMaoCake/main Feat: Add support for helm	2025-08-18 19:31:14 +02:00
maomaocake	16651351de	Implemented changes from comments	2025-08-18 12:46:29 +07:00
maomaocake	63805f1f50	Added support for resource limits and requests	2025-08-17 16:11:18 +07:00
maomaocake	c0ff1b5273	Added support for new user defined labels	2025-08-17 15:59:17 +07:00
maomaocake	07f8596fc5	Helm Chart	2025-08-17 15:41:55 +07:00
Florian	affbe433f9	Merge pull request #280 from gyptazy/docs/fix-ipv6-example-syntax docs: Fix IPv6 address syntax in given examples	2025-08-06 16:42:13 +02:00
Florian Paul Azim Hoberg	7bda22e754	docs: Fix IPv6 address syntax in given examples	2025-08-06 16:38:23 +02:00
Florian	253dcf8eb9	Merge pull request #274 from gyptazy/docs/273-proxmox-9-compatibility docs: Add compatibility matrix of tested Proxmox versions	2025-07-20 12:37:20 +02:00
gyptazy	6212d23268	docs: Add compatibility matrix of tested Proxmox versions Fixes: #273	2025-07-20 12:36:25 +02:00
Florian	cf8c06393f	Merge pull request #270 from gyptazy/docs/258-extend-doc-of-cluster-node-names-usage docs: Adjust docs regarding maintenance node hostnames	2025-07-17 11:53:33 +02:00
Florian Paul Azim Hoberg	5c23fd3433	docs: Adjust docs regarding maintenance node hostnames Fixes: #258	2025-07-17 11:52:08 +02:00
Florian	0fb732fc8c	Merge pull request #269 from gyptazy/fix/268-evaluate-balancing-types fix: Fix balancing evaluation of guest types (e.g., VM or CT).	2025-07-17 11:44:29 +02:00
Florian Paul Azim Hoberg	f36d96c72a	fix: Fix balancing evaluation of guest types (e.g., VM or CT). Fixes: #268	2025-07-17 11:41:00 +02:00
Florian	9cc03717ef	Merge pull request #267 from gyptazy/release/1.1.5 release: Create release 1.1.5	2025-07-14 11:13:12 +02:00
Florian Paul Azim Hoberg	4848887ccc	release: Create release 1.1.5 Fixes: #266	2025-07-14 11:08:38 +02:00
Gombócz Márton	04476feeaf	Docs/custom api port (#264 ) * docs(README.md): added description and examples of port declaration in hosts * docs(docs/03_configuration): updated hosts description about using ports	2025-07-10 10:33:44 +02:00
Florian	b3765bf0ae	Merge pull request #261 from gyptazy/feature/260-custom-api-ports feature: Allow custom (instead of static tcp/8006) API ports for API hosts	2025-07-10 09:19:34 +02:00
Florian Paul Azim Hoberg	806b728a14	feature: Allow custom (instead of static tcp/8006) API ports for API hosts. Fixes: #260	2025-07-08 17:39:29 +02:00
Florian	2c34ec91b1	Merge pull request #257 from gyptazy/release/prepare-1.1.5 release: Create release 1.1.5 beta 1	2025-06-29 11:08:15 +02:00
gyptazy	08b746a53b	release: Create release 1.1.5 beta 1	2025-06-27 16:50:17 +02:00
Florian	615e2f5608	Merge pull request #256 from gyptazy/release/1.1.4 release: Create release 1.1.4	2025-06-27 16:40:53 +02:00
gyptazy	fa1e1ad8a3	release: Create release 1.1.4 Fixes: #254	2025-06-27 16:23:31 +02:00
pmarasse	c78def3919	Fix loglevels (#255 ) * Modified some loglevels to make output lighter at INFO level Co-authored-by: Philippe MARASSE <philippe@marasse.fr>	2025-06-27 15:10:57 +02:00
Florian	54c53b9860	Merge pull request #253 from gyptazy/feature/245-add-guest-pinning-to-group-of-nodes feature: Allow pinning of guests to a group of nodes	2025-06-26 13:59:44 +02:00
Florian Paul Azim Hoberg	1fe8f703cc	feature: Allow pinning of guests to a group of nodes * You can now simply define multiple tags with plb_pin_node names where nodes are being evaluated and the one with the lowest resource usage will be taken. Fixes: #245	2025-06-26 13:54:05 +02:00
Florian	7ba806abf7	Merge pull request #252 from gyptazy/fix/248-dry-run-with-deactivated-balancing Fix an issue where balancing was performed in combination of deactivated balancing and dry-run mode	2025-06-24 10:09:10 +02:00
Florian Paul Azim Hoberg	6b2e120739	Fix: Fixed an issue where balancing was performed in combination of deactivated balancing and dry-run mode Fixes: #248	2025-06-24 10:06:28 +02:00
Florian	e4103df326	Merge pull request #251 from gyptazy/prepare/1.1.4 release: Prepare release 1.1.4 beta	2025-06-24 10:00:27 +02:00
Florian Paul Azim Hoberg	f2acd4efa6	release: Prepare release 1.1.4 beta	2025-06-24 09:56:06 +02:00
Florian	f4ed8d9928	Merge pull request #247 from gyptazy/fix/readme-1.1.3 fix: Adjust readme for Container image version of release 1.1.3.	2025-06-19 09:32:13 +02:00
gyptazy	ba74254b93	fix: Adjust readme for Container image version of release 1.1.3.	2025-06-19 09:31:31 +02:00
Florian	792a0f3820	Merge pull request #246 from gyptazy/release/1.1.3 release: Prepare release 1.1.3	2025-06-19 09:30:31 +02:00
gyptazy	b766041c4c	release: Prepare release 1.1.3 Fixes: #242	2025-06-19 09:25:56 +02:00
Florian	a31e41f839	Merge pull request #243 from gyptazy/feature/241-make-amount-of-parallel-migrations-configureable feature: Make the amount of parallel migrations configurable	2025-06-10 18:46:39 +02:00
gyptazy	7cb5a31b89	feature: Make the amount of parallel migrations configurable Fixes: #241	2025-06-05 16:12:47 +02:00
Florian	617d0a3ae3	Merge pull request #240 from gyptazy/feature/239-add-optional-wait-time-until-service-starts feature: Add optional wait time before service action.	2025-06-04 16:49:24 +02:00
gyptazy	db3a3b77fc	feature: Add optional wait time before service action. Fixes: #239	2025-06-01 16:06:42 +02:00
Florian	5a9643275a	Merge pull request #237 from gyptazy/feature/94-balance-cpu-by-average-consumption feature: Use the average CPU consumption of a guest within the last 60 minutes instead of the current CPU usage	2025-05-29 12:01:13 +02:00
Florian	60d1e333aa	Merge pull request #238 from gyptazy/feature/189-add-reload-function feature: Add relaod (SIGHUP) function to ProxLB to reload the configuration.	2025-05-29 12:00:42 +02:00
gyptazy	96dc435cf6	feature: Add relaod (SIGHUP) function to ProxLB to reload the configuration. Fixes: #189	2025-05-24 09:56:20 +02:00
gyptazy	263b08b53a	feature: Add reload method to ProxLB systemd file Fixes: #189	2025-05-24 09:19:42 +02:00
gyptazy	89102d517e	feature: Use the average CPU consumption of a guest within the last 60 minutes instead of the current CPU usage - Using the current CPU consumption of a guest object is too volatile and does not represent the real usage. Therefore, we use the average consumption of the cpu values within the last 60 minutes. Thanks-to: @philslab-ninja Fixes: #94	2025-05-24 09:17:14 +02:00
Florian	845af4abc8	Merge pull request #236 from gyptazy/prepare/dev1.1.3beta development: Adjust beta release 1.1.3	2025-05-22 13:42:05 +02:00
gyptazy	3e02403598	development: Adjust beta release 1.1.3	2025-05-22 06:58:53 +02:00
Florian	0b0d569877	Merge pull request #235 from gyptazy/feature/232-align-proxmox-ha-maintenance-mode feature: Align maintenance mode with Proxmox HA maintenance mode	2025-05-22 06:55:32 +02:00
Florian Paul Azim Hoberg	1cbda2e2f9	feature: Align maintenance mode with Proxmox HA maintenance mode Fixes: #232	2025-05-21 18:19:50 +02:00
gyptazy	b6febf1933	feature: Add action to create multiarch container	2025-05-20 19:57:04 +02:00
Florian	53a6d2a459	Merge pull request #233 from gyptazy/feature/231-arm64-container feature: Add workflows to build container images for AMD64 + ARM64 architecture	2025-05-20 12:11:03 +02:00
Florian Paul Azim Hoberg	6c82ce010b	feature: Add workflows to build container images for AMD64 + ARM64 architecture Fixes: #231	2025-05-20 12:06:22 +02:00
Florian	4b8b73e468	Merge pull request #228 from gyptazy/release/1.1.2-readme docs: Update readme with new image version	2025-05-13 08:26:48 +02:00
Florian Paul Azim Hoberg	a75729dd6a	docs: Update readme with new image version	2025-05-13 08:26:03 +02:00
Florian	b8792a87af	Merge pull request #227 from gyptazy/release/1.1.2 release: Create release 1.1.2	2025-05-13 08:18:23 +02:00
Florian Paul Azim Hoberg	c1261a2d3c	release: Create release 1.1.2 Fixes: #226	2025-05-13 08:13:43 +02:00
Florian	0035f57738	Merge pull request #223 from gyptazy/fix/222-extend-debug-messages fix: Force type cast guest cpu count to int where in some corner cases a str got returned.	2025-05-08 16:23:10 +02:00
gyptazy	b372d361e7	fix: Force type cast guest cpu count to int where in some corner cases a str got returned. Fixes: #222	2025-05-03 08:53:56 +02:00