Skip to content

cpu_percent_% still higher than 100% in Alumet v0.9.3 #403

@thealanjason

Description

@thealanjason

Describe the bug
We observe cpu_percentage still higher than 100% even though PR #341 says that it was fixed in v0.9.3.

The hardware on which we ran the tests has 96 CPU Cores.

Refer the following two issues where we the bug surfaces:

Are we missing something? Do we need to use the cgroups plugin?

Environment information
OS: Ubuntu Linux 24.04LTS
Alumet version²: v0.9.3.
Enabled plugins:

    🧩 5 plugins started:
        - csv v0.2.0
        - rapl v0.3.1
        - procfs v0.2.1
        - nvml v0.5.0
        - energy-attribution v0.1.1

To Reproduce
Steps to reproduce the behavior:
Use the config here: https://github.com/thealanjason/energy_measurement/blob/main/measurement_tools/alumet/experiments/08_cpu_percentage_test/alumet-config-cpu-percent-test.toml

to exec the program here: https://github.com/thealanjason/energy_measurement/blob/86dcdd60bf89a0996294a7200fd82230c2658b3a/case_studies/gemm/gemm.py

The program would need a conda/micromamba to be created with the following environment.yml file

name: test-case-gemm
channels:
  - conda-forge
dependencies:
  - numpy

Expected behavior
cpu_percent less than 100% when the program uses all 96 cores.

Logs / Output

Metadata

Metadata

Assignees

No one assigned

    Labels

    T:bugtype: Something isn't working
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions