Bug #58957
openPrediction model doesn't work because of error in devicehealth module
0%
Description
Hello,
We have been unable to use the local prediction model for many years. Our first attempt was on the Nautilus version, and the most recent attempt was on Quincy, but both were unsuccessful. There is likely an issue with the device health module, as the self-test reports the following.
Ceph version 16.2.10 (Pacific)ceph mgr self-test module devicehealth
Error EPERM: Test failed: Remote method threw exception: Traceback (most recent call last):
File "/usr/share/ceph/mgr/devicehealth/module.py", line 233, in self_test
assert r 0
AssertionError
ceph mgr self-test module devicehealth
Error EPERM: Test failed: Remote method threw exception: Traceback (most recent call last):
File "/usr/share/ceph/mgr/devicehealth/module.py", line 258, in self_test
assert r 0
AssertionError
Regards,
Michal Strnad
Updated by Michal Strnad about 1 year ago
I am attaching the crash dump from today.
ceph crash info 2023-03-12T10:16:25.303976Z_9666d578-54e3-4f8d-a0fd-a26a6192ddfb { "backtrace": [ " File \"/usr/share/ceph/mgr/devicehealth/module.py\", line 258, in self_test\n assert r == 0", "AssertionError" ], "ceph_version": "17.2.5", "crash_id": "2023-03-12T10:16:25.303976Z_9666d578-54e3-4f8d-a0fd-a26a6192ddfb", "entity_name": "mgr.beX", "mgr_module": "devicehealth", "mgr_module_caller": "ActivePyModule::dispatch_remote self_test", "mgr_python_exception": "AssertionError", "os_id": "centos", "os_name": "CentOS Stream", "os_version": "8", "os_version_id": "8", "process_name": "ceph-mgr", "stack_sig": "b2f207be5a9cb776c859e3c66def93916bb2bdac09df0ea728a15c9af1b50ff8", "timestamp": "2023-03-12T10:16:25.303976Z", "utsname_hostname": "beX.example.com", "utsname_machine": "x86_64", "utsname_release": "4.18.0-448.el8.x86_64", "utsname_sysname": "Linux", "utsname_version": "#1 SMP Wed Jan 18 15:02:46 UTC 2023" }