https://tracker.ceph.com/https://tracker.ceph.com/favicon.ico2020-01-20T22:11:17ZCeph mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1563482020-01-20T22:11:17ZGreg Farnumgfarnum@redhat.com
<ul><li><strong>Project</strong> changed from <i>Ceph</i> to <i>mgr</i></li></ul> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1566032020-01-22T22:20:09ZJosh Durgin
<ul><li><strong>Assignee</strong> set to <i>Wido den Hollander</i></li></ul><p>Wido can you take a look at this one?</p> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1566842020-01-23T10:35:52ZWido den Hollanderwido@42on.com
<ul></ul><p>Yes, so I saw this recently as well.</p>
<p>The telegraf module by default looks at a local unix socket: unixgram:///tmp/telegraf.sock</p>
<p>It thows an 'error', but not really an exception which I tried to catch when I recently found this.</p>
<pre>
except (socket.error, RuntimeError, IOError, OSError):
self.log.exception('Failed to send statistics to Telegraf:')
</pre>
<p>This except block doesn't seem to catch it.</p>
<p>Any suggestions on what we need to catch to overcome this?</p> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1612872020-03-19T01:50:21ZSage Weilsage@newdream.net
<ul><li><strong>Priority</strong> changed from <i>Normal</i> to <i>High</i></li></ul><pre>
2020-03-18T17:03:03.739 INFO:tasks.ceph.mgr.x.smithi182.stderr:2020-03-18T17:03:03.737+0000 7f892f71a700 -1 log_channel(cluster) log [ERR] : Unhandled exception from module 'telegraf' while running on mgr.x: [Errno 2] No such file or directory
2020-03-18T17:03:03.740 INFO:tasks.ceph.mgr.x.smithi182.stderr:2020-03-18T17:03:03.737+0000 7f892f71a700 -1 telegraf.serve:
2020-03-18T17:03:03.740 INFO:tasks.ceph.mgr.x.smithi182.stderr:2020-03-18T17:03:03.737+0000 7f892f71a700 -1 Traceback (most recent call last):
2020-03-18T17:03:03.740 INFO:tasks.ceph.mgr.x.smithi182.stderr: File "/usr/share/ceph/mgr/telegraf/module.py", line 295, in serve
2020-03-18T17:03:03.740 INFO:tasks.ceph.mgr.x.smithi182.stderr: self.send_to_telegraf()
2020-03-18T17:03:03.741 INFO:tasks.ceph.mgr.x.smithi182.stderr: File "/usr/share/ceph/mgr/telegraf/module.py", line 243, in send_to_telegraf
2020-03-18T17:03:03.741 INFO:tasks.ceph.mgr.x.smithi182.stderr: with sock as s:
2020-03-18T17:03:03.741 INFO:tasks.ceph.mgr.x.smithi182.stderr: File "/usr/share/ceph/mgr/telegraf/basesocket.py", line 41, in __enter__
2020-03-18T17:03:03.741 INFO:tasks.ceph.mgr.x.smithi182.stderr: self.connect()
2020-03-18T17:03:03.742 INFO:tasks.ceph.mgr.x.smithi182.stderr: File "/usr/share/ceph/mgr/telegraf/basesocket.py", line 29, in connect
2020-03-18T17:03:03.742 INFO:tasks.ceph.mgr.x.smithi182.stderr: return self.sock.connect(self.address)
2020-03-18T17:03:03.743 INFO:tasks.ceph.mgr.x.smithi182.stderr:FileNotFoundError: [Errno 2] No such file or directory
</pre><br />/a/sage-2020-03-18_14:59:42-rados-wip-sage-testing-2020-03-18-0826-distro-basic-smithi/4866150<br />description: rados/mgr/{clusters/{2-node-mgr.yaml} debug/mgr.yaml objectstore/bluestore-comp-zlib.yaml<br /> supported-random-distro$/{rhel_8.yaml} tasks/module_selftest.yaml} mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1613012020-03-19T09:29:17ZWido den Hollanderwido@42on.com
<ul></ul><p>Hope this fixes it: <a class="external" href="https://github.com/ceph/ceph/pull/34052">https://github.com/ceph/ceph/pull/34052</a></p> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1630732020-04-12T11:35:05ZKefu Chaitchaikov@gmail.com
<ul><li><strong>Status</strong> changed from <i>New</i> to <i>Pending Backport</i></li><li><strong>Assignee</strong> changed from <i>Wido den Hollander</i> to <i>Kefu Chai</i></li><li><strong>Backport</strong> set to <i>nautilus, octopus</i></li><li><strong>Pull request ID</strong> set to <i>34468</i></li></ul> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1630932020-04-13T10:27:20ZNathan Cutlerncutler@suse.cz
<ul><li><strong>Copied to</strong> <i><a class="issue tracker-9 status-3 priority-4 priority-default closed" href="/issues/45069">Backport #45069</a>: octopus: Trying to enable the CEPH Telegraf module errors 'No such file or directory'</i> added</li></ul> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1630952020-04-13T10:27:28ZNathan Cutlerncutler@suse.cz
<ul><li><strong>Copied to</strong> <i><a class="issue tracker-9 status-3 priority-4 priority-default closed" href="/issues/45070">Backport #45070</a>: nautilus: Trying to enable the CEPH Telegraf module errors 'No such file or directory'</i> added</li></ul> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1655142020-05-11T14:28:27ZNathan Cutlerncutler@suse.cz
<ul><li><strong>Status</strong> changed from <i>Pending Backport</i> to <i>Resolved</i></li></ul><p>While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".</p> mgr - Bug #43551: Trying to enable the CEPH Telegraf module errors 'No such file or directory'https://tracker.ceph.com/issues/43551?journal_id=1781352020-10-28T10:52:23ZStefan Koomanceph@kooman.org
<ul></ul><p>The fix does nothing to prevent this from happening, it only handles the exception. We need socket support. The underlying issue is an updated systemd services file for ceph-manager where private tmp is enabled (PrivateTmp=true) that comes with Nautilus (it works in Mimic and Luminous). The ceph manager does not find the /tmp/telegraf.sock in it's own namespace. A fix might be to change the default location for the socket file to /var/telegraf.sock.</p>