Activity
From 05/19/2019 to 06/17/2019
06/17/2019
- 01:30 PM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- 01:08 PM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- fixed by "ceph: remove request from waiting list before unregister" in testing branch
06/16/2019
- 01:51 PM Bug #40340: kernel client stuck at opening forever after network outage.
- hmm, re-read the code seems we will backoff and retry every 5mins....not sure if I read it correctly and how it hit i...
06/14/2019
- 01:35 PM Bug #40362: amending caps with "ceph auth caps" vs using an existing client
- Note that these irregular I/O errors can occur today (i.e. with no invalidation in the kernel client) if enough time ...
- 12:36 PM Bug #40362: amending caps with "ceph auth caps" vs using an existing client
- If we simply invalidate, things become super confusing. If the OSD that pool2/img header resides on is open, "rbd ma...
- 12:14 PM Bug #40362 (New): amending caps with "ceph auth caps" vs using an existing client
- ...
- 07:40 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- request was unregistered twice. one is from wait_requests, another is from __wake_requests
- 03:35 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- Thanks zheng, can you explain more backgorund?
- 12:23 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- maybe this one...
- 02:49 AM Bug #39523: small write in ab mode does not change mtime
- can't reproduce this issue on 5.0.16 kernel + ceph v12.2.12
- 02:34 AM Bug #37713 (Resolved): Centos 7 kernel client overwriting files
- Fixed in 7.5 (kernel-3.10.0-862.33.1.el7) and 7.6 (kernel-3.10.0-957.16.1.el7).
- 02:08 AM Bug #38824 (Resolved): kclient: ceph: remove duplicated filelock ref increase
- 02:05 AM Bug #40340: kernel client stuck at opening forever after network outage.
- maybe calling peer_reset after tcp timeout is a better fix
- 01:12 AM Bug #40340: kernel client stuck at opening forever after network outage.
- 140 06/13/2019 09:21:22 AM umount /import/ceph/
141 06/13/2019 09:23:05 AM mount -a
The time of umount - 01:09 AM Bug #40340: kernel client stuck at opening forever after network outage.
- Negative.
The peer_reset get called during eviction but not after connection establish timeout.
2264570:Jun 13 ... - 12:01 AM Bug #40340: kernel client stuck at opening forever after network outage.
- does peer_reset() get called after tcp timeout
06/13/2019
- 05:08 PM Bug #40340: kernel client stuck at opening forever after network outage.
- Step to reproduce
1. [MDS]ceph --daemon /path-to-admin-socket/ session evict <client_id>
2. [Client] After 2M... - 05:02 PM Bug #40340 (Can't reproduce): kernel client stuck at opening forever after network outage.
- We dont retry(in ceph code) for sending create_session_open_msg. Thus once a network outage happen and mds evict the ...
- 04:48 PM Bug #40339 (Resolved): kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
- It happens on 3.10.0-957.1.3.el7.x86_64
Back to that time,
1. there is a mds session stucking in "opening" due ...
06/10/2019
- 07:08 PM Bug #38482 (Resolved): Quotas: mounting quotas subdir doesn't respect quotas in kernel client
- Merged into 5.2-rc1.
- 07:02 PM Bug #39945: RBD I/O error leads to ghost-mapped RBD
- > On the system that had the RBDs mapped, the RBDs don't appear in "rbd showmapped" and are not inside "/sys/bus/rbd/...
- 06:18 PM Bug #39010 (Resolved): Hang with read-only RBD
- 06:17 PM Bug #39010: Hang with read-only RBD
- This assert has been removed in 5.2:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id... - 04:22 PM Feature #39967: kclient: recover from a killed session (w/ blacklist)
- Dan van der Ster wrote:
> Isn't client_reconnect_stale a fuse client option? What is the relation to kclient?
The... - 11:35 AM Bug #40190: oops or soft lockup after MDS crash
- Fixed by commit "ceph: don't blindly unregister session that is in opening state" in https://github.com/ceph/ceph-cli...
- 03:47 AM Bug #40190: oops or soft lockup after MDS crash
- yes. it's related. mds crash while importing caps reveals a kernel bug...
- 02:48 AM Bug #40190: oops or soft lockup after MDS crash
- looks like cap->session is wild pointer
06/08/2019
- 06:03 PM Bug #40190: oops or soft lockup after MDS crash
- Probably this bug: https://bugzilla.redhat.com/show_bug.cgi?id=1710556
Kernel patches here:
https://github.com/...
06/07/2019
- 01:44 PM Bug #39258: add statx support to kcephfs
- The patches are merged in the upstream kernel, but I still don't have any tests yet. We do have some testcases for li...
- 09:30 AM Feature #39967: kclient: recover from a killed session (w/ blacklist)
- Isn't client_reconnect_stale a fuse client option? What is the relation to kclient?
06/06/2019
- 02:52 PM Bug #40190 (Resolved): oops or soft lockup after MDS crash
- Yesterday we had a v12.2.12 mds crash [1] and several kernel clients (el7 kernel 3.10.0-957.12.1.el7.x86_64) either d...
05/31/2019
- 10:45 AM Bug #39649 (Resolved): kernel BUG at fs/ceph/file.c:1476!
- fixed by "ceph: fix error handling in ceph_get_caps()"
05/19/2019
- 08:30 AM Feature #39967 (Resolved): kclient: recover from a killed session (w/ blacklist)
- One option for a more robust solution is to create a new configuration option that allows the client to acquire a new...
Also available in: Atom