Activity

From 05/19/2019 to 06/17/2019

06/17/2019

01:30 PM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
Zheng Yan
01:08 PM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
Fixed by "ceph: remove request from waiting list before unregister" in the testing branch. Zheng Yan

06/16/2019

01:51 PM Bug #40340: kernel client stuck at opening forever after network outage.
Hmm, re-reading the code, it seems we will back off and retry every 5 minutes... not sure if I read it correctly and how it hit i... Xiaoxi Chen

06/14/2019

01:35 PM Bug #40362: amending caps with "ceph auth caps" vs using an existing client
Note that these irregular I/O errors can occur today (i.e. with no invalidation in the kernel client) if enough time ... Ilya Dryomov
12:36 PM Bug #40362: amending caps with "ceph auth caps" vs using an existing client
If we simply invalidate, things become super confusing. If the OSD that pool2/img header resides on is open, "rbd ma... Ilya Dryomov
12:14 PM Bug #40362 (New): amending caps with "ceph auth caps" vs using an existing client
... Ilya Dryomov
07:40 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
The request was unregistered twice: once from wait_requests and once from __wake_requests. Zheng Yan
03:35 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
Thanks Zheng, can you explain more background? Xiaoxi Chen
12:23 AM Bug #40339: kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
maybe this one... Zheng Yan
02:49 AM Bug #39523: small write in ab mode does not change mtime
Can't reproduce this issue on the 5.0.16 kernel with Ceph v12.2.12. Zheng Yan
02:34 AM Bug #37713 (Resolved): Centos 7 kernel client overwriting files
Fixed in 7.5 (kernel-3.10.0-862.33.1.el7) and 7.6 (kernel-3.10.0-957.16.1.el7). Zheng Yan
02:08 AM Bug #38824 (Resolved): kclient: ceph: remove duplicated filelock ref increase
Zheng Yan
02:05 AM Bug #40340: kernel client stuck at opening forever after network outage.
Maybe calling peer_reset after the TCP timeout is a better fix. Zheng Yan
01:12 AM Bug #40340: kernel client stuck at opening forever after network outage.
140 06/13/2019 09:21:22 AM umount /import/ceph/
141 06/13/2019 09:23:05 AM mount -a
The time of umount
Xiaoxi Chen
01:09 AM Bug #40340: kernel client stuck at opening forever after network outage.
Negative.
peer_reset gets called during eviction but not after the connection establishment timeout.
2264570:Jun 13 ...
Xiaoxi Chen
12:01 AM Bug #40340: kernel client stuck at opening forever after network outage.
Does peer_reset() get called after the TCP timeout? Zheng Yan

06/13/2019

05:08 PM Bug #40340: kernel client stuck at opening forever after network outage.
Steps to reproduce:
1. [MDS] ceph --daemon /path-to-admin-socket/ session evict <client_id>
2. [Client] After 2M...
Xiaoxi Chen
05:02 PM Bug #40340 (Can't reproduce): kernel client stuck at opening forever after network outage.
We don't retry (in the Ceph code) sending create_session_open_msg. Thus, once a network outage happens and the MDS evicts the ... Xiaoxi Chen
04:48 PM Bug #40339 (Resolved): kernel BUG at fs/ceph/mds_client.c:600! invalid opcode: 0000 [#1] SMP
It happens on 3.10.0-957.1.3.el7.x86_64
At that time:
1. there was an MDS session stuck in "opening" due ...
Xiaoxi Chen

06/10/2019

07:08 PM Bug #38482 (Resolved): Quotas: mounting quotas subdir doesn't respect quotas in kernel client
Merged into 5.2-rc1. Ilya Dryomov
07:02 PM Bug #39945: RBD I/O error leads to ghost-mapped RBD
> On the system that had the RBDs mapped, the RBDs don't appear in "rbd showmapped" and are not inside "/sys/bus/rbd/... Ilya Dryomov
06:18 PM Bug #39010 (Resolved): Hang with read-only RBD
Ilya Dryomov
06:17 PM Bug #39010: Hang with read-only RBD
This assert has been removed in 5.2:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id...
Ilya Dryomov
04:22 PM Feature #39967: kclient: recover from a killed session (w/ blacklist)
Dan van der Ster wrote:
> Isn't client_reconnect_stale a fuse client option? What is the relation to kclient?
The...
Patrick Donnelly
11:35 AM Bug #40190: oops or soft lockup after MDS crash
Fixed by commit "ceph: don't blindly unregister session that is in opening state" in https://github.com/ceph/ceph-cli... Zheng Yan
03:47 AM Bug #40190: oops or soft lockup after MDS crash
Yes, it's related. An MDS crash while importing caps reveals a kernel bug... Zheng Yan
02:48 AM Bug #40190: oops or soft lockup after MDS crash
Looks like cap->session is a wild pointer. Zheng Yan

06/08/2019

06:03 PM Bug #40190: oops or soft lockup after MDS crash
Probably this bug: https://bugzilla.redhat.com/show_bug.cgi?id=1710556
Kernel patches here:
https://github.com/...
Patrick Donnelly

06/07/2019

01:44 PM Bug #39258: add statx support to kcephfs
The patches are merged in the upstream kernel, but I still don't have any tests yet. We do have some testcases for li... Jeff Layton
09:30 AM Feature #39967: kclient: recover from a killed session (w/ blacklist)
Isn't client_reconnect_stale a fuse client option? What is the relation to kclient? Dan van der Ster

06/06/2019

02:52 PM Bug #40190 (Resolved): oops or soft lockup after MDS crash
Yesterday we had a v12.2.12 mds crash [1] and several kernel clients (el7 kernel 3.10.0-957.12.1.el7.x86_64) either d... Dan van der Ster

05/31/2019

10:45 AM Bug #39649 (Resolved): kernel BUG at fs/ceph/file.c:1476!
Fixed by "ceph: fix error handling in ceph_get_caps()".
Zheng Yan

05/19/2019

08:30 AM Feature #39967 (Resolved): kclient: recover from a killed session (w/ blacklist)
One option for a more robust solution is to create a new configuration option that allows the client to acquire a new... Patrick Donnelly
 
