Activity

From 12/12/2019 to 01/10/2020

01/10/2020

11:36 PM Bug #43555 (New): raw usage is far from total pool usage
ceph -v
ceph version 13.2.5 (cbff874f9007f1869bfd3821b7e33b2a6ffd4988) mimic (stable)...
liang sibin
10:32 PM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
http://pulpito.ceph.com/nojha-2020-01-10_19:11:03-rbd:mirror-thrash-master-distro-basic-smithi/4653675/
Observatio...
Neha Ojha
08:30 PM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
Reproduces with -s rbd:mirror-thrash and --filter 'rbd-mirror-fsx-workunit'
http://pulpito.ceph.com/nojha-2020-01-...
Neha Ojha
10:03 PM Bug #43553 (Can't reproduce): mon: client mon_status fails
... Patrick Donnelly
09:07 PM Bug #40649: set_mon_vals failed to set cluster_network = 10.1.2.0/24: Configuration option 'clust...
This also happened to me during an upgrade from Luminous to Nautilus.
The cluster/public networks were not defined...
Frank Ritchie
07:26 PM Bug #43552 (Resolved): nautilus: OSDMonitor: SIGFPE in OSDMonitor::share_map_with_random_osd
... Patrick Donnelly
02:45 PM Bug #43365: Nautilus: Random mon crashes in failed assertion at ceph::time_detail::signedspan
We are also running into this issue.
Jan 07 19:03:42 pmxc05 ceph-mon[3701783]: 2020-01-07 19:03:42.625 7fe59c03d...
Glen Aidukas
01:39 PM Bug #39665 (Resolved): kstore: memory may leak on KStore::_do_read_stripe
Kefu Chai
01:34 PM Bug #43412 (Resolved): cephadm ceph_manager IndexError: list index out of range
Kefu Chai
04:55 AM Backport #43532 (In Progress): luminous: Change default upmap_max_deviation to 5
David Zafman
04:54 AM Backport #43531 (In Progress): mimic: Change default upmap_max_deviation to 5
David Zafman

01/09/2020

10:00 PM Bug #42328 (New): osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
This issue is still occurring with today's master branch:
http://qa-proxy.ceph.com/teuthology/jdillaman-2020-01-09...
Jason Dillaman
04:56 PM Backport #43495: nautilus: ceph monitor crashes after updating 'mon_memory_target' config setting.
Sridhar Seshasayee wrote:
> https://github.com/ceph/ceph/pull/32520
merged
Yuri Weinstein
02:28 AM Bug #43412 (Fix Under Review): cephadm ceph_manager IndexError: list index out of range
Kefu Chai
12:39 AM Backport #43529 (In Progress): nautilus: Remove use of rules batching for upmap balancer
David Zafman
12:27 AM Backport #43529 (Resolved): nautilus: Remove use of rules batching for upmap balancer
https://github.com/ceph/ceph/pull/31956 David Zafman
12:39 AM Backport #43530 (In Progress): nautilus: Change default upmap_max_deviation to 5
David Zafman
12:28 AM Backport #43530 (Resolved): nautilus: Change default upmap_max_deviation to 5
https://github.com/ceph/ceph/pull/31956 David Zafman
12:28 AM Backport #43532 (Resolved): luminous: Change default upmap_max_deviation to 5
https://github.com/ceph/ceph/pull/32586 David Zafman
12:28 AM Backport #43531 (Resolved): mimic: Change default upmap_max_deviation to 5
https://github.com/ceph/ceph/pull/31957 David Zafman

01/08/2020

10:23 PM Bug #43312 (Pending Backport): Change default upmap_max_deviation to 5
Neha Ojha
10:10 PM Bug #43307 (Pending Backport): Remove use of rules batching for upmap balancer
Neha Ojha
10:09 PM Bug #43397 (Fix Under Review): FS_DEGRADED to cluster log despite --no-mon-health-to-clog
Neha Ojha
10:04 PM Bug #43412: cephadm ceph_manager IndexError: list index out of range
Kefu's got a PR for this Josh Durgin
05:31 AM Bug #43412: cephadm ceph_manager IndexError: list index out of range
I'm guessing it's caused by there being no pools at the time. So the random choice fails. Maybe we need to do somethi... Matthew Oliver
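The failure mode guessed at above (a random choice made over an empty pool list) can be sketched in a few lines. This is only an illustration under that assumption; `pick_pool` is a hypothetical helper and not the actual ceph_manager code.

```python
import random


def pick_pool(pools):
    """Return a random pool name, or None when no pools exist yet.

    Hypothetical helper for illustration only.
    """
    if not pools:
        # random.choice() raises IndexError on an empty sequence,
        # so the "no pools yet" case is handled explicitly.
        return None
    return random.choice(pools)
```

A caller would then check for None and skip or retry the operation instead of crashing.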
10:02 PM Bug #43422: qa/standalone/mon/osd-pool-create.sh fails to grep utf8 pool name
probably need to set LANG to utf8 Josh Durgin
08:23 AM Bug #43185: ceph -s not showing client activity
We run 14.2.4. I sometimes see the mgr process at 100%, and I have been told that the reason for the lack of activity shown might be... Anonymous
02:24 AM Bug #43520 (In Progress): segfault in kstore's pending stripes
Chang Liu
02:23 AM Bug #43520: segfault in kstore's pending stripes
ceph version 14.2.1-700.3.0.2.407 (c823e6bbf85437561d2165c0f4b5d8c6bd726975) nautilus (stable)
1: (()+0xf5e0) [0x7f...
Chang Liu
02:20 AM Bug #43520 (In Progress): segfault in kstore's pending stripes
Chang Liu

01/07/2020

02:46 PM Documentation #41389 (Resolved): wrong datatype describing crush_rule
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
02:45 PM Bug #42177 (Resolved): osd/PrimaryLogPG.cc: 13068: FAILED ceph_assert(obc)
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
02:43 PM Bug #42906 (Resolved): ceph-mon --mkfs: public_address type (v1|v2) is not respected
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:25 AM Backport #43495 (In Progress): nautilus: ceph monitor crashes after updating 'mon_memory_target' ...
Sridhar Seshasayee
10:24 AM Backport #43495 (New): nautilus: ceph monitor crashes after updating 'mon_memory_target' config s...
Sridhar Seshasayee
10:01 AM Backport #43495 (Resolved): nautilus: ceph monitor crashes after updating 'mon_memory_target' con...
https://github.com/ceph/ceph/pull/32520 Sridhar Seshasayee
09:34 AM Bug #43454: ceph monitor crashes after updating 'mon_memory_target' config setting.
Tested the fix without using rocksdb and confirmed that the crash is not observed now:
2020-01-07T12:53:09.942+053...
Sridhar Seshasayee
08:41 AM Bug #43454 (Pending Backport): ceph monitor crashes after updating 'mon_memory_target' config set...
Kefu Chai
02:46 AM Backport #39474: luminous: segv in fgets() in collect_sys_info reading /proc/cpuinfo
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/32349
merged
Yuri Weinstein
02:45 AM Backport #41730: luminous: osd/ReplicatedBackend.cc: 1349: FAILED ceph_assert(peer_missing.count(...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31855
merged
Yuri Weinstein

01/06/2020

11:21 PM Bug #43490 (New): nautilus: "[WRN] Monitor daemon marked osd.2 down, but it is still running" in ...
Run: http://pulpito.ceph.com/yuriw-2020-01-04_16:08:12-rados-wip-yuri8-testing-2020-01-03-2031-nautilus-distro-basic-... Yuri Weinstein
11:18 PM Backport #42997: nautilus: acting_recovery_backfill won't catch all up peers
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/32064
merged
Yuri Weinstein
11:17 PM Backport #42853: nautilus: format error: ceph osd stat --format=json
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/32062
merged
Yuri Weinstein
11:16 PM Backport #42846: nautilus: src/msg/async/net_handler.cc: Fix compilation
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31736
merged
Yuri Weinstein
10:23 PM Bug #43489 (New): PG.cc: 953: FAILED assert(0 == "past_interval start interval mismatch")

Upgrade runs from Jewel to Luminous and Luminous to Mimic
yuriw-2019-12-23_19:53:50-rados-wip-yuri3-testing-2019...
David Zafman
07:21 PM Bug #41718 (Resolved): ceph osd stat JSON output incomplete
David Zafman
07:21 PM Bug #43485 (Fix Under Review): Deprecated full/nearfull added back by mistake
David Zafman
07:16 PM Bug #43485 (Resolved): Deprecated full/nearfull added back by mistake

The change for
https://tracker.ceph.com/issues/41718 (dff411f1905cc69bfb2cfa8b62a00b4702e6aa46)
also added back...
David Zafman
06:26 PM Backport #43325 (Resolved): luminous: wrong datatype describing crush_rule
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32267
m...
Nathan Cutler
06:25 PM Backport #43315 (Resolved): mimic:wrong datatype describing crush_rule
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32255
m...
Nathan Cutler
06:24 PM Backport #42197 (Resolved): nautilus: osd/PrimaryLogPG.cc: 13068: FAILED ceph_assert(obc)
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31028
m...
Nathan Cutler
06:23 PM Backport #43140 (Resolved): nautilus: ceph-mon --mkfs: public_address type (v1|v2) is not respected
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32028
m...
Nathan Cutler
01:31 PM Backport #43473 (Resolved): nautilus: recursive lock of OpTracker::lock (70)
https://github.com/ceph/ceph/pull/32858 Nathan Cutler
01:30 PM Backport #43472 (Resolved): mimic: negative num_objects can set PG_STATE_DEGRADED
https://github.com/ceph/ceph/pull/33331 Nathan Cutler
01:30 PM Backport #43471 (Resolved): nautilus: negative num_objects can set PG_STATE_DEGRADED
https://github.com/ceph/ceph/pull/32857 Nathan Cutler
01:30 PM Backport #43470 (Rejected): mimic: asynchronous recovery + backfill might spin pg undersized for ...
https://github.com/ceph/ceph/pull/33330 Nathan Cutler
01:30 PM Backport #43469 (Resolved): nautilus: asynchronous recovery + backfill might spin pg undersized f...
https://github.com/ceph/ceph/pull/32849 Nathan Cutler
08:00 AM Bug #42861 (Fix Under Review): Libceph-common.so needs to use private link attribute when includi...
Kefu Chai

01/04/2020

03:15 PM Bug #43334 (Resolved): nautilus: rados/test_envlibrados_for_rocksdb.sh broken packages with ubunt...
I've recompiled cmake3 for xenial/amd64 with GCC-5 and uploaded the built packages to the chacra repo. Please reopen... Kefu Chai
02:38 AM Bug #43334: nautilus: rados/test_envlibrados_for_rocksdb.sh broken packages with ubuntu_16.04.yaml
I need to rebuild cmake3 using the original libstdc++ instead of the one from the gcc-8/gcc-9 ppa repo. Kefu Chai

01/03/2020

11:54 PM Bug #43421 (Fix Under Review): mon spends too much time to build incremental osdmap
Neha Ojha
10:09 AM Bug #43421: mon spends too much time to build incremental osdmap
It takes 5 seconds to build 640 incremental osdmaps for one client. simon gao
08:15 AM Bug #43421: mon spends too much time to build incremental osdmap
Sorry, it took 5 seconds. simon gao
11:49 PM Bug #43185 (Need More Info): ceph -s not showing client activity
super xor wrote:
> Possible relation to https://tracker.ceph.com/issues/43364 and https://tracker.ceph.com/issues/43...
Neha Ojha
10:48 PM Bug #43311 (Pending Backport): asynchronous recovery + backfill might spin pg undersized for a lo...
Neha Ojha
09:01 PM Feature #40870: Implement mon_memory_target
Another follow-on fix: https://github.com/ceph/ceph/pull/32473 Neha Ojha
09:00 PM Bug #43454 (Fix Under Review): ceph monitor crashes after updating 'mon_memory_target' config set...
Neha Ojha
08:24 AM Bug #43454 (Resolved): ceph monitor crashes after updating 'mon_memory_target' config setting.
Refer to Bugzilla https://bugzilla.redhat.com/show_bug.cgi?id=1760257 for more details. Sridhar Seshasayee
08:06 PM Backport #42197: nautilus: osd/PrimaryLogPG.cc: 13068: FAILED ceph_assert(obc)
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31028
merged
Yuri Weinstein
04:39 PM Bug #43334: nautilus: rados/test_envlibrados_for_rocksdb.sh broken packages with ubuntu_16.04.yaml
/a/yuriw-2019-12-23_20:23:51-rados-wip-yuri-testing-2019-12-16-2241-nautilus-distro-basic-smithi/4628899/ Neha Ojha

01/02/2020

03:41 PM Bug #43403: unittest_lockdep unreliable
Happened in https://github.com/ceph/ceph/pull/27792 (among others) Nathan Cutler

01/01/2020

11:01 AM Documentation #42315: Improve rados command usage, man page and tutorial
RADOS(8) Ceph RADOS(8)
NAME
rados - rados object s...
Zac Dover
10:52 AM Documentation #42315: Improve rados command usage, man page and tutorial
[zdover@192-168-1-112 ~]$ rados -h
usage: rados [options] [commands]
POOL COMMANDS
lspools ...
Zac Dover

12/25/2019

03:24 PM Bug #43422 (Resolved): qa/standalone/mon/osd-pool-create.sh fails to grep utf8 pool name
... Sage Weil
12:33 PM Bug #43421: mon spends too much time to build incremental osdmap
In my cluster, it took five minutes to build 1300 versions of the incremental osdmap.
patch: https://github.com/ceph/ceph/...
simon gao
09:49 AM Bug #43421 (Fix Under Review): mon spends too much time to build incremental osdmap
If a client's osdmap version is too low, the mon spends too much time building incremental osdmaps.
Mon can't handle norma...
simon gao

12/24/2019

05:03 AM Bug #43308 (Pending Backport): negative num_objects can set PG_STATE_DEGRADED
Kefu Chai
05:02 AM Bug #42780 (Pending Backport): recursive lock of OpTracker::lock (70)
Kefu Chai
01:53 AM Bug #43413 (New): Virtual IP address of iface lo results in failing to start an OSD
We added a virtual IP on the loopback interface lo to complete the LVS configuration.... gb li

12/23/2019

11:54 PM Bug #43412 (Resolved): cephadm ceph_manager IndexError: list index out of range
... Sage Weil
08:26 PM Backport #43140: nautilus: ceph-mon --mkfs: public_address type (v1|v2) is not respected
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/32028
merged
Reviewed-by: Ricardo Dias <rdias@suse.com>
Yuri Weinstein
02:18 PM Bug #43174: pgs inconsistent, union_shard_errors=missing
Hi David.
> Are you running your own Ceph build?
No, we use the official (community) build.
> Sortbitwise needed to...
Aleksandr Rudenko

12/21/2019

03:06 PM Bug #43404 (Resolved): mon crash in OSDMap::_pg_to_raw_osds from update_pending_pgs
... Sage Weil

12/20/2019

11:39 PM Bug #42328 (Resolved): osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
I can't check the original reports (logs have been removed), but assuming it's the same root cause PR #32382 5bb932c3... Samuel Just
01:31 AM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
I observed something similar on a ceph_test_rados teuthology run: sjust-2019-12-19_20:05:13-rados-wip-sjust-read-from... Samuel Just
11:37 PM Bug #43394 (Resolved): crimson::dmclock segv in crimson::IndIntruHeap
Should be fixed with PR #32380 2c9542901532feafd569d92e9f67ccd2e1af3129 Samuel Just
08:53 PM Bug #43403 (Resolved): unittest_lockdep unreliable
... Sage Weil
08:22 AM Bug #41255: backfill_toofull seen on cluster where the most full OSD is at 1%
Hi David:
Good to know the bug is indeed fixed ... too bad it didn't make it in 13.2.8. Anyways ... building patch...
Stefan Kooman
04:50 AM Bug #38345 (In Progress): mon: segv in MonOpRequest::~MonOpRequest OpHistory::cleanup
Brad Hubbard
01:50 AM Bug #43174: pgs inconsistent, union_shard_errors=missing

Scrub incorrectly thinks the object really isn't there, but we know it is.
The way that you can see missing obje...
David Zafman

12/19/2019

11:57 PM Bug #42780 (Fix Under Review): recursive lock of OpTracker::lock (70)
https://github.com/ceph/ceph/pull/32364 Radoslaw Zarzynski
12:09 PM Bug #42780 (In Progress): recursive lock of OpTracker::lock (70)
Radoslaw Zarzynski
10:30 PM Bug #43307 (Fix Under Review): Remove use of rules batching for upmap balancer
David Zafman
10:27 PM Bug #43397 (Resolved): FS_DEGRADED to cluster log despite --no-mon-health-to-clog
... Sage Weil
09:38 PM Bug #43394 (Resolved): crimson::dmclock segv in crimson::IndIntruHeap
... Sage Weil
07:06 PM Bug #41255: backfill_toofull seen on cluster where the most full OSD is at 1%
A backport to Mimic of the fix can be found here:
https://github.com/ceph/ceph/pull/32361
Or if you can build fro...
David Zafman
02:34 PM Bug #41255: backfill_toofull seen on cluster where the most full OSD is at 1%
We added a CRUSH policy (replicated_nvme) and set this policy on our cephfs metadata pool (with 1.2 billion objects) a... Stefan Kooman
07:02 PM Backport #41584 (In Progress): mimic: backfill_toofull seen on cluster where the most full OSD is...
David Zafman
02:29 PM Bug #43306: segv in collect_sys_info
Neha Ojha wrote:
> This looks similar to https://tracker.ceph.com/issues/38296, though the mon seems to have been up...
Nathan Cutler
02:22 PM Backport #39474 (In Progress): luminous: segv in fgets() in collect_sys_info reading /proc/cpuinfo
Nathan Cutler
02:18 PM Bug #41383 (Resolved): scrub object count mismatch on device_health_metrics pool
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
02:14 PM Backport #42739 (Resolved): nautilus: scrub object count mismatch on device_health_metrics pool
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31735
m...
Nathan Cutler
07:39 AM Bug #43382: medium io/system load causes quorum failure
Or due to limited bandwidth? 10G NICs dedicated. Anonymous
07:36 AM Bug #43382 (New): medium io/system load causes quorum failure
We just found out that if you put some I/O pressure on your system, e.g. with a big rsync, the mon process has issues proba... Anonymous
05:44 AM Bug #43126 (Fix Under Review): OSD_SLOW_PING_TIME_BACK nits
David Zafman
02:20 AM Bug #43318: monitor mark all services(osd mgr) down
mgr has no log when setting the debug_mgr to 40. simon gao

12/18/2019

10:31 PM Bug #43193 (Need More Info): "ceph ping mon.<id>" cannot work
Can you provide the sequence of commands that fail? Also, please attach the monitor names and monmap. Neha Ojha
10:25 PM Bug #43305 (Won't Fix): "psutil.NoSuchProcess process no longer exists" error in luminous-x-nauti...
This is an infra issue.... Neha Ojha
10:23 PM Bug #43306: segv in collect_sys_info
This looks similar to https://tracker.ceph.com/issues/38296, though the mon seems to have been upgraded to nautilus(w... Neha Ojha
10:17 PM Bug #43318 (Need More Info): monitor mark all services(osd mgr) down
Can you provide mgr logs from when this happened? Neha Ojha
10:12 PM Feature #43377 (Resolved): Make Zstandard compression level a configurable option
I've played with using the different compression algorithms on the RGWs and the default compression level for Zstanda... Bryan Stillwell
07:38 PM Backport #42739: nautilus: scrub object count mismatch on device_health_metrics pool
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31735
merged
Yuri Weinstein
03:53 PM Backport #43316 (Resolved): nautilus:wrong datatype describing crush_rule
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32254
m...
Nathan Cutler
12:11 PM Bug #43365: Nautilus: Random mon crashes in failed assertion at ceph::time_detail::signedspan
So it's asserting inside of to_timespan, and the Paxos code triggering that assert is
> auto start = ceph::coarse_...
Greg Farnum
12:03 PM Bug #43365 (Resolved): Nautilus: Random mon crashes in failed assertion at ceph::time_detail::sig...
Thanks to 14.2.5 auto warning for recent crashes, we are observing frequent (somewhat daily period) random crashes of... Alex Walender
09:35 AM Bug #43185: ceph -s not showing client activity
Possible relation to https://tracker.ceph.com/issues/43364 and https://tracker.ceph.com/issues/43317 Anonymous

12/17/2019

05:39 PM Bug #43308 (Fix Under Review): negative num_objects can set PG_STATE_DEGRADED
Neha Ojha
09:19 AM Backport #43346 (Resolved): nautilus: short pg log + cache tier ceph_test_rados out of order reply
https://github.com/ceph/ceph/pull/32848 Nathan Cutler
06:47 AM Bug #41950 (Can't reproduce): crimson compile
Kefu Chai
06:46 AM Bug #41950: crimson compile
I assume that you were trying to compile crimson-osd, not crimson-old. Please check the submodule of seastar to unders... Kefu Chai

12/16/2019

10:36 PM Bug #43296 (Need More Info): Ceph assimilate-conf results in config entries which can not be removed
Can you attach the (relevant) output from "ceph config-key dump | grep config"? I think the keys are being installed... Sage Weil
10:22 PM Bug #43296: Ceph assimilate-conf results in config entries which can not be removed
Might be related to #42964? Patrick Donnelly
10:06 PM Bug #43334 (Resolved): nautilus: rados/test_envlibrados_for_rocksdb.sh broken packages with ubunt...
Run: http://pulpito.ceph.com/yuriw-2019-12-15_16:25:11-rados-wip-yuri-nautilus-baseline_12.13.19-distro-basic-smithi/... Yuri Weinstein
08:36 PM Bug #38358 (Pending Backport): short pg log + cache tier ceph_test_rados out of order reply
Seen in nautilus: /a/yuriw-2019-12-15_16:25:11-rados-wip-yuri-nautilus-baseline_12.13.19-distro-basic-smithi/4605500/ Neha Ojha
12:40 PM Bug #43174 (New): pgs inconsistent, union_shard_errors=missing
Hmm this may be something else then. David, does it look familiar? Greg Farnum
08:40 AM Feature #43324: Make zlib windowBits configurable for compression
Xiyuan Wang wrote:
> Currently the zlib windowBits is hardcoded as -15 [1]. But it should be set to a different value for di...
Xiyuan Wang
03:38 AM Feature #43324 (Resolved): Make zlib windowBits configurable for compression
Currently the zlib windowBits is hardcoded as -15 [1]. But it should be set to a different value for different cases.
Accor...
Xiyuan Wang
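As a rough illustration of why a single hardcoded windowBits value may not fit every case, the sketch below uses Python's standard zlib module (not Ceph's compressor plugin, which is an assumption-free substitute chosen only for brevity) to show how the value selects the stream framing. `deflate` and `payload` are names made up for this example.

```python
import zlib


def deflate(data, wbits):
    """Compress data with an explicit windowBits value."""
    comp = zlib.compressobj(zlib.Z_DEFAULT_COMPRESSION, zlib.DEFLATED, wbits)
    return comp.compress(data) + comp.flush()


payload = b"ceph " * 1000

raw = deflate(payload, -15)      # negative: raw deflate, no header or trailer
zlib_fmt = deflate(payload, 15)  # 9..15: zlib header and Adler-32 trailer
gzip_fmt = deflate(payload, 31)  # 16 + 15: gzip header and CRC-32 trailer

# The reader has to be told the same framing that the writer used.
assert zlib.decompress(raw, -15) == payload
assert zlib.decompress(zlib_fmt, 15) == payload
assert zlib.decompress(gzip_fmt, 31) == payload
```

The magnitude of the value is the base-2 logarithm of the history window size, so a smaller magnitude trades compression ratio for memory.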
07:27 AM Backport #43325 (In Progress): luminous: wrong datatype describing crush_rule
Deepika Upadhyay
07:24 AM Backport #43325 (New): luminous: wrong datatype describing crush_rule
Deepika Upadhyay
07:24 AM Backport #43325 (Resolved): luminous: wrong datatype describing crush_rule
https://github.com/ceph/ceph/pull/32267 Deepika Upadhyay

12/15/2019

10:04 PM Documentation #41389 (Pending Backport): wrong datatype describing crush_rule
Nathan Cutler
03:55 PM Bug #38076 (Resolved): osds allows to partially start more than N+2
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
03:53 PM Feature #40528 (Resolved): Better default value for osd_snap_trim_sleep
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
03:53 PM Backport #43320 (Resolved): mimic: PeeringState::GoClean will call purge_strays unconditionally
https://github.com/ceph/ceph/pull/33329 Nathan Cutler
03:53 PM Backport #43319 (Resolved): nautilus: PeeringState::GoClean will call purge_strays unconditionally
https://github.com/ceph/ceph/pull/32847 Nathan Cutler
01:27 PM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
Looking at the historical test runs, it seems to have started after [1] but before [2].
[1] http://pulpito.ceph.co...
Jason Dillaman
01:30 AM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
http://qa-proxy.ceph.com/teuthology/teuthology-2019-12-02_02:01:02-rbd-master-distro-basic-smithi/4559106/teuthology.log Jason Dillaman
01:29 AM Bug #42328: osd/PrimaryLogPG.cc: 3962: ceph_abort_msg("out of order op")
http://qa-proxy.ceph.com/teuthology/jdillaman-2019-12-14_17:15:11-rbd-wip-jd-testing-distro-basic-smithi/4603518/teut... Jason Dillaman
06:55 AM Bug #43318 (Need More Info): monitor mark all services(osd mgr) down
Suddenly, all mgrs and osds in my cluster began to be marked down by the monitor.
The monitor log looks like this:
...
simon gao

12/14/2019

08:28 AM Documentation #41389 (In Progress): wrong datatype describing crush_rule
Deepika Upadhyay
07:21 AM Documentation #41389 (Pending Backport): wrong datatype describing crush_rule
Deepika Upadhyay
02:42 AM Documentation #41389: wrong datatype describing crush_rule
Just needs a cherry-pick of 3ed3de6c964ba998d5b18ceb997d1a6dffe355db Neha Ojha
08:26 AM Backport #43315 (In Progress): mimic:wrong datatype describing crush_rule
Deepika Upadhyay
08:02 AM Backport #43315 (Resolved): mimic:wrong datatype describing crush_rule
https://github.com/ceph/ceph/pull/32255 Deepika Upadhyay
08:24 AM Backport #43316 (In Progress): nautilus:wrong datatype describing crush_rule
Deepika Upadhyay
08:03 AM Backport #43316 (Resolved): nautilus:wrong datatype describing crush_rule
https://github.com/ceph/ceph/pull/32254 Deepika Upadhyay
02:50 AM Bug #43307 (In Progress): Remove use of rules batching for upmap balancer
David Zafman
02:49 AM Bug #43312 (In Progress): Change default upmap_max_deviation to 5
David Zafman
02:06 AM Bug #43312 (Resolved): Change default upmap_max_deviation to 5
David Zafman
12:24 AM Bug #43311 (Resolved): asynchronous recovery + backfill might spin pg undersized for a long time
When an osd that is part of current up set gets chosen as an
async_recovery_target, it gets removed from the acting ...
xie xingguo
12:16 AM Bug #43308 (In Progress): negative num_objects can set PG_STATE_DEGRADED
Neha Ojha

12/13/2019

08:40 PM Bug #40963 (Resolved): mimic: MQuery during Deleting state
Sage Weil
08:40 PM Bug #41317 (Pending Backport): PeeringState::GoClean will call purge_strays unconditionally
Sage Weil
07:47 PM Bug #43308 (Resolved): negative num_objects can set PG_STATE_DEGRADED
... Neha Ojha
07:05 PM Bug #43296: Ceph assimilate-conf results in config entries which can not be removed
Alwin from Proxmox provided a work around but this still appears to be a bug:
https://forum.proxmox.com/threads/ceph...
David Herselman
04:51 PM Bug #43296: Ceph assimilate-conf results in config entries which can not be removed
Setting debug_rdb to 5/5 unfortunately doesn't reveal anything:
Commands:...
David Herselman
03:37 AM Bug #43296 (Resolved): Ceph assimilate-conf results in config entries which can not be removed
We assimilated our Ceph configuration file and subsequently have a minimal config file. We are subsequently not able ... David Herselman
04:31 PM Bug #43307 (Resolved): Remove use of rules batching for upmap balancer

Due to cost of calculations for very large PG/shard counts, we will settle for balancing each pool individually for...
David Zafman
03:43 PM Bug #25174 (Can't reproduce): osd: assert failure with FAILED assert(repop_queue.front() == repop...
Neha Ojha
02:43 PM Bug #43306 (Resolved): segv in collect_sys_info
Run: http://pulpito.ceph.com/teuthology-2019-12-13_02:25:03-upgrade:luminous-x-nautilus-distro-basic-smithi/
Job: '4...
Yuri Weinstein
02:40 PM Bug #43305 (Won't Fix): "psutil.NoSuchProcess process no longer exists" error in luminous-x-nauti...
Run: http://pulpito.ceph.com/teuthology-2019-12-13_02:25:03-upgrade:luminous-x-nautilus-distro-basic-smithi/
Jobs: '...
Yuri Weinstein
08:23 AM Backport #42259 (Resolved): nautilus: document new option mon_max_pg_per_osd
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31300
m...
Nathan Cutler
08:22 AM Backport #40947 (Resolved): luminous: Better default value for osd_snap_trim_sleep
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31857
m...
Nathan Cutler
08:22 AM Backport #38205 (Resolved): luminous: osds allows to partially start more than N+2
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31858
m...
Nathan Cutler
08:22 AM Backport #43093 (Resolved): luminous: Improve OSDMap::calc_pg_upmaps() efficiency
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31992
m...
Nathan Cutler
06:17 AM Bug #40712: ceph-mon crash with assert(err == 0) after rocksdb->get
We hit this problem recently.
We are inclined to think this is related more to rocksdb than to ceph.
huang jun

12/12/2019

04:41 PM Backport #40947: luminous: Better default value for osd_snap_trim_sleep
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31857
merged
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Yuri Weinstein
04:41 PM Backport #38205: luminous: osds allows to partially start more than N+2
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/31858
merged
Yuri Weinstein
04:40 PM Backport #43093: luminous: Improve OSDMap::calc_pg_upmaps() efficiency
David Zafman wrote:
> https://github.com/ceph/ceph/pull/31992
merged
Yuri Weinstein
10:16 AM Bug #43174: pgs inconsistent, union_shard_errors=missing
Greg, thanks for the reply.
Greg Farnum wrote:
> If you fetch an object in RGW and its backing RADOS objects are m...
Aleksandr Rudenko
09:41 AM Bug #38330 (Resolved): osd/OSD.cc: 1515: abort() in Service::build_incremental_map_msg
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:23 AM Backport #43119 (Resolved): mimic: osd/OSD.cc: 1515: abort() in Service::build_incremental_map_msg
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32000
m...
Nathan Cutler
08:44 AM Bug #43193: "ceph ping mon.<id>" cannot work
The command "ceph ping mon.a" or "ceph ping mon.b" or "ceph ping mon.c" works fine.
If the mon id is not specified, ...
Min Shi
05:31 AM Bug #41317 (Fix Under Review): PeeringState::GoClean will call purge_strays unconditionally
Neha Ojha
12:04 AM Bug #43267 (Rejected): unexpected error in BlueStore::_txc_add_transaction
Jeff Layton
12:02 AM Bug #43267: unexpected error in BlueStore::_txc_add_transaction
Nope, it was full. Well spotted:... Jeff Layton
 
