Project

General

Profile

Activity

From 03/30/2020 to 04/28/2020

04/28/2020

08:04 PM Bug #44076 (Resolved): mon: update + monmap update triggers spawn loop
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
08:04 PM Bug #44248 (Resolved): Receiving RemoteBackfillReserved in WaitLocalBackfillReserved can cause th...
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
08:02 PM Backport #45314 (Resolved): octopus: scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed...
https://github.com/ceph/ceph/pull/34830 Nathan Cutler
07:28 PM Support #45270 (Resolved): after reboot osd move to localhost
I believe this has been discussed several times on the mailing list. If your OSDs don't get reliably told what their ... Greg Farnum
05:51 PM Backport #45041 (In Progress): octopus: osd: incorrect read bytes stat in SPARSE_READ
Nathan Cutler
05:47 PM Backport #44842 (In Progress): octopus: nautilus: FAILED ceph_assert(head.version == 0 || e.versi...
Nathan Cutler
05:46 PM Backport #44685 (In Progress): octopus: osd/osd-backfill-stats.sh TEST_backfill_out2: wait_for_cl...
Nathan Cutler
03:44 PM Bug #45075 (Pending Backport): scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
Neha Ojha
12:05 AM Bug #45075: scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
https://github.com/ceph/ceph/pull/34602 merged Yuri Weinstein
09:25 AM Backport #44324: nautilus: Receiving RemoteBackfillReserved in WaitLocalBackfillReserved can caus...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34512
m...
Nathan Cutler
03:01 AM Backport #44324 (Resolved): nautilus: Receiving RemoteBackfillReserved in WaitLocalBackfillReserv...
Wei-Chung Cheng
09:25 AM Backport #44289: nautilus: mon: update + monmap update triggers spawn loop
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34500
m...
Nathan Cutler
03:00 AM Backport #44289 (Resolved): nautilus: mon: update + monmap update triggers spawn loop
Wei-Chung Cheng
02:56 AM Backport #44370 (In Progress): nautilus: msg/async: the event center is blocked by rdma construct...
Wei-Chung Cheng
02:05 AM Bug #45298 (Resolved): cram: balancer/misplaced.t fails with 'Error EAGAIN: Some objects (0.00891...
/a/teuthology-2020-04-26_07:01:02-rados-master-distro-basic-smithi/4985666... Brad Hubbard
01:31 AM Bug #36304: FAILED ceph_assert(p != pg_slots.end()) in OSDShard::register_and_wake_split_child(PG*)
/a/teuthology-2020-04-26_07:01:02-rados-master-distro-basic-smithi/4986119 Brad Hubbard

04/27/2020

09:32 PM Backport #44324: nautilus: Receiving RemoteBackfillReserved in WaitLocalBackfillReserved can caus...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/34512
merged
Yuri Weinstein
06:03 PM Bug #45292 (Need More Info): pg autoscaler merging issue
Encountering an issue where placement groups (pgs) go into status *stuck inactive* and hang in that status. This appe... Brian Wickersham
11:51 AM Bug #44286: Cache tiering shows unfound objects after OSD reboots
Issue still present on 14.2.8. Preben Berg
08:25 AM Bug #41735 (Resolved): pg_autoscaler throws HEALTH_WARN with auto_scale on for all pools
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
08:24 AM Bug #43365 (Resolved): Nautilus: Random mon crashes in failed assertion at ceph::time_detail::sig...
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
03:30 AM Feature #43377 (Resolved): Make Zstandard compression level a configurable option
Kefu Chai

04/26/2020

02:16 PM Support #45270 (Resolved): after reboot osd move to localhost
if my host retrieve hostname from DNS server PTR line, not set from hostnamectl set-hostname node01, i have next prob... Ilia Seleznev

04/25/2020

06:24 PM Bug #45202: Repeatedly OSD crashes in PrimaryLogPG::hit_set_trim()
I'm got a crash for another OSD on 4rd node, and last lines in log are not related to PG 2.f8:... KOT MATPOCKuH
12:55 PM Bug #45253: Inconsistent characters allowed set for device classes
Version is mimic 13.2.8. Sorry, forgot. Frank Schilder
12:02 AM Bug #45266 (Resolved): follower monitors can grow beyond memory target
The leader monitor periordically tells tcmalloc to release memory back to the OS, but follower monitors do not. This ... Josh Durgin

04/24/2020

07:24 PM Backport #45231 (Resolved): nautilus: pg_autoscaler throws HEALTH_WARN with auto_scale on for all...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34618
m...
Nathan Cutler
06:20 PM Backport #45231 (In Progress): nautilus: pg_autoscaler throws HEALTH_WARN with auto_scale on for ...
Nathan Cutler
06:31 PM Backport #44486 (Resolved): nautilus: Nautilus: Random mon crashes in failed assertion at ceph::t...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34542
m...
Nathan Cutler
03:07 PM Bug #45241: Error message: Mount failed with '(22) Invalid argument' when trying to import using ...
Using --debug it turned out that the exported PGs have different fsid than the odd-directory I am trying to import. f... E Shadabi
09:28 AM Bug #45253 (New): Inconsistent characters allowed set for device classes
I changed the device class of a number of disks yesterday successfully to "rbd.meta":... Frank Schilder
02:59 AM Bug #45243 (New): nautilus: qa/standalone/scrub/osd-scrub-repair.sh fails with osd-scrub-repair.s...
/a/yuriw-2020-04-18_19:56:53-rados-wip-yuri4-testing-2020-04-18-1756-nautilus-distro-basic-smithi/4965037... Brad Hubbard
02:09 AM Fix #45140: osd/tiering: flush cache pool may lead to slow write requests
https://github.com/ceph/ceph/pull/34623 Arvin Liang

04/23/2020

10:48 PM Bug #45241 (New): Error message: Mount failed with '(22) Invalid argument' when trying to import ...

Hi !
When I am running the following command, I get a not so descriptive error messages:
ceph-objectstore-tool --...
E Shadabi
09:06 PM Bug #45240 (New): Not able to export objects using ceph-objectstore-tool
I am trying to use the tool ceph-objectstore-tool to extract objects from offline OSD:
ceph-objectstore-tool --dat...
E Shadabi
03:47 PM Bug #43365: Nautilus: Random mon crashes in failed assertion at ceph::time_detail::signedspan
For what it's worth we're still seeing it after upgrading debian to 10.3 and installing kernel "5.4.0-0.bpo.3-amd64 #... Edwin Pers
01:39 PM Bug #40112 (Resolved): mon: rados/multimon tests fail with clock skew
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
01:38 PM Backport #45231 (Resolved): nautilus: pg_autoscaler throws HEALTH_WARN with auto_scale on for all...
https://github.com/ceph/ceph/pull/34618 Nathan Cutler
01:36 PM Bug #43889 (Resolved): expected MON_CLOCK_SKEW but got none
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
01:36 PM Documentation #43896 (Resolved): nautilus upgrade should recommend ceph-osd restarts after enabli...
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
01:36 PM Backport #45224 (Resolved): nautilus: LibRadosWatchNotify.WatchNotify failure
https://github.com/ceph/ceph/pull/35049 Nathan Cutler
12:30 PM Bug #45202: Repeatedly OSD crashes in PrimaryLogPG::hit_set_trim()
After a hour of work both OSD's continued to crash every several seconds KOT MATPOCKuH
12:08 PM Bug #45202 (New): Repeatedly OSD crashes in PrimaryLogPG::hit_set_trim()
After a network troubles I got 1 pg in a state recovery_unfound
I tried to solve this problem using command:
<pre...
KOT MATPOCKuH
08:20 AM Backport #45040 (Resolved): nautilus: mon: reset min_size when changing pool size
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34585
m...
Nathan Cutler
08:17 AM Backport #44908 (Resolved): mimic: mon: rados/multimon tests fail with clock skew
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34370
m...
Nathan Cutler
08:17 AM Backport #44083 (Resolved): mimic: expected MON_CLOCK_SKEW but got none
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34370
m...
Nathan Cutler

04/22/2020

11:53 PM Bug #44062 (Pending Backport): LibRadosWatchNotify.WatchNotify failure
Brad Hubbard
11:52 PM Bug #44062: LibRadosWatchNotify.WatchNotify failure
Seeing this in Nautilus so setting backport.
http://pulpito.ceph.com/yuriw-2020-04-21_20:54:00-rados-wip-yuri8-tes...
Brad Hubbard
05:43 PM Bug #45191 (New): erasure-code/test-erasure-eio.sh: TEST_ec_single_recovery_error fails
... Neha Ojha
05:15 PM Bug #45190 (New): osd dump times out
... Neha Ojha
10:04 AM Backport #44468 (In Progress): nautilus: mon: Get session_map_lock before remove_session
Wei-Chung Cheng

04/21/2020

11:19 PM Backport #44908: mimic: mon: rados/multimon tests fail with clock skew
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/34370
merged
Yuri Weinstein
11:19 PM Backport #44083: mimic: expected MON_CLOCK_SKEW but got none
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/34370
merged
Yuri Weinstein
11:09 PM Backport #45040: nautilus: mon: reset min_size when changing pool size
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/34585
merged
Yuri Weinstein
11:08 PM Bug #45168 (New): mimic: cephtool/test.sh: test_mon_osd_pool_set failure
... Neha Ojha
06:46 PM Backport #45053 (Resolved): octopus: nautilus upgrade should recommend ceph-osd restarts after en...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34523
m...
Nathan Cutler
06:37 PM Backport #45054 (Resolved): nautilus: nautilus upgrade should recommend ceph-osd restarts after e...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34524
m...
Nathan Cutler
12:34 PM Bug #44715 (Fix Under Review): common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_li...
Sebastian Wagner
02:45 AM Bug #39039: mon connection reset, command not resent
I tested this on a lab cluster after disabling cephx per https://docs.ceph.com/docs/octopus/rados/configuration/auth-... Tony Davies
02:10 AM Bug #45113 (Resolved): workunits/cls/test_cls_cmpomap.sh fails
Brad Hubbard
02:10 AM Bug #44901: luminous: osd continue down because of the hearbeattimeout
Slove it! It is because we deploy ceph in the docker use kolla asible.
We start some dockers by hand and miss some...
jack ma

04/20/2020

02:24 AM Bug #39039: mon connection reset, command not resent
This also continues to happen on octopus, I just tested on 15.2.0.
I have attached the build instructions I used t...
Tony Davies

04/18/2020

08:06 AM Fix #45140: osd/tiering: flush cache pool may lead to slow write requests
Pull request ID: 34623 Arvin Liang
07:26 AM Fix #45140 (New): osd/tiering: flush cache pool may lead to slow write requests
In OSD tiering, when flushing objects from cache pool to base pool, there are two problems can lead to slow request:
...
Arvin Liang

04/17/2020

10:25 PM Bug #45139: osd/osd-markdown.sh: markdown_N_impl failure
This was seen after the fix for https://tracker.ceph.com/issues/44662 merged. Neha Ojha
10:24 PM Bug #45139 (New): osd/osd-markdown.sh: markdown_N_impl failure
... Neha Ojha
09:57 PM Bug #43888: osd/osd-bench.sh 'tell osd.N bench' hang
Still fails occasionally
/a/nojha-2020-04-10_22:42:57-rados:standalone-master-distro-basic-smithi/4943804/
Neha Ojha
05:02 PM Bug #41735: pg_autoscaler throws HEALTH_WARN with auto_scale on for all pools
nautilus backport: https://github.com/ceph/ceph/pull/34618 Neha Ojha
04:54 PM Bug #41735 (Pending Backport): pg_autoscaler throws HEALTH_WARN with auto_scale on for all pools
This change needs to be backported into Nautilus to fix a regression (#45135) Lenz Grimmer
07:21 AM Bug #45113 (Fix Under Review): workunits/cls/test_cls_cmpomap.sh fails
Kefu Chai

04/16/2020

11:23 PM Bug #45121 (New): nautilus: osd-scrub-snaps.sh: TEST_scrub_snaps failure
... Neha Ojha
11:09 PM Bug #45075 (Fix Under Review): scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
Neha Ojha
07:44 PM Bug #45075 (In Progress): scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
Neha Ojha
05:50 PM Bug #45075: scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
/a/bhubbard-2020-04-16_09:57:54-rados-wip-badone-testing-distro-basic-smithi/4957883/ Neha Ojha
04:15 PM Bug #45113 (Triaged): workunits/cls/test_cls_cmpomap.sh fails
Thank you Casey! i will see if we can use the default list. Kefu Chai
02:54 PM Bug #45113: workunits/cls/test_cls_cmpomap.sh fails
i didn't realize this ran in the rados suite. it's passing in the rgw/verify suite
it looks like the rados suite...
Casey Bodley
02:25 PM Bug #45113 (Resolved): workunits/cls/test_cls_cmpomap.sh fails
... Kefu Chai
09:39 AM Backport #45038 (In Progress): mimic: mon: reset min_size when changing pool size
Nathan Cutler
09:37 AM Backport #45040 (In Progress): nautilus: mon: reset min_size when changing pool size
Nathan Cutler
08:28 AM Feature #44025 (Resolved): Make it harder to set pool replica size to 1
Nathan Cutler
12:30 AM Bug #45076 (Fix Under Review): rados: Sharded OpWQ drops suicide_grace after waiting for work
Dan Hill

04/15/2020

09:33 PM Bug #45008: [osd crash]The ceph-osd assert with rbd bench io
Sebastian Wagner wrote:
> duplicate of 44715 ?
Looks like a dup of 42347, which was on the osd.
Neha Ojha
03:54 PM Bug #45008: [osd crash]The ceph-osd assert with rbd bench io
duplicate of 44715 ? Sebastian Wagner
03:54 PM Bug #44715: common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_list.back())->ops_in_...
http://pulpito.ceph.com/swagner-2020-04-15_09:10:55-rados-wip-swagner2-testing-2020-04-14-1813-distro-basic-smithi/ Sebastian Wagner

04/14/2020

10:25 AM Feature #45079 (New): HEALTH_WARN, if require-osd-release is < mimic and OSD wants to join the cl...
When upgrading a cluster to octopus, users should get a warning, if require-osd-release is < mimic as this prevents o... Sebastian Wagner
08:54 AM Bug #44715: common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_list.back())->ops_in_...
http://pulpito.ceph.com/swagner-2020-04-09_21:46:02-rados-wip-swagner2-testing-2020-04-09-1541-distro-basic-smithi/ Sebastian Wagner

04/13/2020

10:39 PM Bug #45076 (Resolved): rados: Sharded OpWQ drops suicide_grace after waiting for work
The Sharded OpWQ will opportunistically wait for more work when processing an empty queue. While waiting, the default... Dan Hill
07:45 PM Bug #45075 (Resolved): scrub/osd-scrub-repair.sh: TEST_auto_repair_bluestore_failed failure
... Neha Ojha
06:08 PM Backport #44486 (In Progress): nautilus: Nautilus: Random mon crashes in failed assertion at ceph...
Nathan Cutler
02:44 PM Backport #43232: nautilus: pgs stuck in laggy state
@Neha - Can you make a decision whether to backport this to nautilus or not? Sage wrote:
"I'm not sure whether we ...
Nathan Cutler

04/12/2020

10:34 PM Bug #44883 (Resolved): upgrade to octopus can complain about orchestrator_cli
Sage Weil
11:25 AM Backport #45039 (In Progress): octopus: mon: reset min_size when changing pool size
Nathan Cutler
11:24 AM Feature #44025 (Pending Backport): Make it harder to set pool replica size to 1
Nathan Cutler

04/11/2020

11:43 AM Backport #45054 (In Progress): nautilus: nautilus upgrade should recommend ceph-osd restarts afte...
Nathan Cutler
09:40 AM Backport #45054 (Resolved): nautilus: nautilus upgrade should recommend ceph-osd restarts after e...
https://github.com/ceph/ceph/pull/34524 Nathan Cutler
11:39 AM Backport #45053 (In Progress): octopus: nautilus upgrade should recommend ceph-osd restarts after...
Nathan Cutler
09:39 AM Backport #45053 (Resolved): octopus: nautilus upgrade should recommend ceph-osd restarts after en...
https://github.com/ceph/ceph/pull/34523 Nathan Cutler
09:42 AM Feature #41666 (Resolved): Issue a HEALTH_WARN when a Pool is configured with [min_]size == 1
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:38 AM Bug #44684 (Resolved): pgs entering premerge state that still need backfill
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:37 AM Backport #45041 (Resolved): octopus: osd: incorrect read bytes stat in SPARSE_READ
https://github.com/ceph/ceph/pull/34809 Nathan Cutler
09:37 AM Backport #45040 (Resolved): nautilus: mon: reset min_size when changing pool size
https://github.com/ceph/ceph/pull/34585 Nathan Cutler
09:37 AM Backport #45039 (Resolved): octopus: mon: reset min_size when changing pool size
https://github.com/ceph/ceph/pull/34528 Nathan Cutler
09:36 AM Backport #45038 (Rejected): mimic: mon: reset min_size when changing pool size
https://github.com/ceph/ceph/pull/34586 Nathan Cutler

04/10/2020

04:27 PM Backport #44324 (In Progress): nautilus: Receiving RemoteBackfillReserved in WaitLocalBackfillRes...
Wei-Chung Cheng
06:44 AM Backport #45025 (Need More Info): mimic: hung osd_repop, bluestore committed but failed to trigge...
To backport https://github.com/ceph/ceph/pull/24761 to mimic we would first need to backport https://github.com/ceph/... Nathan Cutler
06:26 AM Backport #45025 (Rejected): mimic: hung osd_repop, bluestore committed but failed to trigger repo...
Nathan Cutler
05:31 AM Bug #36473: hung osd_repop, bluestore committed but failed to trigger repop_commit
24761 Nathan Cutler
03:29 AM Bug #25174: osd: assert failure with FAILED assert(repop_queue.front() == repop) In function 'vo...
this is likely duplicated with https://tracker.ceph.com/issues/22570
and resolved by https://github.com/ceph/ceph/pu...
Yan Jun

04/09/2020

03:51 PM Bug #44352: pool listings are slow after deleting objects
I think this is a known issue with slow [omap] listing caused by RocksDB fragmentation.
There was a bunch of improve...
Igor Fedotov
02:30 PM Bug #44352: pool listings are slow after deleting objects
radosgw-admin commands are just listing the pool, and the performance degradation happens with 'rados ls' too - movin... Casey Bodley
02:38 PM Backport #44289 (In Progress): nautilus: mon: update + monmap update triggers spawn loop
Wei-Chung Cheng
09:46 AM Backport #44711 (Resolved): nautilus: pgs entering premerge state that still need backfill
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34354
m...
Nathan Cutler
09:44 AM Backport #42662 (Resolved): nautilus:Issue a HEALTH_WARN when a Pool is configured with [min_]siz...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31842
m...
Nathan Cutler
09:43 AM Backport #44360 (Resolved): nautilus: Rados should use the '-o outfile' convention
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/33641
m...
Nathan Cutler
06:26 AM Bug #45008 (New): [osd crash]The ceph-osd assert with rbd bench io
ceph version: 14.2.5
OS:centos 7.6.1810
Procedure:
1, Create a rbd image.
2, Use rbd bench tool to write some dat...
haitao chen
02:47 AM Bug #44981: rados/test_envlibrados_for_rocksdb.sh build failure (seen in nautilus)
We should be testing the version the rocksdb submodule is pointing to. In nautilus that's...
$ git submodule statu...
Brad Hubbard
01:55 AM Bug #44981: rados/test_envlibrados_for_rocksdb.sh build failure (seen in nautilus)
We are trying to compile rocksdb master with gcc 4.8.5 but std::max_align_t only became available in 4.9. Brad Hubbard

04/08/2020

10:53 PM Backport #44711: nautilus: pgs entering premerge state that still need backfill
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/34354
merged
Yuri Weinstein
10:07 PM Bug #44939: The mon and/or osd pod memory consumption is not even. One of them consumes about 50%...
What is your mon_memory_target and osd_memory_target?
Uneven memory on the mons is likely due to the leader doing ...
Josh Durgin
09:50 PM Bug #42347: nautilus assert during osd shutdown: FAILED ceph_assert((sharded_in_flight_list.back(...
This is still an issue on 14.2.8 (at least the one shipped with proxmox):... Bastian Mäuser
03:59 PM Bug #45001 (Duplicate): mon+cephadm: ceph_assert((sharded_in_flight_list.back())->ops_in_flight_s...
Sebastian Wagner
03:29 PM Bug #45001 (Duplicate): mon+cephadm: ceph_assert((sharded_in_flight_list.back())->ops_in_flight_s...
http://pulpito.ceph.com/swagner-2020-04-08_10:27:55-rados-wip-swagner2-testing-2020-04-08-0014-distro-basic-smithi/49... Sebastian Wagner
03:31 PM Bug #44715: common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_list.back())->ops_in_...
http://pulpito.ceph.com/swagner-2020-04-08_10:27:55-rados-wip-swagner2-testing-2020-04-08-0014-distro-basic-smithi/49... Sebastian Wagner
07:43 AM Bug #44827 (Pending Backport): osd: incorrect read bytes stat in SPARSE_READ
Kefu Chai
07:40 AM Bug #44862 (Pending Backport): mon: reset min_size when changing pool size
Kefu Chai

04/07/2020

06:54 PM Backport #42662: nautilus:Issue a HEALTH_WARN when a Pool is configured with [min_]size == 1
Sridhar Seshasayee wrote:
> https://github.com/ceph/ceph/pull/31842
merged
Yuri Weinstein
05:51 PM Backport #44360: nautilus: Rados should use the '-o outfile' convention
Kefu Chai wrote:
> https://github.com/ceph/ceph/pull/33641
merged
Yuri Weinstein
04:35 PM Bug #44981 (Resolved): rados/test_envlibrados_for_rocksdb.sh build failure (seen in nautilus)
... Neha Ojha

04/06/2020

02:44 PM Bug #44959 (Closed): health warning: pgs not deep-scrubbed in time although it was in time
Hi!
Some of my PGs are listed as "not scrubbed in time" in my 14.2.8 cluster.
My scrub settings are:...
Jonas Jelten
11:00 AM Backport #44835 (Need More Info): nautilus: librados mon_command (mgr) command hang
non-trivial due to post-nautilus refactoring Nathan Cutler
10:44 AM Backport #44836 (In Progress): octopus: librados mon_command (mgr) command hang
Nathan Cutler
02:59 AM Bug #44945: Mon High CPU usage when another mon syncing from it
It is probably relate with huge removed_snap keys. Xiaoxi Chen
02:58 AM Bug #44945 (Need More Info): Mon High CPU usage when another mon syncing from it
Each sync request take very long time to come back, as show below. And from the TOP of source-mon, the CPU was on d... Xiaoxi Chen

04/04/2020

08:16 PM Bug #44939: The mon and/or osd pod memory consumption is not even. One of them consumes about 50%...
Here's overall memory consumption under traffic.The OSD consumes much more memory as well.
# knc top pods | egrep...
Yan Zhao
08:10 PM Bug #44939 (New): The mon and/or osd pod memory consumption is not even. One of them consumes abo...
This is a ceph deployment with rook release 1.2.7/ceph 14.2.8. After deployment, one of the mon pods and/or osd pods ... Yan Zhao

04/03/2020

09:32 PM Documentation #43896 (Pending Backport): nautilus upgrade should recommend ceph-osd restarts afte...
Josh Durgin
09:02 PM Bug #36473 (Pending Backport): hung osd_repop, bluestore committed but failed to trigger repop_co...
Tagging for Mimic backport consideration. Dan Hill
09:01 PM Bug #36473: hung osd_repop, bluestore committed but failed to trigger repop_commit
Mimic (pr#22739 + pr#24269) introduced this race condition, which was fixed in Nautilus (pr#24761).
Was this evalu...
Dan Hill

04/02/2020

10:54 PM Bug #44901 (Rejected): luminous: osd continue down because of the hearbeattimeout
There is clearly an issue with your network which is not a ceph issue. Brad Hubbard
02:22 AM Bug #44901 (Rejected): luminous: osd continue down because of the hearbeattimeout
HI! all! Thanks for reading this msg.
I hava one ceph cluster installed with ceph V12.2.12. It runs well for abo...
jack ma
10:25 PM Bug #44631: ceph pg dump error code 124
I think the pg dump command is timing out for some reason. The timestamps between the following log lines indicate th... Neha Ojha
10:57 AM Backport #44908 (In Progress): mimic: mon: rados/multimon tests fail with clock skew
Nathan Cutler
10:51 AM Backport #44908 (Resolved): mimic: mon: rados/multimon tests fail with clock skew
https://github.com/ceph/ceph/pull/34370 Nathan Cutler
10:53 AM Backport #44083 (In Progress): mimic: expected MON_CLOCK_SKEW but got none
Nathan Cutler
10:51 AM Bug #40112 (Pending Backport): mon: rados/multimon tests fail with clock skew
Nathan Cutler
01:28 AM Bug #44815 (Fix Under Review): Pool stats increase after PG merged (PGMap::apply_incremental does...
Greg Farnum
01:27 AM Bug #44797 (Closed): mon/cephx : trace of a deleted customer in the "auth" index
Greg Farnum

04/01/2020

09:12 PM Bug #44859 (Closed): add osd ceph cluster status slow requests are blocked > 32 sec. Implicated o...
It sounds like you may be running into the maximum pgs per osd limits. You can increase these to get around this. If ... Josh Durgin
08:13 PM Backport #44711 (In Progress): nautilus: pgs entering premerge state that still need backfill
Nathan Cutler
07:50 PM Backport #44847: octopus: osd-backfill-recovery-log.sh fails
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34313
m...
Nathan Cutler
06:55 PM Backport #44847 (Resolved): octopus: osd-backfill-recovery-log.sh fails
Sage Weil
07:34 PM Bug #43807 (Resolved): osd-backfill-recovery-log.sh fails
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
02:12 PM Bug #44755 (In Progress): Create stronger affinity between drivegroup specs and osd daemons
Joshua Schmid
01:35 PM Bug #44884: mon: weight-set create may return on uncomitted state
... Sage Weil
01:33 PM Bug #44884 (New): mon: weight-set create may return on uncomitted state
... Sage Weil
01:28 PM Bug #44883 (Resolved): upgrade to octopus can complain about orchestrator_cli
... Sage Weil
01:22 PM Bug #44882 (New): osd: leaked buffer (alloc via CephxAuthorizeHandler::verify_authorizer)
... Sage Weil
09:39 AM Bug #44862 (In Progress): mon: reset min_size when changing pool size
Deepika Upadhyay
02:57 AM Bug #39039: mon connection reset, command not resent
#44197 looks related? Brad Hubbard

03/31/2020

04:37 PM Bug #44827 (Fix Under Review): osd: incorrect read bytes stat in SPARSE_READ
Neha Ojha
04:38 AM Bug #44827 (Resolved): osd: incorrect read bytes stat in SPARSE_READ
the local vaiable 'total_read', which is always zero in code was used to accumulate total bytes it reads from
bluest...
Yan Jun
03:22 PM Bug #44862 (Resolved): mon: reset min_size when changing pool size
See https://github.com/rook/rook/issues/5127
Currently 'ceph osd pool set size x' only changes min_size if it's ab...
Josh Durgin
03:19 PM Bug #44859: add osd ceph cluster status slow requests are blocked > 32 sec. Implicated osds 10,15
日志里面出现如下信息:maybe_wait_for_max_pg withhold creation of pg 8.11: 600 >= 600 xiaoyi zhang
03:16 PM Bug #44859 (Closed): add osd ceph cluster status slow requests are blocked > 32 sec. Implicated o...
hello
I have a question.I add osd in a ceph cluster.ceph version is 12.2.8.but cluster status show slow requests ar...
xiaoyi zhang
10:10 AM Bug #43807: osd-backfill-recovery-log.sh fails
Neha Ojha wrote:
> Nathan, we need https://github.com/ceph/ceph/pull/34126 as well - See https://tracker.ceph.com/is...
Nathan Cutler
10:08 AM Backport #44847 (In Progress): octopus: osd-backfill-recovery-log.sh fails
Nathan Cutler
10:06 AM Backport #44847 (Resolved): octopus: osd-backfill-recovery-log.sh fails
https://github.com/ceph/ceph/pull/34313 Nathan Cutler
10:03 AM Bug #41424 (Resolved): readable.sh test fails
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:02 AM Documentation #42221 (Resolved): document new option mon_max_pg_per_osd
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:01 AM Bug #42810 (Resolved): ceph config rm does not revert debug_mon to default
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:01 AM Bug #42964 (Resolved): monitor config store: Deleting logging config settings does not decrease l...
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:00 AM Bug #43903 (Resolved): osd segv in ceph::buffer::v14_2_0::ptr::release (PGTempMap::decode)
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
10:00 AM Bug #44052 (Resolved): ceph -s does not show >32bit pg states
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:58 AM Bug #44507 (Resolved): osd/PeeringState.cc: 5582: FAILED ceph_assert(ps->is_acting(osd_with_shard...
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:58 AM Backport #44842 (Resolved): octopus: nautilus: FAILED ceph_assert(head.version == 0 || e.version....
https://github.com/ceph/ceph/pull/34807 Nathan Cutler
09:58 AM Backport #44841 (Resolved): nautilus: nautilus: FAILED ceph_assert(head.version == 0 || e.version...
https://github.com/ceph/ceph/pull/34957 Nathan Cutler
09:57 AM Bug #44759 (Resolved): fast luminous -> nautilus -> octopus upgrade asserts out
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are ... Nathan Cutler
09:57 AM Backport #44836 (Resolved): octopus: librados mon_command (mgr) command hang
https://github.com/ceph/ceph/pull/34416 Nathan Cutler
09:57 AM Backport #44835 (Rejected): nautilus: librados mon_command (mgr) command hang
Nathan Cutler
08:31 AM Backport #44770: octopus: fast luminous -> nautilus -> octopus upgrade asserts out
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34204
m...
Nathan Cutler
08:30 AM Backport #44717: octopus: osd/PeeringState.cc: 5582: FAILED ceph_assert(ps->is_acting(osd_with_sh...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/34123
m...
Nathan Cutler
08:24 AM Backport #43257 (Resolved): mimic: monitor config store: Deleting logging config settings does no...
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/33327
m...
Nathan Cutler
08:23 AM Backport #42258 (Resolved): mimic: document new option mon_max_pg_per_osd
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/31875
m...
Nathan Cutler
08:22 AM Backport #43469: nautilus: asynchronous recovery + backfill might spin pg undersized for a long time
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/32849
m...
Nathan Cutler

03/30/2020

10:33 PM Backport #42168 (Resolved): nautilus: readable.sh test fails
This update was made using the script "backport-resolve-issue".
backport PR https://github.com/ceph/ceph/pull/30704
m...
Nathan Cutler
10:27 PM Bug #43807 (Pending Backport): osd-backfill-recovery-log.sh fails
Nathan, we need https://github.com/ceph/ceph/pull/34126 as well - See https://tracker.ceph.com/issues/43807#note-15 Neha Ojha
10:24 PM Bug #43807 (Resolved): osd-backfill-recovery-log.sh fails
... Nathan Cutler
02:42 PM Bug #44815: Pool stats increase after PG merged (PGMap::apply_incremental doesn't subtract stats ...
https://github.com/ceph/ceph/pull/34289 Aleksei Gutikov
02:30 PM Bug #44815 (Resolved): Pool stats increase after PG merged (PGMap::apply_incremental doesn't subt...
Pool stats like num_objects, num_bytes, increased after PGs were merged after manual set pg_num.
Steps to reproduc...
Aleksei Gutikov
01:33 PM Bug #43591: /sbin/fstrim can interfere with umount
/a/sage-2020-03-29_22:38:58-fs-wip-sage-testing-2020-03-29-0834-distro-basic-smithi/4902553 Sage Weil
01:24 PM Bug #44798 (Pending Backport): librados mon_command (mgr) command hang
Sage Weil
12:55 PM Bug #44184: Slow / Hanging Ops after pool creation
Fwiw on the cluster I'm seeing this on, I did set this flag after tidying the osd map (removed a couple of destroyed ... Jan Fajerski
11:20 AM Bug #44691 (New): mon/caps.sh fails with "Expected return 13, got 0"
Kefu Chai
 

Also available in: Atom