Project

General

Profile

Activity

From 02/22/2019 to 03/23/2019

03/23/2019

10:48 PM Backport #38901: mimic: Minor rados related documentation fixes
Remove "premerge" pg state which doesn't apply in mimic. David Zafman
09:13 PM Backport #38901 (Resolved): mimic: Minor rados related documentation fixes
https://github.com/ceph/ceph/pull/27188 Nathan Cutler
10:48 PM Backport #38902: luminous: Minor rados related documentation fixes
Remove "premerge" pg state which doesn't apply in luminous. David Zafman
09:13 PM Backport #38902 (Resolved): luminous: Minor rados related documentation fixes
https://github.com/ceph/ceph/pull/27185 Nathan Cutler
09:13 PM Backport #38906 (Resolved): nautilus: osd/PGLog.h: print olog_can_rollback_to before deciding to ...
https://github.com/ceph/ceph/pull/27302 Nathan Cutler
09:13 PM Backport #38905 (Resolved): luminous: osd/PGLog.h: print olog_can_rollback_to before deciding to ...
https://github.com/ceph/ceph/pull/27715 Nathan Cutler
09:13 PM Backport #38904 (Resolved): mimic: osd/PGLog.h: print olog_can_rollback_to before deciding to rol...
https://github.com/ceph/ceph/pull/27284 Nathan Cutler
09:13 PM Backport #38903 (Resolved): nautilus: Minor rados related documentation fixes
https://github.com/ceph/ceph/pull/27189 Nathan Cutler
09:13 PM Backport #38853 (In Progress): nautilus: .mgrstat failed to decode mgrstat state; luminous dev ve...
Nathan Cutler
05:41 PM Bug #38900 (New): EC pools don't self repair on client read error

When a replicated client read fails at the primary, it will pull the object from another OSD (see rep_repair_primar...
David Zafman
11:42 AM Documentation #38896 (Pending Backport): Minor rados related documentation fixes
Kefu Chai
12:22 AM Documentation #38896 (Resolved): Minor rados related documentation fixes

Document all pg states
Add auto repair items
"premerge" is not pg state in luminous nor mimic
David Zafman

03/22/2019

09:27 PM Bug #38845 (Resolved): mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous) whic...
Sage Weil
03:57 PM Bug #38845: mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous) which is >2 rel...
https://github.com/ceph/ceph/pull/27131 Yuri Weinstein
02:28 PM Bug #38845 (Fix Under Review): mon.a@-1(probing) e1 current monmap has min_mon_release 15 (lumino...
Neha Ojha
02:02 AM Bug #38845: mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous) which is >2 rel...
Brad Hubbard wrote:
> 15 - 15 !> 2 ?
https://github.com/ceph/ceph/pull/27107 should fix this.
Neha Ojha
12:10 AM Bug #38845: mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous) which is >2 rel...
15 - 15 !> 2 ? Brad Hubbard
09:05 PM Bug #38892: /ceph/src/tools/kvstore_tool.cc:266:1: internal compiler error: Segmentation fault
While I was looking into this I noticed this warning in the Jenkins output.... Brad Hubbard
04:46 PM Bug #38892 (Closed): /ceph/src/tools/kvstore_tool.cc:266:1: internal compiler error: Segmentation...
... Sebastian Wagner
07:12 PM Bug #38894 (Pending Backport): osd/PGLog.h: print olog_can_rollback_to before deciding to rollback
Neha Ojha
05:20 PM Bug #38894 (Resolved): osd/PGLog.h: print olog_can_rollback_to before deciding to rollback
This is important for debugging failures in merge_object_divergent_entries() before a decision to rollback is made. Neha Ojha
05:16 PM Bug #38893 (Resolved): RuntimeError: expected MON_CLOCK_SKEW but got none
... Neha Ojha
05:09 PM Cleanup #38635: bluestore: test osd_memory_target
https://github.com/ceph/ceph/pull/27083 - Merged
Will mark Pending Backport when Part-2 merges.
Neha Ojha
02:07 PM Bug #37766 (Resolved): rados_shutdown hang forever in ~objecter()
Nathan Cutler
02:06 PM Backport #38398 (Resolved): mimic: rados_shutdown hang forever in ~objecter()
Nathan Cutler
01:05 PM Backport #38881 (Resolved): nautilus: ENOENT in collection_move_rename on EC backfill target
https://github.com/ceph/ceph/pull/27654 Nathan Cutler
01:05 PM Backport #38880 (Resolved): luminous: ENOENT in collection_move_rename on EC backfill target
https://github.com/ceph/ceph/pull/28110 Nathan Cutler
01:04 PM Backport #38879 (Resolved): mimic: ENOENT in collection_move_rename on EC backfill target
https://github.com/ceph/ceph/pull/27943 Nathan Cutler
01:03 PM Backport #38873 (Resolved): luminous: Rados.get_fsid() returning bytes in python3
https://github.com/ceph/ceph/pull/27674 Nathan Cutler
01:03 PM Backport #38872 (Resolved): mimic: Rados.get_fsid() returning bytes in python3
https://github.com/ceph/ceph/pull/27259 Nathan Cutler
01:01 PM Backport #38860 (Resolved): nautilus: upmap broken the crush rule
https://github.com/ceph/ceph/pull/27225 Nathan Cutler
01:01 PM Backport #38859 (Resolved): luminous: upmap broken the crush rule
https://github.com/ceph/ceph/pull/27224 Nathan Cutler
01:01 PM Backport #38858 (Resolved): mimic: upmap broken the crush rule
https://github.com/ceph/ceph/pull/27257 Nathan Cutler
01:00 PM Backport #38857 (Resolved): luminous: should set EPOLLET flag on del_event()
https://github.com/ceph/ceph/pull/27226 Nathan Cutler
01:00 PM Backport #38856 (Resolved): mimic: should set EPOLLET flag on del_event()
https://github.com/ceph/ceph/pull/29250 Nathan Cutler
01:00 PM Backport #38854 (Resolved): luminous: .mgrstat failed to decode mgrstat state; luminous dev version?
https://github.com/ceph/ceph/pull/27207 Nathan Cutler
01:00 PM Backport #38853 (Resolved): nautilus: .mgrstat failed to decode mgrstat state; luminous dev version?
https://github.com/ceph/ceph/pull/27116 Nathan Cutler
01:00 PM Backport #38852 (Resolved): mimic: .mgrstat failed to decode mgrstat state; luminous dev version?
https://github.com/ceph/ceph/pull/29249 Nathan Cutler
11:05 AM Backport #38850: upgrade: 1 nautilus mon + 1 luminous mon can't automatically form quorum
Just to clarify slightly -- I know the upgrade instructions in the Nautilus release announcement say to "upgrade moni... Tim Serong
10:19 AM Backport #38850 (Resolved): upgrade: 1 nautilus mon + 1 luminous mon can't automatically form quorum
Seen while upgrading Luminous (12.2.10) to Nautilus (14.2.0). Three mon hosts, four osd hosts. The process was:
...
Tim Serong
09:30 AM Bug #38839: .mgrstat failed to decode mgrstat state; luminous dev version?
nautilus https://github.com/ceph/ceph/pull/27116 Sage Weil
09:30 AM Bug #38839 (Pending Backport): .mgrstat failed to decode mgrstat state; luminous dev version?
Sage Weil
07:37 AM Bug #38826 (Pending Backport): upmap broken the crush rule
Kefu Chai
01:33 AM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
It is possible that the crash we are seeing on osd.2 is due to 1:537949df:::20000a2c834.00000105:head incorrectly rol... Neha Ojha
01:05 AM Bug #38846: dump_pgstate_history doesn't really produce useful json output, needs an array around...
Probably be nice if it dumped the current state stack for each pg as well. Samuel Just

03/21/2019

11:06 PM Bug #38846 (Resolved): dump_pgstate_history doesn't really produce useful json output, needs an a...
... Samuel Just
08:42 PM Bug #38845 (Resolved): mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous) whic...

dzafman-2019-03-20_19:53:02-rados-wip-zafman-testing-distro-basic-smithi/3754307
rados/upgrade/luminous-x-single...
David Zafman
06:05 PM Bug #38841 (New): Objects degraded higher than 100%
1. Working Mimic or Nautilus deployment with Bluestore (haven't tested with Filestore)
2. All OSDs up, all PGs activ...
Simon Ironside
05:29 PM Bug #38840 (Resolved): snaps missing in mapper, should be: ca was r -2...repaired

dzafman-2019-03-20_19:53:02-rados-wip-zafman-testing-distro-basic-smithi/3754443
This looks like a cache tier ev...
David Zafman
04:59 PM Bug #38839 (Fix Under Review): .mgrstat failed to decode mgrstat state; luminous dev version?
https://github.com/ceph/ceph/pull/27101 Sage Weil
04:57 PM Bug #38839 (Resolved): .mgrstat failed to decode mgrstat state; luminous dev version?
... Sage Weil
02:26 AM Backport #38719 (In Progress): luminous: crush: choose_args array size mis-sized when weight-sets...
https://github.com/ceph/ceph/pull/27085 Prashant D
01:38 AM Cleanup #38635 (In Progress): bluestore: test osd_memory_target
https://github.com/ceph/ceph/pull/27083 Neha Ojha
01:31 AM Backport #38720 (In Progress): mimic: crush: choose_args array size mis-sized when weight-sets ar...
https://github.com/ceph/ceph/pull/27082 Prashant D

03/20/2019

10:50 PM Bug #26971: failed to become clean before timeout expired

I'm not sure what this means, but pg 1.0 (size 3) needs to pick another one of the 2 remaining OSDs (4 OSDs in) to ...
David Zafman
12:05 PM Bug #38582: Pool storage MAX AVAIL reduction seems higher when single OSD reweight is done
Sorry for the delay. Attaching the required.
osd 155 is the OSD mentioned in description. The one which was manually...
Nokia ceph-users
11:51 AM Bug #38381 (Pending Backport): Rados.get_fsid() returning bytes in python3
Kefu Chai
11:40 AM Bug #38827 (In Progress): valgrind: UninitCondition in ceph::crypto::onwire::AES128GCM_OnWireRxHa...
Radoslaw Zarzynski
11:24 AM Bug #38827: valgrind: UninitCondition in ceph::crypto::onwire::AES128GCM_OnWireRxHandler::authent...
the test branch contains https://github.com/ceph/ceph/pull/27012 Kefu Chai
11:21 AM Bug #38827 (Resolved): valgrind: UninitCondition in ceph::crypto::onwire::AES128GCM_OnWireRxHandl...
... Kefu Chai
11:27 AM Bug #38828 (Resolved): should set EPOLLET flag on del_event()
Kefu Chai
10:41 AM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
As requested.
osd.0: ceph-post-file: 17efe900-501c-479f-ba56-dd29fef18c58
osd.4: ceph-post-file: ff22f830-e6bc-4f...
Grant Slater
12:36 AM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
Hi Grant,
Looking at the logs, it seems that the first crash was seen on osd.2 on pg id 1.cas2...
Neha Ojha
08:27 AM Bug #38826: upmap broken the crush rule
Here is the crush rule... huang jun
08:24 AM Bug #38826 (Resolved): upmap broken the crush rule
I setup a cluster and want to specify the primary osds through crush rule.
Here is the test script...
huang jun
03:14 AM Backport #38275 (In Progress): mimic: Fix recovery and backfill priority handling
David Zafman
12:43 AM Backport #38244 (Resolved): luminous: scrub warning check incorrectly uses mon scrub interval
David Zafman
12:43 AM Backport #38274 (Resolved): luminous: Fix recovery and backfill priority handling
David Zafman

03/19/2019

11:30 PM Bug #36739 (Pending Backport): ENOENT in collection_move_rename on EC backfill target
Neha Ojha
08:38 PM Backport #38398: mimic: rados_shutdown hang forever in ~objecter()
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26583
merged
Yuri Weinstein

03/18/2019

06:55 PM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
Err. I believe I mixed up two different bugs, please disregard my previous comment. I don't currently recall what I ... Martin Millnert
06:52 PM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
For completeness: The root cause for the crashes I experienced were that I had oversized RADOS objects (2-10GB, max ... Martin Millnert
02:22 PM Bug #38124: OSD down on snaptrim.
Hello any updates about this? Erikas Kučinskis
06:35 AM Bug #38793 (New): data inconsistent
I did some test on rbd snap, and found data inconsistent.
cluster status:...
hongpeng lu

03/17/2019

10:21 PM Bug #38787 (Fix Under Review): osd: cache tiering flush clone wrongly
Patrick Donnelly
02:38 AM Bug #38787 (Fix Under Review): osd: cache tiering flush clone wrongly
because cephfs file snapcontext seq may start from 1, we find that in a never snaped fs,
the flush of file will dele...
Zengran Zhang
07:21 PM Bug #38294 (Resolved): osd/PG.cc: 6141: FAILED ceph_assert(info.history.same_interval_since != 0)...
Sage Weil
10:01 AM Bug #38294 (Fix Under Review): osd/PG.cc: 6141: FAILED ceph_assert(info.history.same_interval_sin...
https://github.com/ceph/ceph/pull/27018 Sage Weil
09:57 AM Bug #38294 (In Progress): osd/PG.cc: 6141: FAILED ceph_assert(info.history.same_interval_since !=...
/a/sage-2019-03-17_00:28:04-upgrade:luminous-x-wip-sage4-testing-2019-03-16-1713-distro-basic-smithi/3737326
pg 1....
Sage Weil
12:10 AM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
... Sage Weil

03/16/2019

11:20 PM Bug #21174: OSD crash: 903: FAILED assert(objiter->second->version > last_divergent_update)
I have a similar issue with OSDs dropping out:... Grant Slater
06:33 PM Bug #38786 (Resolved): autoscale down can lead to max_pg_per_osd limit
we adjust pgp_num all the way down to the target, which can make osds hit the max_pgs_per_osd if it's going too far.
...
Sage Weil

03/15/2019

09:45 PM Bug #38623 (Resolved): 2.8s2 past_intervals [6539,6541) start interval does not contain the requi...
Sage Weil
08:31 PM Bug #38655 (Resolved): osd: missing, size mismatch, snap mapper errors
Sage Weil
06:11 PM Bug #36739: ENOENT in collection_move_rename on EC backfill target
https://github.com/ceph/ceph/pull/26996 is a more complete fix for this issue. Neha Ojha
06:06 PM Bug #38784 (Resolved): osd: FAILED ceph_assert(attrs || !pg_log.get_missing().is_missing(soid) ||...
... Neha Ojha
05:08 PM Bug #38746 (Resolved): msgr2 leaking buffers
https://github.com/ceph/ceph/pull/26965 Sage Weil
03:20 AM Bug #38746: msgr2 leaking buffers
hmm it happens on some osds but not others.
i added to rxbuf and txbuf lengths to the dout prefix and got this
...
Sage Weil
03:01 AM Bug #38746 (Resolved): msgr2 leaking buffers
osds with bluestore consume too much ram (seeing 20GB on sepia)
to reproduce with vstart, watch bin/ceph daemon os...
Sage Weil
05:03 PM Bug #38783 (New): Changing mon_pg_warn_max_object_skew has no effect.
... Andrew Mitroshin
03:20 PM Documentation #38051 (Resolved): doc/rados/configuration: refresh osdmap section
Nathan Cutler
03:19 PM Backport #38095 (Resolved): luminous: doc/rados/configuration: refresh osdmap section
Nathan Cutler
12:13 PM Bug #38762 (New): Ubuntu/Debian repo has incorrect InRelease
On Ubuntu Bionic trying to update repo package I got error:
E: Failed to fetch https://download.ceph.com/debian-mi...
Alexander Sytar
08:59 AM Backport #38751 (Resolved): mimic: should report EINVAL in ErasureCode::parse() if m<=0
https://github.com/ceph/ceph/pull/28995 Nathan Cutler
08:58 AM Backport #38750 (Resolved): luminous: should report EINVAL in ErasureCode::parse() if m<=0
https://github.com/ceph/ceph/pull/28111 Nathan Cutler

03/14/2019

04:45 PM Feature #38616: Improvements to auto repair

I don't think we need to set "failed_repair" if primary can't recover itself on a read error. We are already setti...
David Zafman
02:46 PM Feature #38616 (In Progress): Improvements to auto repair
David Zafman
04:17 PM Bug #38345: mon: segv in MonOpRequest::~MonOpRequest OpHistory::cleanup
Forgot mention, the op appears to be an MForward. Sage Weil
11:58 AM Bug #38682 (Pending Backport): should report EINVAL in ErasureCode::parse() if m<=0
Sage Weil
12:53 AM Cleanup #38635: bluestore: test osd_memory_target
Part 1: Test with a value of osd_memory_target lesser than the default, maybe half or less than that. This can be don... Neha Ojha

03/13/2019

11:03 PM Bug #38345: mon: segv in MonOpRequest::~MonOpRequest OpHistory::cleanup
... Sage Weil
10:55 PM Bug #38345: mon: segv in MonOpRequest::~MonOpRequest OpHistory::cleanup
... Sage Weil
04:29 PM Bug #38724: _txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1...
ceph-post-file: 26dab2cb-36c9-40de-8455-1379406477e8
Sage Weil
04:29 PM Bug #38724 (Resolved): _txc_add_transaction error (39) Directory not empty not handled on operati...
... Sage Weil
03:08 PM Bug #38631 (Resolved): osd-scrub-repair.sh fails due to num_objects wrong
David Zafman
03:05 PM Bug #38678 (Resolved): Minor cleanups in tests and log output
David Zafman
01:16 PM Bug #38705 (Resolved): mgr: segv in module thread, PyArg_ParseTuple
Sage Weil
12:03 PM Backport #38720 (Resolved): mimic: crush: choose_args array size mis-sized when weight-sets are e...
https://github.com/ceph/ceph/pull/27082 Nathan Cutler
12:03 PM Backport #38719 (Resolved): luminous: crush: choose_args array size mis-sized when weight-sets ar...
https://github.com/ceph/ceph/pull/27085 Nathan Cutler
11:56 AM Bug #38664 (Pending Backport): crush: choose_args array size mis-sized when weight-sets are enabled
Sage Weil
11:56 AM Bug #38703 (Resolved): lazy omap stats aren't incorportaed into pg_autoscaler size value
Sage Weil
11:55 AM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
/a/sage-2019-03-13_02:19:41-rados-wip-sage3-testing-2019-03-12-1657-distro-basic-smithi/3715202 Sage Weil
11:54 AM Bug #20086: LibRadosLockECPP.LockSharedDurPP gets EEXIST
... Sage Weil
11:52 AM Bug #38718 (New): 'osd crush weight-set create-compat' (and other OSDMonitor commands) can leak u...
... Sage Weil
10:55 AM Backport #38506 (Resolved): luminous: ENOENT on setattrs (obj was recently deleted)
Nathan Cutler
10:39 AM Bug #38258 (Resolved): filestore: fsync(2) return value not checked
Nathan Cutler
10:38 AM Backport #38316 (Resolved): luminous: filestore: fsync(2) return value not checked
Nathan Cutler
04:40 AM Backport #38423 (Resolved): luminous: osd/TestPGLog.cc: Verify that dup_index is being trimmed
Brad Hubbard

03/12/2019

09:46 PM Bug #38705 (Fix Under Review): mgr: segv in module thread, PyArg_ParseTuple
https://github.com/ceph/ceph/pull/26920 Sage Weil
07:30 PM Bug #38705: mgr: segv in module thread, PyArg_ParseTuple
appear to happen during standby. also, i see an ignored monmap message:... Sage Weil
07:28 PM Bug #38705: mgr: segv in module thread, PyArg_ParseTuple
lots of these failures. module varies (i've seen dashboard, prometheus so far) Sage Weil
07:11 PM Bug #38705 (Resolved): mgr: segv in module thread, PyArg_ParseTuple
... Sage Weil
07:52 PM Backport #38506: luminous: ENOENT on setattrs (obj was recently deleted)
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26706
merged
Yuri Weinstein
07:51 PM Backport #38316: luminous: filestore: fsync(2) return value not checked
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26871
merged
Yuri Weinstein
06:40 PM Bug #38159: ec does not recover below min_size
We coudl perhaps point the finger at the min_size choice:... Sage Weil
06:39 PM Bug #38159: ec does not recover below min_size
... Sage Weil
05:43 PM Bug #38703 (Fix Under Review): lazy omap stats aren't incorportaed into pg_autoscaler size value
https://github.com/ceph/ceph/pull/26917 Sage Weil
04:42 PM Bug #38703 (Resolved): lazy omap stats aren't incorportaed into pg_autoscaler size value
on lab cluster,... Sage Weil
03:49 PM Bug #38579: osd: should not mark cluster_messenger when commited new osdmap
Ok, I think I understand. This would noramlly trigger a RESETSESSION in the v1 protocol because the primary's connec... Sage Weil
02:21 PM Backport #38162 (Resolved): luminous: maybe_remove_pg_upmaps incorrectly cancels valid pending up...
Nathan Cutler
02:11 PM Bug #37797 (Resolved): radosbench tests hit ENOSPC
Nathan Cutler
02:11 PM Backport #38240 (Resolved): luminous: radosbench tests hit ENOSPC
Nathan Cutler
02:09 PM Backport #38400 (Resolved): luminous: rados_shutdown hang forever in ~objecter()
Nathan Cutler
02:01 PM Bug #38682 (Fix Under Review): should report EINVAL in ErasureCode::parse() if m<=0
https://github.com/ceph/ceph/pull/26894
Sage Weil
01:36 PM Bug #38682: should report EINVAL in ErasureCode::parse() if m<=0
I agree that m=0 is useless--it's no better than num_rep=1... just a (much) more complicated code path and more corne... Sage Weil
06:28 AM Bug #38682: should report EINVAL in ErasureCode::parse() if m<=0
i think we can even go further -- to prevent user from creating a profile with m=0. technically, it's correct. but pr... Kefu Chai
06:24 AM Bug #38682 (Resolved): should report EINVAL in ErasureCode::parse() if m<=0
... Kefu Chai
03:12 AM Backport #38566 (In Progress): mimic: osd_recovery_priority is not documented (but osd_recovery_o...
https://github.com/ceph/ceph/pull/26901 Prashant D

03/11/2019

11:57 PM Bug #38631 (Duplicate): osd-scrub-repair.sh fails due to num_objects wrong
David Zafman
09:39 PM Bug #38631: osd-scrub-repair.sh fails due to num_objects wrong
I'm going to remove create_rbd_pool because it isn't used anyway. David Zafman
06:55 PM Bug #38631: osd-scrub-repair.sh fails due to num_objects wrong

Reopening to use for root cause fix. This tracker should also revert 10b9626ea7b.
-The commit comment is wrong,...
David Zafman
11:53 PM Bug #38678 (Fix Under Review): Minor cleanups in tests and log output
David Zafman
04:57 PM Bug #38678 (Resolved): Minor cleanups in tests and log output
David Zafman
10:38 PM Bug #38655 (Fix Under Review): osd: missing, size mismatch, snap mapper errors
https://github.com/ceph/ceph/pull/26898 Sage Weil
05:24 PM Bug #38655: osd: missing, size mismatch, snap mapper errors
The problem originates with 2.23, a merge source. It is instantiated on osd.5 with info... Sage Weil
09:10 PM Bug #38582: Pool storage MAX AVAIL reduction seems higher when single OSD reweight is done
That does seem odd. Can you attach your crush map, "ceph osd tree", and "ceph osd dump" to this ticket? Greg Farnum
07:44 PM Backport #38162: luminous: maybe_remove_pg_upmaps incorrectly cancels valid pending upmaps
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26127
merged
Yuri Weinstein
07:42 PM Backport #38240: luminous: radosbench tests hit ENOSPC
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26355
merged
Yuri Weinstein
07:41 PM Backport #38400: luminous: rados_shutdown hang forever in ~objecter()
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26579
merged
Yuri Weinstein
07:30 PM Bug #24613: luminous: rest/test.py fails with expected 200, got 400
/a/yuriw-2019-03-06_22:09:13-rados-wip-yuri4-testing-2019-03-04-2231-luminous-distro-basic-smithi/3675478/ Neha Ojha
05:21 PM Bug #38633: /ceph/src/tools/kvstore_tool.cc:39:1: internal compiler error: Segmentation fault on ...
https://github.com/ceph/ceph/pull/26828 Alfredo Deza
02:00 PM Bug #38664: crush: choose_args array size mis-sized when weight-sets are enabled
https://github.com/ceph/ceph/pull/26886 Sage Weil
01:59 PM Bug #38664 (Resolved): crush: choose_args array size mis-sized when weight-sets are enabled
simple reproducer on vstart:... Sage Weil

03/10/2019

08:58 PM Bug #38656 (New): scrub reservation leak?
/a/sage-2019-03-10_18:54:11-rados-wip-sage2-testing-2019-03-10-1053-distro-basic-smithi/3705804
pg 1.0 scrub does ...
Sage Weil
04:01 PM Bug #38655 (Resolved): osd: missing, size mismatch, snap mapper errors
... Sage Weil
03:57 PM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
/a/sage-2019-03-10_01:08:05-rados-master-distro-basic-smithi/3703837
description: rados/thrash/{0-size-min-size-ov...
Sage Weil

03/09/2019

07:23 PM Bug #38631 (Resolved): osd-scrub-repair.sh fails due to num_objects wrong
Sage Weil
05:00 PM Bug #38633 (Resolved): /ceph/src/tools/kvstore_tool.cc:39:1: internal compiler error: Segmentatio...
Sage Weil
01:53 PM Bug #38579: osd: should not mark cluster_messenger when commited new osdmap
Sage Weil wrote:
> So in step 5, the primary hasn't seen osdmap 20, right? Only the replica has? The part I don't ...
bing lin
12:54 AM Feature #38653: Enhance health message when pool quota fills up
Greg Farnum
12:27 AM Backport #38316 (In Progress): luminous: filestore: fsync(2) return value not checked
https://github.com/ceph/ceph/pull/26871 Neha Ojha

03/08/2019

11:34 PM Feature #38653 (In Progress): Enhance health message when pool quota fills up
Greg Farnum
11:00 PM Feature #38653 (New): Enhance health message when pool quota fills up
https://bugzilla.redhat.com/show_bug.cgi?id=1481306... Greg Farnum
08:46 PM Feature #22147 (In Progress): Set multiple flags in a single command line
https://github.com/ceph/ceph/pull/26785 Neha Ojha
08:09 PM Backport #38646 (In Progress): mimic: OpTracker destruct assert when OSD destruct
Ashish Singh
02:46 PM Backport #38646 (Resolved): mimic: OpTracker destruct assert when OSD destruct
https://github.com/ceph/ceph/pull/26862 Nathan Cutler
03:00 PM Bug #38649 (Can't reproduce): [ERR] full status failsafe engaged, dropping updates, now -21474836...
/a/sage-2019-03-08_07:14:13-rados-wip-sage2-testing-2019-03-07-2213-distro-basic-smithi/3682171
Sage Weil
02:48 PM Bug #38377: OpTracker destruct assert when OSD destruct
master is still being merged into nautilus AFAICT Nathan Cutler
04:12 AM Bug #38377 (Pending Backport): OpTracker destruct assert when OSD destruct
Sage Weil
01:15 PM Bug #38579 (Need More Info): osd: should not mark cluster_messenger when commited new osdmap
So in step 5, the primary hasn't seen osdmap 20, right? Only the replica has? The part I don't understand is that i... Sage Weil
10:37 AM Backport #38610: luminous: mon: osdmap prune
https://github.com/ceph/ceph/pull/26834 Rafal Wadolowski
10:36 AM Backport #38561 (In Progress): mimic: mgr deadlock
https://github.com/ceph/ceph/pull/26833 Prashant D
08:19 AM Bug #38124: OSD down on snaptrim.
Hello,
any updates regarding this bug? I would love a patch to resolve this issue ASAP. One of my monitors just...
Darius Kasparavičius
08:12 AM Bug #38307 (Resolved): ceph-osd fails to bind to IPv6 interface for public_network
The PR https://github.com/ceph/ceph/pull/26692 enforces pick_addresses to fail when ms_bind_ipv4 and ms_bind_ipv6 opt... Ricardo Dias
04:56 AM Bug #38633 (Fix Under Review): /ceph/src/tools/kvstore_tool.cc:39:1: internal compiler error: Seg...
Brad Hubbard
01:18 AM Bug #38633 (Resolved): /ceph/src/tools/kvstore_tool.cc:39:1: internal compiler error: Segmentatio...
a1539b118ed6372c19f321c94e2246f4fd130a33... Brad Hubbard
04:26 AM Backport #38562 (In Progress): luminous: mgr deadlock
https://github.com/ceph/ceph/pull/26830 Prashant D
04:13 AM Bug #38598 (Resolved): osdmap may include only v1 address while osd binds to v2; mon drops messages
Sage Weil
04:03 AM Subtask #37732: qa/suites/rados/thrash-erasure-code*: coverage review tasks
https://github.com/ceph/ceph/pull/26417
Addresses
- Leveldb mons no longer relevant
- Fast-read could be added t...
Neha Ojha
03:57 AM Cleanup #38635: bluestore: test osd_memory_target
We want to test with different values of osd_memory_target.
Also, create tests that necessarily go beyond the osd_me...
Neha Ojha
03:50 AM Cleanup #38635 (In Progress): bluestore: test osd_memory_target
Neha Ojha

03/07/2019

10:26 PM Bug #38358: short pg log + cache tier ceph_test_rados out of order reply
/a/yuriw-2019-03-07_00:04:47-rados-wip_yuri_nautilus_3.6.19-distro-basic-smithi/3675857/ Neha Ojha
09:36 PM Bug #38631 (Resolved): osd-scrub-repair.sh fails due to num_objects wrong

A status check for "1/1 objects unfound" is coming back as "1/2 objects unfound"
Can be reproduced easily with:
...
David Zafman
03:03 PM Bug #36546 (Duplicate): common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_list.back...
Sage Weil
03:02 PM Bug #38592 (Duplicate): mon,osd: src/common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_fli...
Sage Weil
02:56 PM Bug #38623 (Fix Under Review): 2.8s2 past_intervals [6539,6541) start interval does not contain t...
https://github.com/ceph/ceph/pull/26822 Sage Weil
12:25 PM Bug #38623 (Resolved): 2.8s2 past_intervals [6539,6541) start interval does not contain the requi...
... Sage Weil
01:52 PM Bug #38579: osd: should not mark cluster_messenger when commited new osdmap
Greg Farnum wrote:
> Do you have a reproducer for this?
>
> I get nervous when people want to remove mark_down ca...
bing lin
12:56 PM Bug #38604 (Resolved): mon logs not getting reopened after rotation
Sage Weil
12:47 PM Bug #38624 (New): crush: get_rule_weight_osd_map does not handle multi-take rules
CrushWrapper::get_rule_weight_osd_map() does not handle multi-take rules well. for example, a take 1 (primary) and t... Sage Weil
07:55 AM Backport #38565 (In Progress): mimic: Code to strip | from core pattern isn't right
Ashish Singh
06:43 AM Feature #38603: mon: osdmap prune
@Nathan, I developed and tested code, I will open PR in the next couple of days soon. Please assign this to me :) Rafal Wadolowski
02:12 AM Feature #38616: Improvements to auto repair

OSD stats might have to be in meta collection
David Zafman
01:29 AM Feature #38617 (Resolved): osd: Better error message when OSD count is less than osd_pool_default...
Clearly indicate when number of OSDs is less than osd_pool_default_size, to avoid users from setting up clusters inco... Neha Ojha

03/06/2019

10:38 PM Feature #38616 (Resolved): Improvements to auto repair

We should allow auto repair for bluestore pools since it has built in checksums. Currently, we are limited to er...
David Zafman
10:18 PM Feature #38458: Ceph does not have command to show current osd primary-affinity
So this is dumped as part of the osdmap output, but you want a way to see it for a particular OSD? Do we have any out... Greg Farnum
10:13 PM Bug #38579: osd: should not mark cluster_messenger when commited new osdmap
Do you have a reproducer for this?
I get nervous when people want to remove mark_down calls, as they are generally...
Greg Farnum
07:50 PM Bug #38604 (Fix Under Review): mon logs not getting reopened after rotation
aha, ceph-mgr and ceph-mds expliiclty set the thread name on startup.
https://github.com/ceph/ceph/pull/26797
Sage Weil
07:40 PM Bug #38604: mon logs not getting reopened after rotation
this appears to be because of /proc/$pid/stat. before,... Sage Weil
01:00 PM Bug #38604 (Resolved): mon logs not getting reopened after rotation
... Sage Weil
07:42 PM Bug #38219: rebuild-mondb hangs
rados:singleton/{all/rebuild-mondb.yaml msgr-failures/many.yaml msgr/async.yaml objectstore/bluestore-bitmap.yaml rad... Neha Ojha
06:40 PM Bug #38598 (Fix Under Review): osdmap may include only v1 address while osd binds to v2; mon drop...
Neha Ojha
03:10 AM Bug #38598: osdmap may include only v1 address while osd binds to v2; mon drops messages
Proposed OSD fix:
- if we get an osdmap with require_osd_release < nautilus, and are bound to v2+v1, we rebind to ...
Sage Weil
03:08 AM Bug #38598 (Resolved): osdmap may include only v1 address while osd binds to v2; mon drops messages
- osd binds to v2+v1
- osd sends osd_boot to mon
- mon adds v1 addr to osdmap only (due to require_osd_release < na...
Sage Weil
06:05 PM Bug #38555: scrub error on ec pg, got 6579891/0 or 7569408/6832128 bytes
"2019-03-06 15:21:41.756014 osd.5 (osd.5) 287 : cluster [ERR] 2.2s0 scrub : stat mismatch, got 2/2 objects, 1/1 clone... Sage Weil
05:59 PM Backport #38610 (Need More Info): luminous: mon: osdmap prune
Feature backport assumed to be non-trivial. Assigning to Joao, author of the feature, for now. Nathan Cutler
05:58 PM Backport #38610 (Rejected): luminous: mon: osdmap prune
https://github.com/ceph/ceph/pull/26834 Nathan Cutler
05:58 PM Feature #38603 (Pending Backport): mon: osdmap prune
Nathan Cutler
10:18 AM Feature #38603 (Resolved): mon: osdmap prune
Tracker to enable backport of this feature to luminous:
https://github.com/ceph/ceph/pull/19331
Rafal Wadolowski
04:54 PM Backport #38274 (In Progress): luminous: Fix recovery and backfill priority handling
David Zafman
02:30 AM Bug #38592: mon,osd: src/common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_flight_list.bac...
Is this related to http://tracker.ceph.com/issues/36546? Neha Ojha

03/05/2019

11:17 PM Bug #26971: failed to become clean before timeout expired
Reproduced on master in 1 of 10 duplicate runs:
dzafman-2019-03-05_10:43:39-rados:thrash-master-distro-basic-smith...
David Zafman
07:01 PM Bug #26971: failed to become clean before timeout expired
Seen recently in luminous.
yuriw-2019-02-28_14:42:05-rados-wip-yuri4-testing-2019-02-27-2159-luminous-distro-basi...
David Zafman
10:37 PM Bug #38484 (Resolved): osd: InvalidRead, PG use-after-free putting ref
Neha Ojha
10:09 PM Bug #38525 (Resolved): qa/standalone/osd/pg-split-merge.sh fails
Neha Ojha
08:48 PM Bug #38594 (New): mimic: common/Mutex.cc: 110: FAILED assert(r == 0) in powercycle
... Neha Ojha
08:22 PM Bug #38592 (Duplicate): mon,osd: src/common/TrackedOp.cc: 163: FAILED ceph_assert((sharded_in_fli...
... Sage Weil
06:03 PM Bug #38499 (Need More Info): ceph-mon segfaults at startup
Nathan Cutler
03:12 PM Feature #21073 (Resolved): mgr: ceph/rgw: show hostnames and ports in ceph -s status output
Sage Weil
10:51 AM Bug #38582 (New): Pool storage MAX AVAIL reduction seems higher when single OSD reweight is done

i have a 5 node ceph 11.2.0 cluster with 335 osds. Each OSD is a 4TB HDD. It has one EC 4+1 pool.
Due to high st...
Nokia ceph-users
03:28 AM Backport #38511 (In Progress): mimic: ceph CLI ability to change file ownership
https://github.com/ceph/ceph/pull/26760 Prashant D
01:52 AM Backport #38510 (In Progress): luminous: ceph CLI ability to change file ownership
https://github.com/ceph/ceph/pull/26758 Prashant D
01:49 AM Backport #38507 (In Progress): mimic: ENOENT on setattrs (obj was recently deleted)
https://github.com/ceph/ceph/pull/26709 Prashant D
01:36 AM Bug #38579 (Need More Info): osd: should not mark cluster_messenger when commited new osdmap

when we run some fault test in Luminous 12.2.10, got coredump like ...
bing lin

03/04/2019

10:14 PM Support #38475: PG stuck in creating state
Support tickets will get a lot more eyes if you email the issue to ceph-users. :) Greg Farnum
10:07 PM Bug #38499: ceph-mon segfaults at startup
This must be running the labs or something, where's a log Abhi? Greg Farnum
09:23 PM Bug #37808: osd: osdmap cache weak_refs assert during shutdown
Argh, I can't find it now but I'm pretty sure I saw a PR go by that purported to fix this. The claimed issue is that ... Greg Farnum
05:36 PM Bug #37808: osd: osdmap cache weak_refs assert during shutdown
the rgw/multisite suite has been reproducing this reliably - probably because it runs with 'wait-for-scrub: false' Casey Bodley
03:14 PM Bug #38484 (Fix Under Review): osd: InvalidRead, PG use-after-free putting ref
https://github.com/ceph/ceph/pull/26742 Sage Weil
02:31 PM Bug #38219: rebuild-mondb hangs
/a/sage-2019-03-03_23:01:07-rados-wip-sage3-testing-2019-03-03-1043-distro-basic-smithi/3664297
Sage Weil
12:14 PM Backport #38567 (Resolved): luminous: osd_recovery_priority is not documented (but osd_recovery_o...
https://github.com/ceph/ceph/pull/27471 Nathan Cutler
12:14 PM Backport #38566 (Resolved): mimic: osd_recovery_priority is not documented (but osd_recovery_op_p...
https://github.com/ceph/ceph/pull/26901 Nathan Cutler
12:13 PM Backport #38565 (Resolved): mimic: Code to strip | from core pattern isn't right
https://github.com/ceph/ceph/pull/26811 Nathan Cutler
12:12 PM Backport #38562 (Resolved): luminous: mgr deadlock
https://github.com/ceph/ceph/pull/26830 Nathan Cutler
12:12 PM Backport #38561 (Resolved): mimic: mgr deadlock
https://github.com/ceph/ceph/pull/26833 Nathan Cutler
10:51 AM Bug #38322: luminous: mons do not trim maps until restarted
seen this issue with 10.2.4 Swami Reddy

03/03/2019

04:52 PM Documentation #38558 (New): doc: osd [test-]reweight-by-utilization is not properly documented in...
Looks like:... Марк Коренберг
02:11 AM Bug #38537 (Pending Backport): mgr deadlock
Kefu Chai

03/02/2019

02:31 PM Bug #38484: osd: InvalidRead, PG use-after-free putting ref
/a/sage-2019-03-02_01:13:07-rados-wip-sage2-testing-2019-03-01-1553-distro-basic-smithi/3656299 Sage Weil
02:29 PM Bug #38555 (Can't reproduce): scrub error on ec pg, got 6579891/0 or 7569408/6832128 bytes
... Sage Weil
01:53 AM Bug #38525 (Fix Under Review): qa/standalone/osd/pg-split-merge.sh fails
Neha Ojha
01:45 AM Documentation #23999 (Pending Backport): osd_recovery_priority is not documented (but osd_recover...
Neha Ojha
01:33 AM Backport #38552 (Resolved): mimic: core: lazy omap stat collection
https://github.com/ceph/ceph/pull/29189 Brad Hubbard
01:33 AM Backport #38551 (Resolved): luminous: core: lazy omap stat collection
https://github.com/ceph/ceph/pull/29190 Brad Hubbard

03/01/2019

11:06 PM Bug #23875 (Need More Info): Removal of snapshot with corrupt replica crashes osd
David Zafman
11:04 PM Bug #38325 (Pending Backport): Code to strip | from core pattern isn't right
David Zafman
09:46 PM Bug #38484 (Can't reproduce): osd: InvalidRead, PG use-after-free putting ref
i think i must have mixed up my test branches or something. i can't reproduce this. Sage Weil
09:45 PM Bug #38483 (In Progress): FAILED ceph_assert(p != pg_slots.end()) in OSDShard::register_and_wake_...
Sage Weil
06:22 PM Feature #38550: osd: Implement lazy omap usage statistics per osd
From - https://tracker.ceph.com/issues/38136 Vikhyat Umrao
06:21 PM Feature #38550 (Duplicate): osd: Implement lazy omap usage statistics per osd
This https://github.com/ceph/ceph/pull/26614 implements per pg and it would be good to summarize them per osd. Vikhyat Umrao
06:02 PM Feature #38136 (Pending Backport): core: lazy omap stat collection
Vikhyat Umrao
05:01 PM Bug #38537 (Fix Under Review): mgr deadlock
https://github.com/ceph/ceph/pull/26723 Sage Weil
02:29 PM Bug #38537 (Resolved): mgr deadlock
... Sage Weil
03:40 PM Backport #38507 (New): mimic: ENOENT on setattrs (obj was recently deleted)
Nathan Cutler
04:13 AM Backport #38507 (In Progress): mimic: ENOENT on setattrs (obj was recently deleted)
-https://github.com/ceph/ceph/pull/26708- Prashant D
03:28 PM Bug #36306 (Resolved): monstore tool rebuild does not generate creating_pgs
Nathan Cutler
03:27 PM Backport #36434 (Resolved): luminous: monstore tool rebuild does not generate creating_pgs
Nathan Cutler
03:26 PM Bug #36497 (Resolved): FAILED ceph_assert(can_write == WriteStatus::NOWRITE) in ProtocolV1::repla...
Nathan Cutler
03:26 PM Backport #37905 (Resolved): luminous: FAILED ceph_assert(can_write == WriteStatus::NOWRITE) in Pr...
Nathan Cutler
03:26 PM Backport #37904 (Resolved): mimic: FAILED ceph_assert(can_write == WriteStatus::NOWRITE) in Proto...
Nathan Cutler
03:25 PM Bug #24676 (Resolved): FreeBSD/Linux integration - monitor map with wrong sa_family
Nathan Cutler
03:24 PM Backport #37972 (Resolved): luminous: FreeBSD/Linux integration - monitor map with wrong sa_family
Nathan Cutler
02:40 PM Documentation #23999: osd_recovery_priority is not documented (but osd_recovery_op_priority is)
https://github.com/ceph/ceph/pull/26705/commits/9475acb9805abeb6ab631df912cdbce0a7f34d3d Neha Ojha
10:29 AM Bug #38053 (Resolved): Add hashinfo testing for dump command of ceph-objectstore-tool
Nathan Cutler
10:28 AM Backport #38140 (Resolved): luminous: Add hashinfo testing for dump command of ceph-objectstore-tool
Nathan Cutler
10:28 AM Backport #38141 (Resolved): mimic: Add hashinfo testing for dump command of ceph-objectstore-tool
Nathan Cutler
10:14 AM Bug #38295 (Resolved): luminous->(mimic,nautilus): PGMapDigest decode error on luminous end
Nathan Cutler
10:14 AM Backport #38342 (Resolved): mimic: luminous->(mimic,nautilus): PGMapDigest decode error on lumino...
Nathan Cutler
05:28 AM Bug #38525: qa/standalone/osd/pg-split-merge.sh fails
looks like the test is broken. we aren't reliably making a gap, so it would usually pass for the wrong reason. Sage Weil
05:27 AM Bug #38525 (Resolved): qa/standalone/osd/pg-split-merge.sh fails
... Sage Weil
03:10 AM Bug #38077: Marking all OSDs as "out" does not trigger a HEALTH_ERR state
Hi,I don't know whether my opinion is right or not, but I think the status should be HEALTH_WARN when OSDs being mark... richael zhuang
02:24 AM Backport #38506 (In Progress): luminous: ENOENT on setattrs (obj was recently deleted)
https://github.com/ceph/ceph/pull/26706 Prashant D
12:10 AM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
/a/sage-2019-02-28_12:30:17-rados-wip-sage-testing-2019-02-27-1720-distro-basic-smithi/3649931
description: rados/...
Sage Weil

02/28/2019

07:35 PM Backport #36434: luminous: monstore tool rebuild does not generate creating_pgs
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25825
merged
Yuri Weinstein
07:29 PM Backport #37905: luminous: FAILED ceph_assert(can_write == WriteStatus::NOWRITE) in ProtocolV1::r...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25956
merged
Yuri Weinstein
07:24 PM Backport #37972: luminous: FreeBSD/Linux integration - monitor map with wrong sa_family
Mykola Golub wrote:
> https://github.com/ceph/ceph/pull/26042
merged
Yuri Weinstein
07:24 PM Backport #38140: luminous: Add hashinfo testing for dump command of ceph-objectstore-tool
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26284
merged
Yuri Weinstein
04:32 PM Bug #38358: short pg log + cache tier ceph_test_rados out of order reply
This is on luminous:
/a/teuthology-2019-02-23_01:30:03-rados-luminous-distro-basic-smithi/3627561/
We recently ...
Neha Ojha
03:25 PM Bug #38184: osd: recovery does not preserve copy-on-write allocations between object clones after...
This is indeed the current behavior. The OSD isn't clever enough to preserve the shared allocations across recovery.... Sage Weil
01:54 PM Bug #38513 (Rejected): luminous: "AsyncReserver.h: 190: FAILED assert(!queue_pointers.count(item)...
Run: http://pulpito.ceph.com/yuriw-2019-02-27_17:20:44-rados-wip-yuri3-testing-2019-02-25-2101-luminous-distro-basic-... Yuri Weinstein
12:36 PM Backport #38511 (Resolved): mimic: ceph CLI ability to change file ownership
https://github.com/ceph/ceph/pull/26760 Nathan Cutler
12:36 PM Backport #38510 (Resolved): luminous: ceph CLI ability to change file ownership
https://github.com/ceph/ceph/pull/26758 Nathan Cutler
12:36 PM Backport #38507 (Resolved): mimic: ENOENT on setattrs (obj was recently deleted)
https://github.com/ceph/ceph/pull/26709 Nathan Cutler
12:36 PM Backport #38506 (Resolved): luminous: ENOENT on setattrs (obj was recently deleted)
https://github.com/ceph/ceph/pull/26706 Nathan Cutler

02/27/2019

11:50 PM Feature #38136: core: lazy omap stat collection
Brad Hubbard wrote:
> Backporting https://github.com/ceph/ceph/pull/26614 may be easier Vikhyat if/when it merges?
...
Vikhyat Umrao
10:59 PM Bug #38431 (Resolved): osd: leaked pg refs on shutdown
Sage Weil
10:59 PM Bug #38477 (Resolved): upgrade to nautilus leaves v1: osd blacklist entries
Sage Weil
04:29 PM Backport #38342: mimic: luminous->(mimic,nautilus): PGMapDigest decode error on luminous end
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26451
merged
Yuri Weinstein
04:24 PM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
/a/rdias-2019-02-26_22:35:27-rados-wip-rdias2-testing-distro-basic-smithi/3642422
description: rados/thrash/{0-siz...
Sage Weil
03:35 PM Bug #38307: ceph-osd fails to bind to IPv6 interface for public_network
Jesse, what's the value of the ms_bind_ipv6 and ms_bind_ipv4 in your configuration when you hit this problem?
My t...
Ricardo Dias
01:55 PM Bug #38499 (Need More Info): ceph-mon segfaults at startup
... Abhishek Lekshmanan
08:54 AM Feature #38496 (New): ceph.in: use same units for displaying ceph osd df
... Марк Коренберг
12:03 AM Bug #38184: osd: recovery does not preserve copy-on-write allocations between object clones after...
Anyone? Vitaliy Filippov

02/26/2019

09:01 PM Bug #37808: osd: osdmap cache weak_refs assert during shutdown
/ceph/teuthology-archive/pdonnell-2019-02-26_07:49:50-multimds-wip-pdonnell-testing-20190226.051327-distro-basic-smit... Patrick Donnelly
02:35 PM Bug #38484 (Resolved): osd: InvalidRead, PG use-after-free putting ref
... Sage Weil
02:34 PM Bug #38403: osd: leaked from OSDMap::apply_incremental
/a/sage-2019-02-26_12:41:21-rados:verify-wip-sage-testing-2019-02-25-1642-distro-basic-smithi/3641678
Sage Weil
01:21 PM Bug #38483 (Fix Under Review): FAILED ceph_assert(p != pg_slots.end()) in OSDShard::register_and_...
https://github.com/ceph/ceph/pull/26651 Sage Weil
12:40 PM Bug #38483 (Resolved): FAILED ceph_assert(p != pg_slots.end()) in OSDShard::register_and_wake_spl...
... Sage Weil

02/25/2019

11:08 PM Bug #38433 (Duplicate): rados/test.sh timeout
#38238 Sage Weil
10:55 PM Bug #38477 (Fix Under Review): upgrade to nautilus leaves v1: osd blacklist entries
https://github.com/ceph/ceph/pull/26640 Sage Weil
07:13 PM Bug #38477 (Resolved): upgrade to nautilus leaves v1: osd blacklist entries
after a mimic -> nautilus upgrade,... Sage Weil
08:37 PM Backport #38141: mimic: Add hashinfo testing for dump command of ceph-objectstore-tool
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/26283
merged
Yuri Weinstein
06:53 PM Bug #38377 (Fix Under Review): OpTracker destruct assert when OSD destruct
Greg Farnum
06:41 PM Bug #38295 (Fix Under Review): luminous->(mimic,nautilus): PGMapDigest decode error on luminous end
Follow-up fix: https://github.com/ceph/ceph/pull/26636 Sage Weil
02:44 PM Feature #38370 (Pending Backport): ceph CLI ability to change file ownership
Sage Weil
02:43 PM Bug #38432 (Pending Backport): ENOENT on setattrs (obj was recently deleted)
Sage Weil
12:15 PM Support #38475 (New): PG stuck in creating state
Hi,
After one big fail of my ceph cluster I would like force create PGs (beacause old pgs are lost definitively).
...
Simon Falicon
11:21 AM Feature #38136: core: lazy omap stat collection
Backporting https://github.com/ceph/ceph/pull/26614 may be easier Vikhyat if/when it merges? Brad Hubbard
04:08 AM Backport #38443 (In Progress): mimic: osd-markdown.sh can fail with CLI_DUP_COMMAND=1
-https://github.com/ceph/ceph/pull/26618- Prashant D
01:56 AM Bug #37772: unittest_seastar_messenger fails with debug build
Hi,I tested both ceph master and ceph14.0.1 on X86 and the unittest_seastar_messenger passed with debug build. So I d... richael zhuang
01:03 AM Backport #38442 (In Progress): luminous: osd-markdown.sh can fail with CLI_DUP_COMMAND=1
https://github.com/ceph/ceph/pull/26616 Prashant D

02/24/2019

03:30 PM Bug #24320: out of order reply and/or osd assert with set-chunks-read.yaml
... Sage Weil
03:28 PM Bug #21592: LibRadosCWriteOps.CmpExt got 0 instead of -4095-1
/a/sage-2019-02-23_23:02:18-rados-wip-sage2-testing-2019-02-23-1354-distro-basic-smithi/3631993 Sage Weil
03:27 PM Bug #24990: api_watch_notify: LibRadosWatchNotify.Watch3Timeout failed
... Sage Weil
03:26 PM Bug #38358: short pg log + cache tier ceph_test_rados out of order reply
/a/sage-2019-02-23_23:02:18-rados-wip-sage2-testing-2019-02-23-1354-distro-basic-smithi/3631889 Sage Weil
07:52 AM Feature #38462 (New): Store comments to config options stored in monitors (i.e. ceph config dump)
It will be nice to have ability to add arbitrary comments to any option stored. In Ceph.conf it is possible. I see it... Марк Коренберг
06:47 AM Bug #38461 (New): Ceph osd out is the same as ceph osd reweight 0 (result in same bucket weights)
http://docs.ceph.com/docs/mimic/rados/operations/add-or-rm-osds says:... Марк Коренберг

02/23/2019

04:43 PM Feature #38458 (New): Ceph does not have command to show current osd primary-affinity
It will be nice to have ability to show current primary-affinity value for an osd. Марк Коренберг

02/22/2019

09:19 PM Backport #38423: luminous: osd/TestPGLog.cc: Verify that dup_index is being trimmed
Thanks for the tidy up Nathan. Brad Hubbard
05:45 PM Bug #22525 (Resolved): auth: ceph auth add does not sanity-check caps
Nathan Cutler
05:45 PM Backport #23670 (Resolved): luminous: auth: ceph auth add does not sanity-check caps
Nathan Cutler
05:27 PM Feature #37597 (Resolved): ceph-objectstore-tool: Add HashInfo to object dump output
Nathan Cutler
05:27 PM Backport #37690 (Resolved): luminous: ceph-objectstore-tool: Add HashInfo to object dump output
Nathan Cutler
05:26 PM Bug #37776 (Resolved): workunits/rados/test_health_warnings.sh fails with <9 osds down
Nathan Cutler
05:26 PM Backport #37815 (Resolved): luminous: workunits/rados/test_health_warnings.sh fails with <9 osds ...
Nathan Cutler
05:26 PM Bug #24601 (Resolved): FAILED assert(is_up(osd)) in OSDMap::get_inst(int)
Nathan Cutler
05:26 PM Backport #37833 (Resolved): luminous: FAILED assert(is_up(osd)) in OSDMap::get_inst(int)
Nathan Cutler
05:17 PM Bug #38431 (Fix Under Review): osd: leaked pg refs on shutdown
https://github.com/ceph/ceph/pull/26595 Sage Weil
05:06 PM Bug #38431: osd: leaked pg refs on shutdown
This appears to be as simple as a queued write in progress when shutdown happens:... Sage Weil
12:55 PM Bug #38431 (Resolved): osd: leaked pg refs on shutdown
/a/sage-2019-02-21_21:52:17-rados-wip-sage3-testing-2019-02-21-1359-distro-basic-smithi/3622562
w/ pg ref logs
Sage Weil
04:47 PM Bug #37808: osd: osdmap cache weak_refs assert during shutdown
/a/sage-2019-02-22_15:54:54-rados-wip-sage2-testing-2019-02-22-0711-distro-basic-smithi/3626248 Sage Weil
04:17 PM Bug #38070 (Resolved): A PG repairing doesn't mean PG is damaged
Nathan Cutler
04:17 PM Backport #38207 (Resolved): luminous: A PG repairing doesn't mean PG is damaged
Nathan Cutler
04:07 PM Backport #38317 (Resolved): mimic: filestore: fsync(2) return value not checked
Nathan Cutler
04:07 PM Bug #37593 (Resolved): ec pool lost data due to snap clone
Nathan Cutler
04:06 PM Backport #37993 (Resolved): luminous: ec pool lost data due to snap clone
Nathan Cutler
04:06 PM Cleanup #38025 (Resolved): qa/overrides/short_pg_log.yaml: reduce osd_{min,max}_pg_log_entries
Nathan Cutler
04:06 PM Backport #38046 (Resolved): luminous: qa/overrides/short_pg_log.yaml: reduce osd_{min,max}_pg_log...
Nathan Cutler
04:06 PM Bug #37919 (Resolved): osd/ECBackend.cc: 1547: FAILED ceph_assert(!(*m).is_missing(hoid))
Nathan Cutler
04:06 PM Backport #38105 (Resolved): luminous: osd/ECBackend.cc: 1547: FAILED ceph_assert(!(*m).is_missing...
Nathan Cutler
03:41 PM Backport #38450 (In Progress): mimic: src/osd/OSDMap.h: 1065: FAILED assert(__null != pool)
Nathan Cutler
02:36 PM Backport #38450 (Resolved): mimic: src/osd/OSDMap.h: 1065: FAILED assert(__null != pool)
https://github.com/ceph/ceph/pull/29976 Nathan Cutler
03:39 PM Backport #38243 (Resolved): mimic: scrub warning check incorrectly uses mon scrub interval
Nathan Cutler
03:36 PM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
ceph-client.admin.19974.log.gz is aio_pp
it starts, gets a few tests in, then the log stops unexpectedly......
Sage Weil
03:03 PM Bug #38238: rados/test.sh: api_aio_pp doesn't seem to start
another instance:
/a/sage-2019-02-21_21:52:17-rados-wip-sage3-testing-2019-02-21-1359-distro-basic-smithi/3622638
...
Sage Weil
03:09 PM Backport #38162 (In Progress): luminous: maybe_remove_pg_upmaps incorrectly cancels valid pending...
Nathan Cutler
02:57 PM Bug #38432 (Fix Under Review): ENOENT on setattrs (obj was recently deleted)
https://github.com/ceph/ceph/pull/26591 Sage Weil
12:59 PM Bug #38432 (Resolved): ENOENT on setattrs (obj was recently deleted)
... Sage Weil
02:34 PM Backport #38443 (Resolved): mimic: osd-markdown.sh can fail with CLI_DUP_COMMAND=1
https://github.com/ceph/ceph/pull/27907 Nathan Cutler
02:34 PM Backport #38442 (Resolved): luminous: osd-markdown.sh can fail with CLI_DUP_COMMAND=1
https://github.com/ceph/ceph/pull/26616 (merged for v12.2.12)
backport of follow-up fix: https://github.com/ceph/cep...
Nathan Cutler
02:33 PM Backport #38437 (Resolved): mimic: crc cache should be invalidated when posting preallocated rx b...
https://github.com/ceph/ceph/pull/29247 Nathan Cutler
02:33 PM Backport #38436 (Resolved): luminous: crc cache should be invalidated when posting preallocated r...
https://github.com/ceph/ceph/pull/29248 Nathan Cutler
01:18 PM Bug #38425 (Duplicate): mon: segmentation fault in AuthMonitor::create_pending
just fixed this, #38372 Sage Weil
04:45 AM Bug #38425: mon: segmentation fault in AuthMonitor::create_pending
Here's a different stack trace that's probably related:
/ceph/teuthology-archive/pdonnell-2019-02-19_07:16:18-fs-w...
Patrick Donnelly
04:42 AM Bug #38425 (Duplicate): mon: segmentation fault in AuthMonitor::create_pending
... Patrick Donnelly
01:14 PM Bug #38372 (Resolved): segfault in "AuthMonitor::increase_max_global_id()"
Sage Weil
01:11 PM Bug #38416 (Pending Backport): crc cache should be invalidated when posting preallocated rx buffers
Sage Weil
01:10 PM Bug #36337: OSDs crash with failed assertion in PGLog::merge_log as logs do not overlap
... Sage Weil
01:06 PM Bug #38433 (Duplicate): rados/test.sh timeout
... Sage Weil
06:03 AM Backport #38398 (In Progress): mimic: rados_shutdown hang forever in ~objecter()
https://github.com/ceph/ceph/pull/26583 Prashant D
03:35 AM Backport #38400 (In Progress): luminous: rados_shutdown hang forever in ~objecter()
https://github.com/ceph/ceph/pull/26579 Prashant D
 

Also available in: Atom