Feature #626
closedqa: add IOR, rompio, or other parallel workloads suite
Description
We've had reports that rompio is just terrifically unstable, and shows serious scaling issues.
IOR is a more common benchamrk in this area.
Updated by Sage Weil over 12 years ago
- Tracker changed from Tasks to Feature
- Subject changed from Test rompio on cfuse to qa: add IOR, rompio, or other parallel workloads suite
- Target version set to v0.37
Updated by Sage Weil over 12 years ago
- Target version deleted (
v0.37) - Translation missing: en.field_position set to 21
Updated by Sage Weil over 12 years ago
IOR depends on mpi. mpich2 is pretty easy to set up (there's a package).
I think an ior task would need to:
- take a list of clients
- push a machine list to a 'master' node that starts the job (probably not where teuth itself is running)
- set up a temp ssh key so they can connect to each other (or we could make this part of the teuth worker config?)
- set up a symlink on each client so that a single path (/tmp/cephtest/sharedmnt.0 or something) gets you into the mount point on all machines
- download the ior tarball, compile and build it (needs to build with mpicc, which requires mpich2 be installed)
- run ior with whatever parameters the task specifies.
Might make sense to make a generic 'mpi' task that sets up the mpi environment, and separate that from the ior bits.
Updated by Sage Weil over 12 years ago
- Target version set to v0.36
- Translation missing: en.field_position deleted (
25) - Translation missing: en.field_position set to 31
Updated by Sage Weil over 12 years ago
- Translation missing: en.field_position deleted (
53) - Translation missing: en.field_position set to 878
Updated by Sage Weil over 12 years ago
- Translation missing: en.field_position deleted (
886) - Translation missing: en.field_position set to 1
- Translation missing: en.field_position changed from 1 to 899
Updated by Sage Weil over 12 years ago
- Target version changed from v0.36 to v0.37
Updated by Sage Weil over 11 years ago
- Project changed from Ceph to CephFS
- Category deleted (
qa)
Updated by Sage Weil over 11 years ago
- Translation missing: en.field_position deleted (
1339) - Translation missing: en.field_position set to 2
Updated by Greg Farnum over 11 years ago
SamL has done some work on getting MPI going under teuthology, and on running some multi-client FS tests. I'm not sure what the status of that work is, but whoever does this bug will need to check into that.
Updated by Sage Weil over 11 years ago
- Status changed from New to In Progress
- Assignee set to Sam Lang
Yeah, that's what slang's working on to enable this. Assigning this to him.
Updated by Sage Weil over 11 years ago
- Translation missing: en.field_position deleted (
4) - Translation missing: en.field_position set to 1
Updated by Sage Weil over 11 years ago
- Target version set to v0.57b
- Translation missing: en.field_position deleted (
1) - Translation missing: en.field_position set to 5
Updated by Sam Lang over 11 years ago
- Status changed from In Progress to Closed
Added tests to the marginal qa suite that run IOR, mdtest, and fsx-mpi.