Project

General

Profile

Actions

Feature #3730

closed

Support replication factor in Hadoop

Added by Noah Watkins over 11 years ago. Updated about 5 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Component(FS):
Hadoop/Java
Labels (FS):
Java/Hadoop
Pull request ID:

Description

In order to support per-file replication values in Hadoop we need to specify that a new file should be generated in a data pool configured with the desired replication factor.

We'll expand the Hadoop configuration to support a mapping of [int] -> [string] that provides a replication to pool_name mapping for the Ceph installation.

There are 3 cases:

  1. No configuration is given. In this case we use always use the default pool.
  2. Configuration is given and exact replication factor match is found. Use the pool.
  3. Configuration is given and in-exact match is found: use closest match, round up.
Actions

Also available in: Atom PDF