Project

General

Profile

Hadoop over Ceph RGW status update » History » Version 4

Yuan Zhou, 06/10/2015 01:34 PM

1 2 Yuan Zhou
h3. Hadoop over Ceph RGW status update
2 1 Yuan Zhou
3 3 Yuan Zhou
*Summary*
4 1 Yuan Zhou
The goal is to write a Hadoop Compatible Filesystem(RGWFS) to allow Hadoop run over RGW. We also want to add load balancing feature to allow this scale to some rack rachitecture
5
6 3 Yuan Zhou
*Owners*
7 1 Yuan Zhou
Yuan Zhou(Intel)
8
Jian Zhang(Intel)
9
10
11 4 Yuan Zhou
*Interested Parties*
12 1 Yuan Zhou
If you are interested in contributing to this blueprint, or want to be a "speaker" during the Summit session, list your name here.
13
Name (Affiliation)
14
Name (Affiliation)
15
Name
16
17 4 Yuan Zhou
*Current Status*
18 2 Yuan Zhou
In Infernalis we proposed this BP(Hadoop over Ceph Radosgw with SSD cache). During the last several months, we've got some progress.
19 4 Yuan Zhou
* RGWFS
20 1 Yuan Zhou
Thanks to SwiftFS, RGWFS is able to reuse lots of code. Currently the general code path is done. We're able to read/write with Hadoop command line tool through RGWFS, which talks to the backend Rados cluster.
21 4 Yuan Zhou
* RGW-Proxy
22 1 Yuan Zhou
We have implented a simple WSGI server that can give out the nearest RGW instance by looking through the internal data mapping in the Rados cluster. By giving the object name, RGW-Proxy would query in the cluster to check the mapping of data(ceph osd map obj_name), and then give out corresponding RGW instance
23
24
25 2 Yuan Zhou
*Detailed Description*
26 1 Yuan Zhou
There're a few things we're working on. 
27 2 Yuan Zhou
* Make RGWFS work with multiple RGW instance
28
* Performance testing
29 1 Yuan Zhou
30 2 Yuan Zhou
*Work items*
31 1 Yuan Zhou
This section should contain a list of work tasks created by this blueprint.  Please include engineering tasks as well as related build/release and documentation work.  If this blueprint requires cleanup of deprecated features, please list those tasks as well.
32
33 2 Yuan Zhou
*Coding tasks*
34 1 Yuan Zhou
Task 1
35
Task 2
36
Task 3
37
38 2 Yuan Zhou
*Build / release tasks*
39 1 Yuan Zhou
Task 1
40
Task 2
41
Task 3
42
43 2 Yuan Zhou
*Documentation tasks*
44 1 Yuan Zhou
Task 1
45
Task 2
46
Task 3
47
48 2 Yuan Zhou
*Deprecation tasks*
49 1 Yuan Zhou
Task 1
50
Task 2
51
Task 3