Project

General

Profile

Hadoop over Ceph RGW status update » History » Version 2

Yuan Zhou, 06/10/2015 01:31 PM

1 2 Yuan Zhou
h3. Hadoop over Ceph RGW status update
2 1 Yuan Zhou
3 2 Yuan Zhou
* Summary *
4 1 Yuan Zhou
The goal is to write a Hadoop Compatible Filesystem(RGWFS) to allow Hadoop run over RGW. We also want to add load balancing feature to allow this scale to some rack rachitecture
5
6 2 Yuan Zhou
* Owners *
7 1 Yuan Zhou
Yuan Zhou(Intel)
8
Jian Zhang(Intel)
9
10
11 2 Yuan Zhou
* Interested Parties *
12 1 Yuan Zhou
If you are interested in contributing to this blueprint, or want to be a "speaker" during the Summit session, list your name here.
13
Name (Affiliation)
14
Name (Affiliation)
15
Name
16
17 2 Yuan Zhou
* Current Status *
18 1 Yuan Zhou
In Infernalis we proposed this BP(Hadoop over Ceph Radosgw with SSD cache). During the last several months, we've got some progress.
19 2 Yuan Zhou
RGWFS
20 1 Yuan Zhou
Thanks to SwiftFS, RGWFS is able to reuse lots of code. Currently the general code path is done. We're able to read/write with Hadoop command line tool through RGWFS, which talks to the backend Rados cluster.
21
22 2 Yuan Zhou
RGW-Proxy
23 1 Yuan Zhou
We have implented a simple WSGI server that can give out the nearest RGW instance by looking through the internal data mapping in the Rados cluster. By giving the object name, RGW-Proxy would query in the cluster to check the mapping of data(ceph osd map obj_name), and then give out corresponding RGW instance
24
25
26 2 Yuan Zhou
*Detailed Description*
27 1 Yuan Zhou
There're a few things we're working on. 
28 2 Yuan Zhou
* Make RGWFS work with multiple RGW instance
29
* Performance testing
30 1 Yuan Zhou
31 2 Yuan Zhou
*Work items*
32 1 Yuan Zhou
This section should contain a list of work tasks created by this blueprint.  Please include engineering tasks as well as related build/release and documentation work.  If this blueprint requires cleanup of deprecated features, please list those tasks as well.
33
34 2 Yuan Zhou
*Coding tasks*
35 1 Yuan Zhou
Task 1
36
Task 2
37
Task 3
38
39 2 Yuan Zhou
*Build / release tasks*
40 1 Yuan Zhou
Task 1
41
Task 2
42
Task 3
43
44 2 Yuan Zhou
*Documentation tasks*
45 1 Yuan Zhou
Task 1
46
Task 2
47
Task 3
48
49 2 Yuan Zhou
*Deprecation tasks*
50 1 Yuan Zhou
Task 1
51
Task 2
52
Task 3