Friday, January 10, 2014

Apache Tajo CDH - Cluster Setup (Configuration)

1. Version

- CDH: CDH-4.4.0-1.cdh4.4.0.p0.39
- Hadoop: Hadoop 2.0.0-cdh4.4.0
- Tajo: https://github.com/gruter/tajo-cdh (2014.01.10)

and

- tajo-0.2.0-incubating

2. Configuration

- cd $TAJO_HOME/conf
- cp tajo-site.xml.template tajo-site.xml
- vi tajo-site.xml
- add the following properties:

<property>
<name>tajo.rootdir</name>
<value>hdfs://hostname:port/tajo</value>
</property>

<property>
<name>tajo.master.umbilical-rpc.address</name>
<value>hostname:26001</value>
</property>

<property>
<name>tajo.catalog.client-rpc.address</name>
<value>hostname:26005</value>
</property>

<property>
<name>tajo.cluster.distributed</name>
<value>true</value>
</property>

- (the 'tajo.cluster.distributed' property appeared in earlier documentation, but it is not listed in the current documentation)
- $HADOOP_HOME/bin/hadoop fs -mkdir /tajo
- $HADOOP_HOME/bin/hadoop fs -chmod g+w /tajo
- vi workers
- add the worker host names, one per line
- example:

worker1
worker2
worker3
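
The steps in this section can be collected into one script. This is only a sketch under assumptions: the NameNode address `namenode:8020`, the master host `master1`, and the scratch fallback directory are placeholders that do not come from the setup above. The script writes a fresh tajo-site.xml instead of editing the copied template, and the HDFS step runs only when a `hadoop` binary is found under `$HADOOP_HOME`.

```shell
#!/bin/sh
# Sketch of the configuration steps above. Host names and the HDFS port are
# placeholders (assumptions); replace them with the values for your cluster.
set -eu

TAJO_HOME="${TAJO_HOME:-${TMPDIR:-/tmp}/tajo-demo}"   # scratch fallback for a dry run
NAMENODE="${NAMENODE:-namenode:8020}"                 # assumed NameNode host:port
MASTER="${MASTER:-master1}"                           # assumed Tajo master host
WORKERS="${WORKERS:-worker1 worker2 worker3}"         # worker host names

CONF_DIR="$TAJO_HOME/conf"
mkdir -p "$CONF_DIR"

# Write tajo-site.xml with the properties listed above.
# Note: this overwrites any existing tajo-site.xml in $CONF_DIR.
cat > "$CONF_DIR/tajo-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>tajo.rootdir</name>
    <value>hdfs://$NAMENODE/tajo</value>
  </property>
  <property>
    <name>tajo.master.umbilical-rpc.address</name>
    <value>$MASTER:26001</value>
  </property>
  <property>
    <name>tajo.catalog.client-rpc.address</name>
    <value>$MASTER:26005</value>
  </property>
  <property>
    <name>tajo.cluster.distributed</name>
    <value>true</value>
  </property>
</configuration>
EOF

# Write the workers file, one host name per line.
# $WORKERS is left unquoted on purpose so it splits on whitespace.
printf '%s\n' $WORKERS > "$CONF_DIR/workers"

# Create the Tajo root directory on HDFS when Hadoop is available.
HADOOP_BIN="${HADOOP_HOME:-}/bin/hadoop"
if [ -x "$HADOOP_BIN" ]; then
  "$HADOOP_BIN" fs -test -d /tajo || "$HADOOP_BIN" fs -mkdir /tajo
  "$HADOOP_BIN" fs -chmod g+w /tajo
else
  echo "hadoop not found; skipping HDFS directory setup" >&2
fi
```

Running it without `TAJO_HOME` set only produces the two files in the scratch directory, which makes it easy to inspect the result before pointing it at a real installation.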

3. Reference

http://tajo.incubator.apache.org/configuration.html

<property>
<name>org.apache.tajo.rootdir</name>
<value>hdfs://hostname:port/tajo</value>
</property>

<property>
<name>org.apache.tajo.cluster.distributed</name>
<value>true</value>
</property>

http://tajo.incubator.apache.org/tajo-0.2.0-doc.html

<property>
<name>tajo.rootdir</name>
<value>hdfs://hostname:port/tajo</value>
</property>

<property>
<name>tajo.master.umbilical-rpc.address</name>
<value>hostname:26001</value>
</property>

<property>
<name>tajo.catalog.client-rpc.address</name>
<value>hostname:26005</value>
</property>
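
The two pages above name the same settings with different prefixes (`org.apache.tajo.*` and `tajo.*`), so it may help to check which style a given tajo-site.xml uses. The following is a minimal sketch: `check_prefixes` is a hypothetical helper, not part of Tajo, and it assumes each `<name>` element sits on a single line as in the snippets above. Which prefix a particular Tajo build actually reads is not stated here, so treat the output as a lint aid only.

```shell
#!/bin/sh
# check_prefixes FILE: list every property name in FILE and label its prefix.
# Hypothetical helper for illustration; it is not shipped with Tajo.
check_prefixes() {
  # Print the text between <name> and </name>, one name per line.
  sed -n 's:.*<name>\(.*\)</name>.*:\1:p' "$1" | while read -r name; do
    case "$name" in
      org.apache.tajo.*) echo "long-prefix:  $name" ;;
      tajo.*)            echo "short-prefix: $name" ;;
      *)                 echo "other:        $name" ;;
    esac
  done
}

# Demo on a throwaway file that mixes both styles from the pages above.
demo=$(mktemp)
cat > "$demo" <<'EOF'
<property>
<name>org.apache.tajo.rootdir</name>
<value>hdfs://hostname:port/tajo</value>
</property>
<property>
<name>tajo.master.umbilical-rpc.address</name>
<value>hostname:26001</value>
</property>
EOF
check_prefixes "$demo"
rm -f "$demo"
```

Against a real installation the call would be `check_prefixes "$TAJO_HOME/conf/tajo-site.xml"`.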
