2.3. Hadoop
还得自己build Hadoop,不然会因为缺少a durable sync而丢失数据,真贱……
暂时没管
2.3.1. Hadoop Security
2.3.2. dfs.datanode.max.xcievers
暂时没管
http://hbase.apache.org/book/standalone_dist.html
2.4. HBase run modes: Standalone and Distributed
2.4.1. Standalone HBase
没用到
2.4.2. Distributed
2.4.2.1. Pseudo-distributed
没用到
2.4.2.2. Fully-distributed
需要指定NameNode和hbase.cluster.distributed = true
<configuration> ... <property> <name>hbase.rootdir</name> <value>hdfs://namenode.example.org:54310/hbase</value> <description>The directory shared by RegionServers. </description> </property> <property> <name>hbase.cluster.distributed</name> <value>true</value> <description>The mode the cluster will be in. Possible values are false: standalone and pseudo-distributed setups with managed Zookeeper true: fully-distributed with unmanaged Zookeeper Quorum (see hbase-env.sh) </description> </property> ... </configuration>
2.4.2.2.1. regionservers
设置conf/regionservers添加region servers
2.4.2.2.2. ZooKeeper and HBase
设置hbase-site.xml添加
<property> <name>hbase.zookeeper.quorum</name> <value>rs1.example.com,rs2.example.com,rs3.example.com,rs4.example.com,rs5.example.com</value> <description>Comma separated list of servers in the ZooKeeper Quorum. For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com". By default this is set to localhost for local and pseudo-distributed modes of operation. For a fully-distributed setup, this should be set to a full list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in hbase-env.sh this is the list of servers which we will start/stop ZooKeeper on. </description>
2.4.2.2.3. HDFS Client Configuration
编辑hbase-env.sh添加
export HBASE_CLASSPATH = /home/michael/lib/hadoop-0.20.203.0/conf export JAVA_HOME=/home/michael/lib/jdk1.6.0_27
2.4.3. Running and Confirming Your Installation
运行bin/start-hbase.sh启动hbase