hbase 表数据迁移
2016-04-07 16:25
246 查看
http://blog.csdn.net/xiao_jun_0820/article/details/28615557
1 CopyTable 工具
用法:
CopyTable is a utility that can copy part or of all of a table, either to the same cluster or another cluster. The target table must first exist. The usage is as follows:
Options:
Args:
tablename Name of table to copy.
Example of copying 'TestTable' to a cluster that uses replication for a 1 hour window:
Caching for the input Scan is configured via
By default, CopyTable utility only copies the latest version of row cells unless
See Jonathan Hsieh's Online HBase Backups with CopyTable blog post for
more on CopyTable.
2 Export和Import工具
Export is a utility that will dump the contents of table to HDFS in a sequence file. Invoke via:
Note: caching for the input Scan is configured via
Import is a utility that will load data that has been exported back into HBase. Invoke via:
To import 0.94 exported files in a 0.96 cluster or onwards, you need to set system property "hbase.import.version" when running the import command as below:
export带时间范围的具体用法: hbase org.apache.hadoop.hbase.mapreduce.Export member5 hdfs://master24:9000/user/hadoop/dump2 1 1401938590466 1401938590467
导出路径为HDFS路径,写全路径。
导入的表必须存在预先定义好。
1 CopyTable 工具
用法:
CopyTable is a utility that can copy part or of all of a table, either to the same cluster or another cluster. The target table must first exist. The usage is as follows:
$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable [--starttime=X] [--endtime=Y] [--new.name=NEW] [--peer.adr=ADR] tablename
Options:
starttimeBeginning of the time range. Without endtime means starttime to forever.
endtimeEnd of the time range. Without endtime means starttime to forever.
versionsNumber of cell versions to copy.
new.nameNew table's name.
peer.adrAddress of the peer cluster given in the format hbase.zookeeper.quorum:hbase.zookeeper.client.port:zookeeper.znode.parent
familiesComma-separated list of ColumnFamilies to copy.
all.cellsAlso copy delete markers and uncollected deleted cells (advanced option).
Args:
tablename Name of table to copy.
Example of copying 'TestTable' to a cluster that uses replication for a 1 hour window:
$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable --starttime=1265875194289 --endtime=1265878794289 --peer.adr=server1,server2,server3:2181:/hbase TestTable
Scanner Caching
Caching for the input Scan is configured via hbase.client.scanner.cachingin the job configuration.
Versions
By default, CopyTable utility only copies the latest version of row cells unless --versions=nis explicitly specified in the command.
See Jonathan Hsieh's Online HBase Backups with CopyTable blog post for
more on CopyTable.
2 Export和Import工具
Export is a utility that will dump the contents of table to HDFS in a sequence file. Invoke via:
$ bin/hbase org.apache.hadoop.hbase.mapreduce.Export <tablename> <outputdir> [<versions> [<starttime> [<endtime>]]]
Note: caching for the input Scan is configured via
hbase.client.scanner.cachingin the job configuration.
$ bin/hbase org.apache.hadoop.hbase.mapreduce.Export <tablename> <outputdir> [<versions> [<starttime> [<endtime>]]]
Import is a utility that will load data that has been exported back into HBase. Invoke via:
$ bin/hbase org.apache.hadoop.hbase.mapreduce.Import <tablename> <inputdir>
To import 0.94 exported files in a 0.96 cluster or onwards, you need to set system property "hbase.import.version" when running the import command as below:
$ bin/hbase -Dhbase.import.version=0.94 org.apache.hadoop.hbase.mapreduce.Import <tablename> <inputdir>
export带时间范围的具体用法: hbase org.apache.hadoop.hbase.mapreduce.Export member5 hdfs://master24:9000/user/hadoop/dump2 1 1401938590466 1401938590467
导出路径为HDFS路径,写全路径。
导入的表必须存在预先定义好。
相关文章推荐
- 如何执行hbase 的mapreduce job
- PHP与Memcached服务器交互的分布式实现源码分析
- 关于kafka producer 分区策略的思考
- Android App开发中RecyclerView控件的基本使用教程
- 腾讯案例实战!聊聊设计中「需求」的正确打开方式
- spl 中转换时间NOW()
- 生成10个数并求它们的和(1)
- 看!oracle 的循环LOOP
- 【BZOJ3196】【Tyvj1730】二逼平衡树,第一次的树套树(线段树+splay)
- Java---JDBC学习
- 随机十个数求和
- Linux alias --设置命令的别名
- navicate Premium注册码
- 问卷调查
- 在Mac终端中使用vim编辑文件步骤
- php等守护进程监控脚本(转载 http://www.9958.pw/post/php_script_scan)
- DynamicJSONserializer
- LeakDiag的使用和形成的LOG文件的分析方法
- isKindOfClass|isMemberOfClass|conformsToProtocol|respondsToSelector|methodForSelector的详细介绍和区别
- Android 辅助服务资料收集