Big Data: Installing Pig
2015-09-01 09:55
1. Download
2. Extract and install
Installing in MapReduce mode:
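1: Set HADOOP_HOME. If the node running Pig is not one of the cluster nodes, copy the Hadoop installation package used by the cluster onto it.
export HADOOP_HOME=/usr/local/hadoop-2.6.0
2: Create a folder, cluster-conf, that holds the Hadoop configuration files core-site.xml, hdfs-site.xml, mapred-site.xml, and yarn-site.xml. For the specific properties, refer to the provided configuration files.
export PIG_CLASSPATH=../cluster-conf
export HADOOP_CONF_DIR=../cluster-conf
Download these four Hadoop configuration files for Pig.
Note: if execution fails with an error (figure on the right), start the job history server on the master node, because after running a job on the Hadoop cluster Pig has to parse the execution logs to learn whether the job succeeded.
sbin/mr-jobhistory-daemon.sh start historyserver (this must be run on the ResourceManager master node)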
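Steps 1 and 2 can be sketched as a small shell helper that refuses to export the variables until the configuration folder holds all four files. This is only an illustration: the function names check_cluster_conf and setup_pig_env are hypothetical, and the paths are the ones quoted in the steps, so adjust them to your own layout.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, paths come from the steps in the text.

# check_cluster_conf DIR: succeed only if DIR holds all four Hadoop config files.
check_cluster_conf() {
    conf_dir=$1
    missing=0
    for f in core-site.xml hdfs-site.xml mapred-site.xml yarn-site.xml; do
        if [ ! -f "$conf_dir/$f" ]; then
            echo "missing: $conf_dir/$f" >&2
            missing=1
        fi
    done
    return "$missing"
}

# setup_pig_env HADOOP_HOME_DIR CONF_DIR: export the three variables from the
# steps, but only after the configuration folder has passed the check.
setup_pig_env() {
    check_cluster_conf "$2" || return 1
    export HADOOP_HOME="$1"
    export PIG_CLASSPATH="$2"
    export HADOOP_CONF_DIR="$2"
}
```

For the exports to reach the shell that later starts pig, source the file rather than executing it, then call for example: setup_pig_env /usr/local/hadoop-2.6.0 ../cluster-conf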
3. Verify the installation by running: pig
Enter the Pig shell:
Then specify a file stored on Hadoop:
hello.txt
hello you hello me
Run:
A = load 'hdfs://hadoop11:9000/hello.txt' as (name:chararray, myname:chararray);
dump A;
Finally, the result is displayed. Success.
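One detail worth keeping in mind when reading the result: the load statement names no load function, so Pig falls back to PigStorage, which splits fields on tabs by default. The sample line is separated by spaces, so the whole line would be expected to land in name, with myname left null. The sketch below only emulates that split with awk to make the effect visible; it is not Pig itself, and split_two_fields is a hypothetical helper name.

```shell
#!/bin/sh
# Emulation only: awk stands in for Pig's default loader to show the field split.

# split_two_fields DELIM: read lines on stdin and print the first two
# DELIM-separated fields in a tuple-like "(name,myname)" form.
split_two_fields() {
    awk -F "$1" '{ print "(" $1 "," $2 ")" }'
}

line='hello you hello me'

# Tab delimiter (PigStorage's default): the line contains no tab, so the
# whole line becomes the first field and the second field stays empty.
printf '%s\n' "$line" | split_two_fields '\t'

# Space delimiter: the first two words become the two fields.
printf '%s\n' "$line" | split_two_fields ' '
```

If the space-separated behaviour is what you want in Pig, PigStorage accepts the delimiter as an argument, for example: load 'hdfs://hadoop11:9000/hello.txt' using PigStorage(' ') as (name:chararray, myname:chararray);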