Implementing WordCount with RHadoop
2014-08-08 20:08
Reference: "Big Data Analytics with R and Hadoop"
WordCount is the "Hello World" of Hadoop.
So, let's begin.
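library(rmr2)   # provides mapreduce() and keyval()
library(rhdfs)  # provides hdfs.put()
hdfs.init()     # rhdfs must be initialized before any hdfs.* call

wordcount = function(input, output = NULL, pattern = " ")
{
    # Map: split each line into words and emit (word, 1) pairs
    wc.map = function(., lines)
    {
        keyval(unlist(strsplit(x = lines, split = pattern)), 1)
    }
    # Reduce: sum the counts collected for each word
    wc.reduce = function(word, counts)
    {
        keyval(word, sum(counts))
    }
    # The reducer also serves as the combiner, since summing is associative
    mapreduce(input = input, output = output, input.format = "text",
              map = wc.map, reduce = wc.reduce, combine = TRUE)
}

# Copy the local file "word" into HDFS, then count the words in that directory.
# Wrap the call in from.dfs() to pull the result into memory.
hdfs.put('/home/hadoop/桌面/word', '/RHadoop/1/')
wordcount('/RHadoop/1/')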
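What you need to do: change the paths to match your setup, create a file named word at the corresponding local path, and put some English text in it. I have tested this myself. In pseudo-distributed mode, 600 MB took fourteen minutes; on a fully distributed cluster of 10 machines, 1.4 GB took three minutes and forty seconds. The environment was unstable at the time, so it should run faster now.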
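Before running on a cluster, it can help to check the map and reduce logic on a few lines in plain R. The following is a minimal sketch that uses base R only, with no Hadoop or rmr2 involved; the function name local_wordcount and the sample lines are made up for illustration. Unlike the job above, it also drops the empty strings that strsplit produces when separators repeat, which is an added assumption about what counts as a word.

```r
# Local, Hadoop-free sketch of the same map and reduce logic (base R only).
# local_wordcount is a hypothetical helper, not part of rmr2 or rhdfs.
local_wordcount <- function(lines, pattern = " ") {
  # Map step: split every line into words; each word stands for a count of 1
  words <- unlist(strsplit(x = lines, split = pattern))
  # Assumption: ignore empty strings caused by repeated separators
  words <- words[nzchar(words)]
  # Shuffle and reduce step: group the 1s by word and sum them
  counts <- tapply(rep(1, length(words)), words, sum)
  data.frame(word = names(counts),
             count = as.vector(counts),
             stringsAsFactors = FALSE)
}

# Hypothetical sample input
sample_lines <- c("hello world", "hello hadoop")
result <- local_wordcount(sample_lines)
print(result)
```

If the counts look right on a small sample, the same splitting pattern can be passed to wordcount as the pattern argument.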