Spark: PageRank
2017-02-03 16:23
package spark1

import org.apache.spark.{SparkConf, SparkContext}

object test {
  def main(args: Array[String]): Unit = {
    // Number of PageRank iterations to run
    val nums = 2
    val conf = new SparkConf().setAppName("PageRank").setMaster("local")
    val sc = new SparkContext(conf)

    // Adjacency list: page -> pages it links to.
    // Cached because it is joined again in every iteration.
    val links = sc.parallelize(Array(
      ('A', Array('D')),
      ('B', Array('A')),
      ('C', Array('A', 'B')),
      ('D', Array('A', 'C'))
    )).cache()

    // Every page starts with rank 1.0
    var ranks = sc.parallelize(Array(('A', 1.0), ('B', 1.0), ('C', 1.0), ('D', 1.0)))

    for (i <- 1 to nums) {
      // Each page splits its current rank evenly among its outgoing links
      val contribs = links.join(ranks).flatMap {
        case (url, (outLinks, rank)) => outLinks.map(dest => (dest, rank / outLinks.size))
      }
      // Sum the contributions per page, then apply the damping factor 0.85
      ranks = contribs.reduceByKey(_ + _).mapValues(0.15 + 0.85 * _)
    }

    ranks.foreach(println)
  }
}
Output after 2 iterations:
(B,0.394375)
(A,1.3081249999999998)
(C,0.575)
(D,1.7224999999999997)
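As a check on the printed ranks, the update applied on each pass of the loop can be written out and evaluated by hand. The notation is introduced here for illustration and does not come from the code: r_k(v) is the rank of page v after k iterations, In(v) is the set of pages linking to v, and Out(u) is the set of pages that u links to. This is the simplified update the code uses, with a constant term of 0.15 and a damping factor of 0.85.

```latex
% Update rule applied in each iteration (notation assumed, see above)
r_{k+1}(v) = 0.15 + 0.85 \sum_{u \in \mathrm{In}(v)} \frac{r_k(u)}{|\mathrm{Out}(u)|}

% Iteration 1, starting from r_0 = 1.0 for every page
r_1(A) = 0.15 + 0.85\,(1.0 + 0.5 + 0.5) = 1.85      % from B, C, D
r_1(B) = 0.15 + 0.85\,(0.5)             = 0.575     % from C
r_1(C) = 0.15 + 0.85\,(0.5)             = 0.575     % from D
r_1(D) = 0.15 + 0.85\,(1.0)             = 1.0       % from A

% Iteration 2
r_2(A) = 0.15 + 0.85\,(0.575 + 0.2875 + 0.5) = 1.308125
r_2(B) = 0.15 + 0.85\,(0.2875)               = 0.394375
r_2(C) = 0.15 + 0.85\,(0.5)                  = 0.575
r_2(D) = 0.15 + 0.85\,(1.85)                 = 1.7225
```

These values agree with the program output up to floating-point rounding.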