SPARK PAIRED RDD JOIN

Time: 2018-09-03 20:29:35

Tags: apache-spark join

I am trying to join three different RDDs in Spark, but it throws an error.

Error:-

// Load each CSV and build a pair RDD: first column as key, second column as value
val name = sc.textFile("/user/kumarrupesh2389619/EmployeeName.csv")
val namepairRDD = name.map(x => (x.split(",")(0), x.split(",")(1)))

val manger = sc.textFile("/user/kumarrupesh2389619/Employeemanager.csv")
val mangerpairRDD = manger.map(x => (x.split(",")(0), x.split(",")(1)))

val salary = sc.textFile("/user/kumarrupesh2389619/Employeesalary.csv")
val salarypairRDD = salary.map(x => (x.split(",")(0), x.split(",")(1)))

// Inner join on the key; the result type is RDD[(String, ((String, String), String))]
val joineddata = namepairRDD.join(mangerpairRDD).join(salarypairRDD)
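
Chaining two joins like this yields nested tuples of the form (key, ((first value, second value), third value)). The following is only a rough sketch of that shape, written with plain Scala collections rather than Spark so it runs without a cluster. The names `parsePair` and `innerJoin` and all sample rows are hypothetical and not taken from the post. The sketch also skips lines with fewer than two fields, since `x.split(",")(1)` throws an ArrayIndexOutOfBoundsException on such a line.

```scala
object ChainedJoinSketch {
  // Parse "key,value" into a pair. Lines with fewer than two fields
  // yield None instead of throwing ArrayIndexOutOfBoundsException.
  def parsePair(line: String): Option[(String, String)] =
    line.split(",", -1) match {
      case Array(k, v, _*) => Some((k.trim, v.trim))
      case _               => None
    }

  // Inner join of two key-value sequences. This mimics the result shape
  // of a pair-RDD join; it is not Spark's implementation.
  def innerJoin[K, A, B](left: Seq[(K, A)], right: Seq[(K, B)]): Seq[(K, (A, B))] =
    for {
      l <- left
      r <- right
      if l._1 == r._1
    } yield (l._1, (l._2, r._2))

  // Hypothetical sample rows standing in for the three CSV files.
  val names: Seq[(String, String)] =
    Seq("E01,Lokesh", "E02,Bhupesh", "E03,Amit", "badline").flatMap(parsePair)
  val managers: Seq[(String, String)] =
    Seq("E01,Vishnu", "E02,Satyam").flatMap(parsePair)
  val salaries: Seq[(String, String)] =
    Seq("E01,50000", "E02,62000").flatMap(parsePair)

  // Same nesting as the chained join: (id, ((name, manager), salary)).
  val joined: Seq[(String, ((String, String), String))] =
    innerJoin(innerJoin(names, managers), salaries)

  // Flatten the nested tuples into one flat record per key.
  val flat: Seq[(String, String, String, String)] =
    joined.map { case (id, ((name, manager), salary)) => (id, name, manager, salary) }

  def main(args: Array[String]): Unit = flat.foreach(println)
}
```

Only keys present in all three inputs survive, which matches inner-join semantics; here the row for `E03` is dropped because it has no manager or salary entry.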

0 answers:

No answers