tonykang22 / study


09. Spark Overview: RDD - Fault Tolerance #169

Open tonykang22 opened 1 year ago

tonykang22 commented 1 year ago

RDD - Fault Tolerance

Lineage


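val msgs = sc.textFile("hdfs://...")      // HadoopRDD: read lines from HDFS
  .filter(s => s.startsWith("ERROR"))     // FilteredRDD: keep only lines starting with "ERROR"
  .map(s => s.split("\t")(2))             // MappedRDD: take the third tab-separated field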


  1. MappedRDD
    • func = split(...)
  2. FilteredRDD
    • func = startsWith(...)
  3. HadoopRDD
    • path = hdfs://...
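
The lineage chain above can be sketched as a toy model in plain Scala. This is not Spark's implementation: `ToyRDD`, `SourceToyRDD`, `FilteredToyRDD`, `MappedToyRDD`, and the sample log lines are all hypothetical names and data made up for illustration. The sketch only assumes that each node remembers its parent and the function it applies, so a lost result can be rebuilt by replaying the chain from the source.

```scala
object LineageDemo {
  // Toy stand-in for an RDD: it stores no data, only how to derive it.
  sealed trait ToyRDD[A] {
    def compute(): Seq[A]     // rebuild the data by replaying the lineage
    def lineage: List[String] // this node first, then its ancestors
  }

  // Plays the role of HadoopRDD: the root of the lineage (hypothetical reader function).
  final case class SourceToyRDD(path: String, read: () => Seq[String]) extends ToyRDD[String] {
    def compute(): Seq[String] = read()
    def lineage: List[String] = List(s"HadoopRDD(path = $path)")
  }

  // Plays the role of FilteredRDD: parent + predicate.
  final case class FilteredToyRDD[A](parent: ToyRDD[A], name: String, pred: A => Boolean) extends ToyRDD[A] {
    def compute(): Seq[A] = parent.compute().filter(pred)
    def lineage: List[String] = s"FilteredRDD(func = $name)" :: parent.lineage
  }

  // Plays the role of MappedRDD: parent + mapping function.
  final case class MappedToyRDD[A, B](parent: ToyRDD[A], name: String, f: A => B) extends ToyRDD[B] {
    def compute(): Seq[B] = parent.compute().map(f)
    def lineage: List[String] = s"MappedRDD(func = $name)" :: parent.lineage
  }

  // Made-up, tab-separated log lines standing in for the HDFS file.
  val logLines: Seq[String] = Seq(
    "ERROR\tnode1\tdisk full",
    "INFO\tnode2\tok",
    "ERROR\tnode3\ttimeout"
  )

  val source: ToyRDD[String] = SourceToyRDD("hdfs://...", () => logLines)
  val errors: ToyRDD[String] =
    FilteredToyRDD(source, "startsWith(...)", (s: String) => s.startsWith("ERROR"))
  val msgs: ToyRDD[String] =
    MappedToyRDD(errors, "split(...)", (s: String) => s.split("\t")(2))

  def main(args: Array[String]): Unit = {
    // Print the chain from the newest node back to the source.
    msgs.lineage.foreach(println)

    // Simulate a failure: the materialized result is lost ...
    var cached: Option[Seq[String]] = Some(msgs.compute())
    cached = None
    // ... and is rebuilt from the lineage instead of from a replica.
    val recovered = cached.getOrElse(msgs.compute())
    println(recovered)
  }
}
```

The point of the design is that nothing but the source data and the recorded functions is needed for recovery, which is why the chain is listed from `MappedRDD` back to `HadoopRDD`.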



Fault Recovery Test

Example


[Image: fault recovery test example]