
Basic concepts
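1. RDD - Resilient Distributed Dataset
An RDD (Resilient Distributed Dataset) is Spark's core data abstraction: an immutable, partitioned collection of records that can be processed in parallel across a cluster. An RDD is created either from stable storage or by transforming another RDD, and a lost partition can be recomputed from the lineage of transformations that produced it.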
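To make laziness and lineage concrete, here is a minimal single-machine sketch in plain Python. `ToyRDD` and all of its methods are hypothetical names invented for illustration; this is not Spark's API, only a toy model of how a derived dataset can remember its parent and recompute a partition on demand.

```python
from typing import Callable, List, Optional


class ToyRDD:
    """A toy, single-machine stand-in for an RDD (hypothetical, not Spark's API)."""

    def __init__(self, partitions: List[list],
                 parent: Optional["ToyRDD"] = None,
                 fn: Optional[Callable[[list], list]] = None,
                 name: str = "source"):
        self._partitions = partitions  # real data is held only by the source
        self.parent = parent           # lineage: the dataset this one derives from
        self.fn = fn                   # per-partition function applied to the parent
        self.name = name

    def map(self, f: Callable) -> "ToyRDD":
        # Lazy: records the transformation, computes nothing yet.
        return ToyRDD([], self, lambda part: [f(x) for x in part], "map")

    def filter(self, pred: Callable) -> "ToyRDD":
        return ToyRDD([], self, lambda part: [x for x in part if pred(x)], "filter")

    def num_partitions(self) -> int:
        rdd = self
        while rdd.parent is not None:
            rdd = rdd.parent
        return len(rdd._partitions)

    def compute(self, index: int) -> list:
        """Recompute one partition by walking the lineage back to the source."""
        if self.parent is None:
            return list(self._partitions[index])
        return self.fn(self.parent.compute(index))

    def collect(self) -> list:
        """The 'action': only here is any partition actually computed."""
        out: list = []
        for i in range(self.num_partitions()):
            out.extend(self.compute(i))
        return out

    def lineage(self) -> List[str]:
        names, rdd = [], self
        while rdd is not None:
            names.append(rdd.name)
            rdd = rdd.parent
        return list(reversed(names))


numbers = ToyRDD([[1, 2, 3], [4, 5, 6]])
evens_squared = numbers.filter(lambda x: x % 2 == 0).map(lambda x: x * x)
print(evens_squared.lineage())  # ['source', 'filter', 'map']
print(evens_squared.collect())  # [4, 16, 36]
```

Because each partition is derived only from the matching parent partition, losing one partition means recomputing just that one, which is the intuition behind the "resilient" in the name.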
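2. DAG - Directed Acyclic Graph
A DAG (Directed Acyclic Graph) is how Spark represents a computation before running it. The DAG records the dependencies between RDDs, and the scheduler uses those dependencies to decide what has to be computed and in what order.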
3. Executor
An Executor is a process that runs on a worker node (Worker Node). It executes the tasks assigned to it and stores data for the application in memory or on disk.
4. Application
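An Application is a user-written Spark program, for example WordCount. A Spark application consists of one or more Jobs; each Job is divided into stages, also called Task Sets; and each Task Set consists of multiple Tasks that can run in parallel.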
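The WordCount example can be sketched without a cluster to show how one job breaks down into stages and tasks. The code below is plain Python, not Spark code; `map_task`, `shuffle`, `reduce_task`, and `word_count` are hypothetical helper names, and the two-stage layout is an assumption modeled on how a word count is usually structured (a map side, a shuffle, and a reduce side).

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Pair = Tuple[str, int]


def map_task(lines: List[str]) -> List[Pair]:
    """One task of the first stage: turn a partition of lines into (word, 1) pairs."""
    return [(word, 1) for line in lines for word in line.split()]


def shuffle(pairs_per_task: List[List[Pair]], num_reducers: int) -> List[List[Pair]]:
    """Route every pair to a reducer bucket chosen by hashing its key."""
    buckets: List[List[Pair]] = [[] for _ in range(num_reducers)]
    for pairs in pairs_per_task:
        for word, count in pairs:
            buckets[hash(word) % num_reducers].append((word, count))
    return buckets


def reduce_task(pairs: List[Pair]) -> Dict[str, int]:
    """One task of the second stage: sum the counts for each word in a bucket."""
    totals: Dict[str, int] = defaultdict(int)
    for word, count in pairs:
        totals[word] += count
    return dict(totals)


def word_count(partitions: List[List[str]], num_reducers: int = 2) -> Dict[str, int]:
    stage1 = [map_task(p) for p in partitions]  # one task per input partition
    buckets = shuffle(stage1, num_reducers)     # boundary between the two stages
    result: Dict[str, int] = {}
    for bucket in buckets:                      # one task per reducer bucket
        result.update(reduce_task(bucket))
    return result


partitions = [
    ["spark makes rdd", "rdd builds dag"],
    ["dag splits into stage", "stage runs task"],
]
counts = word_count(partitions)
print(sorted(counts.items()))
```

Each word lands in exactly one bucket, so the per-bucket results never overlap and can simply be merged.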
5. Task
A Task is the unit of work that runs on an Executor. Tasks carry out the operations that Spark programs are built from, such as Join, Filter, and Sort.
6. Job
A Job is part of an Application. It covers a set of RDDs together with the operations applied to them, and it is triggered when an action is called on an RDD.
7. SparkContext
SparkContext is created by the Driver and is the entry point of an Application. SparkContext handles communication with the cluster manager, requests resources, and assigns and monitors tasks.
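2. Spark architecture
Spark follows a master/worker design centered on the Driver. The Driver runs the application's main program, requests resources from the cluster manager, and schedules tasks onto the Executors running on worker nodes.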
3. Spark execution flow
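1. Build the DAG
SparkContext builds a DAG from the dependencies between RDDs. The DAG captures which RDD is derived from which, and therefore what has to be computed to produce the final result.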
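2. Split into Stages
The DAG is then parsed and split into Stages. Boundaries fall at shuffle (wide) dependencies; operations connected by narrow dependencies are pipelined inside the same Stage, and each Stage becomes a set of tasks.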
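Splitting a DAG into stages can be sketched as a backward walk from the final dataset that cuts the graph wherever a wide dependency is crossed. This is a simplified illustration in plain Python, not Spark's scheduler code; `Node`, `split_stages`, and the `"narrow"`/`"wide"` labels are hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple


@dataclass
class Node:
    name: str
    # Each dependency is (parent node, "narrow" or "wide").
    deps: List[Tuple["Node", str]] = field(default_factory=list)


def split_stages(final: Node) -> List[List[str]]:
    """Return stages in execution order; each stage is a list of node names."""
    stages: List[List[str]] = []
    built: Set[str] = set()

    def build(root: Node) -> None:
        if root.name in built:
            return
        built.add(root.name)
        members: List[str] = []
        upstream: List[Node] = []  # parents reached across a wide dependency
        seen: Set[str] = set()
        stack = [root]
        while stack:
            node = stack.pop()
            if node.name in seen:
                continue
            seen.add(node.name)
            members.append(node.name)
            for parent, kind in node.deps:
                if kind == "narrow":
                    stack.append(parent)     # same stage: keep walking back
                else:
                    upstream.append(parent)  # shuffle boundary: new stage
        for parent in upstream:
            build(parent)                    # parent stages must run first
        # Members were collected from output back to input, so reverse them.
        stages.append(list(reversed(members)))

    build(final)
    return stages


lines = Node("lines")
words = Node("words", [(lines, "narrow")])
pairs = Node("pairs", [(words, "narrow")])
counts = Node("counts", [(pairs, "wide")])  # e.g. a reduce-by-key step
formatted = Node("formatted", [(counts, "narrow")])

print(split_stages(formatted))
# [['lines', 'words', 'pairs'], ['counts', 'formatted']]
```

Everything up to the shuffle forms the first stage, and the aggregation plus whatever follows it through narrow dependencies forms the second.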
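3. Schedule tasks
Each Stage is submitted to the task scheduler as a set of tasks, and the scheduler assigns those tasks to Executors. Where possible, Spark places a task on a node that already holds the data it needs, so that computation moves to the data rather than the other way around.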
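4. Run tasks and return results
Executors run the tasks and report their results to the Task Scheduler, which passes them on to the DAG Scheduler. When the job completes, the final results are made available through SparkContext or written to external storage such as HDFS.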
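Throughout this flow, SparkContext acts as the coordinator between the application and the cluster: it requests resources, dispatches tasks, and monitors their execution.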
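The last step, where workers run tasks in parallel and hand results back to the driver, can be imitated on one machine with a thread pool. This is only an analogy, assuming one task per partition; `run_stage` is a hypothetical helper, and threads here stand in for Executors.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def run_stage(task: Callable[[list], int],
              partitions: List[list],
              num_executors: int = 2) -> List[int]:
    """Run one task per partition on a worker pool and gather the results."""
    with ThreadPoolExecutor(max_workers=num_executors) as pool:
        # pool.map returns results in partition order, whichever task finishes first.
        return list(pool.map(task, partitions))


partitions = [[1, 2, 3], [4, 5], [6]]
partial_sums = run_stage(sum, partitions)  # each "task" sums one partition
total = sum(partial_sums)                  # the "driver" combines partial results
print(partial_sums, total)                 # [6, 9, 6] 21
```

With three partitions and two workers, the third task waits for a free worker, much as tasks wait for a free slot on an Executor.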
