Number of tasks and maximum task concurrency in a Spark executor

For the concepts of executor and task, see the official documentation.
The source code discussed in this article is from Spark 2.0.0.

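Number of tasks

The submitMissingTasks method of the DAGScheduler class shows that a stage creates one task for each partition that still needs to be computed. In other words, each task processes exactly one partition.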

//From submitMissingTasks
......   
val tasks: Seq[Task[_]] = try {
      stage match {
        case stage: ShuffleMapStage =>
          partitionsToCompute.map { id =>
            val locs = taskIdToLocations(id)
            val part = stage.rdd.partitions(id)
            new ShuffleMapTask(stage.id, stage.latestInfo.attemptId,
              taskBinary, part, locs, stage.latestInfo.taskMetrics, properties)
          }

        case stage: ResultStage =>
          val job = stage.activeJob.get
          partitionsToCompute.map { id =>
            val p: Int = stage.partitions(id)
            val part = stage.rdd.partitions(p)
            val locs = taskIdToLocations(id)
            new ResultTask(stage.id, stage.latestInfo.attemptId,
              taskBinary, part, locs, id, properties, stage.latestInfo.taskMetrics)
          }
      }
    }
......
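
To make the one-task-per-partition rule concrete, here is a minimal sketch in plain Scala. It does not use Spark's classes: ToyPartition, ToyTask and buildTasks are hypothetical names invented for illustration, and the only thing carried over from the excerpt above is the shape partitionsToCompute.map { id => ... }.

```scala
// A toy model, not Spark's real classes: every partition that still
// needs computing yields exactly one task.
object TaskPerPartitionSketch {
  // Hypothetical stand-ins for Spark's Partition and Task types.
  final case class ToyPartition(index: Int)
  final case class ToyTask(stageId: Int, partition: ToyPartition)

  // Mirrors the shape of: partitionsToCompute.map { id => new ...Task(...) }
  def buildTasks(stageId: Int,
                 partitions: IndexedSeq[ToyPartition],
                 partitionsToCompute: Seq[Int]): Seq[ToyTask] =
    partitionsToCompute.map(id => ToyTask(stageId, partitions(id)))

  def main(args: Array[String]): Unit = {
    val partitions = (0 until 4).map(ToyPartition(_))
    // Assume partition 1 was already computed, so only 0, 2 and 3 are missing.
    val tasks = buildTasks(0, partitions, Seq(0, 2, 3))
    println(s"tasks created: ${tasks.size}")
    tasks.foreach(println)
  }
}
```

The number of tasks equals the number of partition ids passed in, so a stage whose partitions are all missing gets as many tasks as the RDD has partitions.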

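Maximum task concurrency

When tasks are assigned to an executor, the number of CPU cores available to that executor determines how many tasks it can run at the same time. The resourceOfferSingleTaskSet method of the TaskSchedulerImpl class compares the available cores against CPUS_PER_TASK, which TaskSchedulerImpl defines as val CPUS_PER_TASK = conf.getInt("spark.task.cpus", 1). By default, therefore, one task occupies one CPU core. If an executor has 8 cores available, it runs at most 8 tasks concurrently; if spark.task.cpus is set to 2, it can run only 4 tasks at a time.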

//From resourceOfferSingleTaskSet
......
      if (availableCpus(i) >= CPUS_PER_TASK) {
        try {
          for (task <- taskSet.resourceOffer(execId, host, maxLocality)) {
            tasks(i) += task
            val tid = task.taskId
            taskIdToTaskSetManager(tid) = taskSet
            taskIdToExecutorId(tid) = execId
            executorIdToTaskCount(execId) += 1
            executorsByHost(host) += execId
            availableCpus(i) -= CPUS_PER_TASK
            assert(availableCpus(i) >= 0)
            launchedTask = true
          }
        } catch {
          case e: TaskNotSerializableException =>
            logError(s"Resource offer failed, task set ${taskSet.name} was not serializable")
            // Do not offer resources for this task, but don't throw an error to allow other
            // task sets to be submitted.
            return launchedTask
        }
      }
......
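
The arithmetic behind this limit can be sketched in plain Scala. This is a simplified model rather than Spark's scheduler: maxConcurrentTasks and launchable are hypothetical helpers, and the loop only imitates the availableCpus(i) >= CPUS_PER_TASK check and the availableCpus(i) -= CPUS_PER_TASK bookkeeping from the excerpt above.

```scala
// A simplified model of per-executor task slots; not Spark's scheduler code.
object TaskSlotsSketch {
  // Upper bound on concurrently running tasks for one executor
  // (integer division, so leftover cores that cannot fit a task are unused).
  def maxConcurrentTasks(executorCores: Int, cpusPerTask: Int): Int = {
    require(cpusPerTask > 0, "cpusPerTask must be positive")
    executorCores / cpusPerTask
  }

  // Toy offer loop: keep launching pending tasks while enough cores remain,
  // subtracting cpusPerTask from the free cores for every launch.
  def launchable(availableCores: Int, cpusPerTask: Int, pendingTasks: Int): Int = {
    require(cpusPerTask > 0, "cpusPerTask must be positive")
    var free = availableCores
    var launched = 0
    while (launched < pendingTasks && free >= cpusPerTask) {
      free -= cpusPerTask
      launched += 1
    }
    launched
  }

  def main(args: Array[String]): Unit = {
    println(maxConcurrentTasks(8, 1)) // 8 cores, spark.task.cpus = 1
    println(maxConcurrentTasks(8, 2)) // 8 cores, spark.task.cpus = 2
    println(launchable(8, 2, 10))     // more pending tasks than slots
  }
}
```

With 8 cores the two calls to maxConcurrentTasks give 8 and 4, matching the figures in the text, and launchable stops at 4 even though 10 tasks are pending.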

Difference between a YARN task and a Spark task

Starting a map task or reduce task on a YARN NodeManager node physically starts a separate JVM process, whereas a Spark task is a thread inside the Executor process.
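
The thread-based model can be illustrated with a small sketch that runs several toy tasks on a thread pool inside a single JVM. It uses only java.util.concurrent; runTasks is a hypothetical helper, and the fixed pool size is an assumption made for illustration, not a claim about how Spark sizes its executor thread pool.

```scala
import java.util.concurrent.{Callable, Executors}

// Toy illustration: many tasks share one JVM process and run as threads.
object TasksAsThreadsSketch {
  // Runs numTasks toy tasks on a pool of `slots` threads and returns
  // the name of the thread that executed each task.
  def runTasks(numTasks: Int, slots: Int): Seq[String] = {
    val pool = Executors.newFixedThreadPool(slots)
    try {
      val futures = (0 until numTasks).map { _ =>
        pool.submit(new Callable[String] {
          override def call(): String = Thread.currentThread().getName
        })
      }
      // Block until every task has finished and collect its thread name.
      futures.map(_.get())
    } finally {
      pool.shutdown()
    }
  }

  def main(args: Array[String]): Unit = {
    val threadNames = runTasks(8, 4)
    println(s"tasks run: ${threadNames.size}")
    println(s"distinct threads used: ${threadNames.distinct.size}")
  }
}
```

All eight tasks finish inside the same process, and no more than four threads are ever used, so no new JVM is started per task.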
