pyspark.RDDBarrier.mapPartitionsWithIndex¶

RDDBarrier。 mapPartitionsWithIndex ( f:可調用的((int,Iterable(T]],Iterable(U]],preservesPartitioning:bool=假 )→pyspark.rdd.RDD(U] ¶

通過應用一個函數返回一個新的抽樣的每個分區包裝抽樣,而追蹤指數的原始分區。和所有任務都推出了在舞台上的障礙。接口是一樣的RDD.mapPartitionsWithIndex ()。請查看API文檔。

筆記

這個API是實驗

以前的

pyspark.RDDBarrier.mapPartitions

下一個

pyspark.BarrierTaskContext.allGather