addFile
addFile可以把文件分發(fā)到各個worker當中,然后worker會把文件存放在臨時目錄下。之后可以通過SparkFiles.get()獲取文件
代碼
from pyspark import SparkFiles
from pyspark.sql import SparkSession
# 創(chuàng)建SparkSession
spark = SparkSession.builder\
.master("local")\
.appName("cal person")\
.getOrCreate()
# 添加文件
spark.sparkContext.addFile("./dict/load_dict.txt")
# 獲取文件
with open(SparkFiles.get("load_dict.txt"), "r") as f:
print(f.read())