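Goal:
Merge two tables on the Chapter key and save the result as genSum.csv.
Approach:
First extract the keyword from the Name column of the CSV table and copy it into a new Chapter column. Then merge the result with the second table on Chapter, and finally save it as genSum.csv.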
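The approach above can be sketched on small in-memory tables. This is only a minimal illustration: the sample rows, the `Hours` column, and the helper name `extract_chapter` are hypothetical, and it assumes each Name looks like `[tag] ChapterNN ...`, so that the keyword is the 9 characters starting two positions after `]`.

```python
import pandas as pd


def extract_chapter(name: str) -> str:
    """Return the 9-character keyword that starts 2 positions after ']'."""
    name = name.strip()
    pos = name.index("]")
    return name[pos + 2:pos + 11]


# Hypothetical stand-in for task.csv
tasks = pd.DataFrame({
    "Name": ["[A] Chapter01 read the docs", "[B] Chapter02 write tests"],
})
# Hypothetical stand-in for the second table
plan = pd.DataFrame({"Chapter": ["Chapter01"], "Hours": [3]})

# Copy the extracted keyword into a new Chapter column
tasks["Chapter"] = tasks["Name"].map(extract_chapter)

# Left join: every row of tasks is kept, unmatched rows get NaN
merged = tasks.merge(plan, how="left", on="Chapter")
print(merged)
```

A left join is used so that tasks without an entry in the second table still appear in the output, with empty values in the columns that come from that table.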

Implementation code:
__author__ = 'cllea'
import pandas as pd
import numpy as np
df = pd.read_csv("task.csv")
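# Extract the Name column
s = df["Name"]
# Convert it to a list
listName = s.tolist()
# Rewrite each string in the list: keep the 9 characters
# that start 2 positions after ']'
for i, v in enumerate(listName):
    # Strip first so that the position of ']' refers to the string being sliced
    v = v.strip()
    listName[i] = v[v.index(']') + 2:v.index(']') + 11]
# print(listName)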
# Convert the list to a DataFrame with a single Chapter column
data = pd.DataFrame(listName, columns=['Chapter'])
# print(data)
# Concatenate the two DataFrames column-wise
dfA = pd.concat([df, data], axis=1)
# print(dfA)
# Load the second table to merge with
dfB = pd.read_excel("myplan.xlsx")
# print(dfB)
# Left join on the Chapter key column (all rows of the left table dfA are kept)
dfC = pd.merge(dfA, dfB, how='left', on=['Chapter'])
# print(dfC)
# Save to CSV (utf_8_sig writes a BOM so that Excel detects UTF-8)
dfC.to_csv('genSum.csv', chunksize=10, encoding="utf_8_sig")
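
Because the join is a left join, rows of dfA whose Chapter has no match in dfB are kept with empty values. One possible way to spot them before saving is the `indicator` parameter of `pd.merge`. The sketch below is not part of the original script and uses hypothetical sample data in place of task.csv and myplan.xlsx.

```python
import pandas as pd

# Hypothetical stand-ins for dfA and dfB
dfA = pd.DataFrame({"Chapter": ["Chapter01", "Chapter02", "Chapter03"]})
dfB = pd.DataFrame({"Chapter": ["Chapter01", "Chapter02"],
                    "Plan": ["week 1", "week 2"]})

# indicator=True adds a _merge column: 'both', 'left_only' or 'right_only'
checked = pd.merge(dfA, dfB, how="left", on=["Chapter"], indicator=True)

# Chapters present in dfA but missing from dfB
unmatched = checked.loc[checked["_merge"] == "left_only", "Chapter"].tolist()
print(unmatched)
```

If the list is not empty, the extracted keywords and the Chapter values in the second table probably differ in spelling or spacing.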