Python的數(shù)據(jù)分析,大部分的教程都是想講numpy,再講Dataframe,再講讀取文件。但我看書的時(shí)候,前面二章看的實(shí)在頭暈,所以,我們還是通過讀取文件來開始我們的Python數(shù)據(jù)分析吧。
讀取CSV
- 讀取csv通過read_csv讀取
import pandas as pd
zhuanti = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'))
zhuanti

- 設(shè)置第一列為索引
import pandas as pd
zhuanti1 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),index_col=0)
zhuanti1

- 設(shè)置header,這里把header去掉
import pandas as pd
zhuanti2 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),index_col=0,header=None)
zhuanti2

- 跳過前2行
import pandas as pd
zhuanti3 = pd.read_csv(open('C:/Users/luopan/Desktop/xiaozhu.csv',encoding='utf-8'),skiprows=[1,2],index_col=0)
zhuanti3

讀取Excel
- 利用read_excel讀取excel文件
import pandas as pd
test = pd.read_excel('C:/Users/luopan/Desktop/test.xlsx',sheetname='Sheet2',header=None)
test

讀取MySQL
import pandas as pd
import pymysql
conn = pymysql.connect(host='localhost', user='root', passwd='123456', db='test', port=3306, charset='utf8')
jianshu = pd.read_sql('select * from jianshu1',conn)
jianshu

讀取MongoDB
import pandas as pd
import pymongo
client = pymongo.MongoClient('localhost',port = 27017)
test = client['test']
tieba = test['tieba']
data = pd.DataFrame(list(tieba.find()))
data
