You may run into a few problems during deployment. This section summarizes them.
1. Viewing logs
First, this section covers how to query k8s logs.
View k8s error logs:
journalctl -xe
Check the status of a specific component, using a pod as an example:
kubectl describe pod <pod-name>
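The two commands above can be combined with a few related ones into a first-pass diagnostic helper. This is a minimal sketch: the `run` and `diagnose_pod` function names and the `DRY_RUN` variable are illustrative assumptions, not part of the original procedure, and it assumes the kubelet runs as a systemd unit named `kubelet`.

```shell
# Echo the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then echo "$*"; else "$@"; fi
}

# Collect the usual first-pass diagnostics for one pod.
# $1: pod name, $2: namespace (defaults to "default")
diagnose_pod() {
  pod="$1"; ns="${2:-default}"
  run journalctl -u kubelet --no-pager -n 50       # recent kubelet logs
  run kubectl describe pod "$pod" -n "$ns"         # pod status and events
  run kubectl logs "$pod" -n "$ns" --tail=50       # container output
  run kubectl get events -n "$ns" --sort-by=.metadata.creationTimestamp
}

# Example: diagnose_pod <pod-name> kube-system
```

Setting `DRY_RUN=1` prints the commands without running them, which is handy for checking what would be executed on a node.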
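2. Docker image pull timeouts
Because of network issues, image pulls can time out. In that case, pull the image from another registry to the local machine and tag it so that it stands in for the remote image. For example, while deploying a worker node, pulling k8s.gcr.io/pause:3.6 failed with ErrImagePull, and the logs showed that k8s.gcr.io was unreachable.
Pull the image from another repository and tag it locally. The image has to be pulled and tagged on every node.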
docker pull rancher/pause:3.6
docker tag rancher/pause:3.6 k8s.gcr.io/pause:3.6
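Since the pull-and-tag step has to be repeated on every node, it can be wrapped in a small function. This is a sketch under assumptions: the `retag_image` name and the `DRY_RUN` switch are hypothetical helpers introduced here for illustration.

```shell
# Pull an image from a reachable mirror and retag it under the name
# the cluster expects. Set DRY_RUN=1 to print the commands instead of
# running them.
# $1: mirror image, $2: image name expected by the cluster
retag_image() {
  mirror="$1"; target="$2"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "docker pull ${mirror}"
    echo "docker tag ${mirror} ${target}"
    return 0
  fi
  docker pull "${mirror}" && docker tag "${mirror}" "${target}"
}

# Run on every node (master and workers):
# retag_image rancher/pause:3.6 k8s.gcr.io/pause:3.6
```

Note that the tag only exists in the local image store of the node where it was created, which is why the step cannot be done once for the whole cluster.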
For reference, these are the images pulled while working through this article:
docker pull registry.cn-hangzhou.aliyuncs.com/google_containers/nginx-ingress-controller:v1.1.0
docker pull registry.cn-hangzhou.aliyuncs.com/google_containers/kube-webhook-certgen:v1.1.1
docker pull mirrorgooglecontainers/pause:3.6
docker pull k8s.gcr.io/pause:3.6
docker pull rancher/pause:3.6
docker pull gotok8s/kube-proxy:v1.23.1
3. The connection to the server localhost:8080 was refused
The connection to the server localhost:8080 was refused - did you specify the right host or port?
To fix this, run the following command:
export KUBECONFIG=/etc/kubernetes/admin.conf
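The `export` above only lasts for the current shell session, and `/etc/kubernetes/admin.conf` is normally readable only by root. For a regular user, a persistent alternative is to copy the file to `$HOME/.kube/config`, which is what `kubeadm init` suggests in its output. The sketch below wraps that in a function; the `install_kubeconfig` name and its arguments are assumptions made for illustration.

```shell
# Copy a kubeconfig to the location kubectl reads by default.
# $1: source file (defaults to /etc/kubernetes/admin.conf)
# $2: destination  (defaults to $HOME/.kube/config)
install_kubeconfig() {
  src="${1:-/etc/kubernetes/admin.conf}"
  dest="${2:-$HOME/.kube/config}"
  mkdir -p "$(dirname "$dest")" || return 1
  if [ -r "$src" ]; then
    cp "$src" "$dest" || return 1
  else
    # The source is root-only: copy with sudo, then hand it to the current user.
    sudo cp "$src" "$dest" || return 1
    sudo chown "$(id -u):$(id -g)" "$dest" || return 1
  fi
  chmod 600 "$dest"
}

# Example: install_kubeconfig
```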
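4. cgroup driver problem
Docker's default cgroup driver may not be suitable. It generally needs to be set to systemd, and the k8s startup parameters modified to match.
Run docker info to check the current cgroup driver.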
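The check can be scripted by extracting the `Cgroup Driver` line from the `docker info` output. This is a minimal sketch; the function names `parse_cgroup_driver` and `check_cgroup_driver` are hypothetical helpers, and it assumes the plain-text output format of `docker info`.

```shell
# Extract the driver name from `docker info` text output on stdin.
parse_cgroup_driver() {
  awk -F': *' '/Cgroup Driver/ { print $2; exit }'
}

# Report whether the driver in use is systemd.
# $1: driver name, e.g. the output of parse_cgroup_driver
check_cgroup_driver() {
  if [ "$1" = "systemd" ]; then
    echo "ok: cgroup driver is systemd"
  else
    echo "mismatch: cgroup driver is '$1', expected systemd"
    return 1
  fi
}

# On a node:
# check_cgroup_driver "$(docker info 2>/dev/null | parse_cgroup_driver)"
```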

Edit /etc/systemd/system/kubelet.service.d/10-kubeadm.conf and add the environment variable parameter.
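The exact line added to the drop-in is not shown in the original text. The sketch below therefore only checks whether the file already sets a `--cgroup-driver` flag, and mentions one possible form of the line in a comment as an assumption. The `has_cgroup_driver_flag` function name is hypothetical, and the variable name in the example line may differ depending on the kubeadm version.

```shell
# Return success if the given kubelet drop-in already sets --cgroup-driver.
# $1: path to the drop-in file
has_cgroup_driver_flag() {
  grep -q -e '--cgroup-driver=' "$1"
}

CONF=/etc/systemd/system/kubelet.service.d/10-kubeadm.conf

# Assumed example of a line to add under [Service] (not from the original article):
#   Environment="KUBELET_EXTRA_ARGS=--cgroup-driver=systemd"
#
# On a node:
# has_cgroup_driver_flag "$CONF" || echo "no --cgroup-driver flag in $CONF"
```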

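Reload the systemd configuration, then restart docker and the kubelet so that the changes take effect
sudo systemctl daemon-reload
sudo systemctl restart docker
sudo systemctl restart kubelet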
5. Unknown hostname warning
Add a mapping for the hostname in /etc/hosts
127.0.0.1 master-node
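To confirm that the mapping is in place, the hosts file can be checked for the node's hostname. This is a sketch only; `hosts_has_name` is a hypothetical helper, and `master-node` is the hostname used in the example above.

```shell
# Return success if hosts-format file $1 maps some address to hostname $2.
hosts_has_name() {
  awk -v name="$2" '
    /^[[:space:]]*#/ { next }   # skip comment lines
    { for (i = 2; i <= NF; i++) if ($i == name) found = 1 }
    END { exit (found ? 0 : 1) }
  ' "$1"
}

# On a node:
# hosts_has_name /etc/hosts "$(hostname)" || echo "no /etc/hosts entry for $(hostname)"
```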