KubeSphere 2.0.2 云平台安装测试结果

rayzhou2017 commented 5 years ago

Three install options:

online
offline
KubeSphere only, i.e., install KubeSphere on existing k8s cluster or k8s-based distribution.

All are 64 bit OSes (x86)

	qingcloud	huawei cloud	aliyun	tencent cloud	aws	Google Cloud
Ubuntu 16.04 online allinone	√	√2	√	√4	√
Ubuntu 16.04 online 1master 2nodes	√	√2	√	√4	√
Ubuntu 18.04 online allinone	√	√2	√	√	√	√
Ubuntu 18.04 online 1master 2nodes	√	√2	√	√	√
CentOS 7.6 online allinone	√	√	√ 5	√	√1
CentOS 7.6 online 1master 2nodes	√	√	√5	√	√1
RHEL 7.4 online allinone	√				√3
RHEL 7.4 online 1master 2nodes	√				√3

Image from AWS Marketplace
huawei cloud安装报错及解决方法：https://github.com/kubesphere/ks-installer/issues/19 (pip安装) https://github.com/kubesphere/ks-installer/issues/20 (coredns无法启动)
image from AWS 社区AMI，online失败，需要用offline安装包且注释掉ebtables https://github.com/kubesphere/ks-installer/issues/23#issuecomment-523239730
tencent cloud中ubuntu16.04镜像较旧(ubuntu16.04.1)，可能会遇到 illegal instruction (core dump)，规避方法可参考：https://github.com/kubesphere/ks-installer/issues/21
aliyun中centos7安装时可能会遇到pip版本问题，解决方法可参考：https://github.com/kubesphere/ks-installer/issues/16

mayuhope commented 5 years ago

腾讯云测试结果： 1、Ubuntu 16.04.1安装在线版单节点和多节点：均失败，issue：https://github.com/kubesphere/ks-installer/issues/21

2、Ubuntu 18.04.1安装在线版单节点：成功 Ubuntu 18.04.1安装在线版多节点：成功

3、Centos7.5、Centos 7.6 安装在线版单节点：成功 Centos 7.6安装在线版多节点：成功

4 、Ubuntu18.04.1 、Ubuntu 16.04.1、Ubuntu 14.04.1（不支持离线部署）

5、缺少RHEL 7.4这个操作系统版本

Forest-L commented 5 years ago

阿里云： 1、ubuntu16.04.6 单节点在线：OK；

2、ubuntu16.04.6 单节点/多节点离线：OK，需要添加对应系统版本软件依赖包，issue：https://github.com/kubesphere/ks-installer/issues/22

3、centos7.5 单节点/多节点：OK，系统pip版本问题，处理方法参考issue：https://github.com/kubesphere/ks-installer/issues/16

4、centos7.6 单节点离线：OK 系统pip版本问题，处理方法参考issue：https://github.com/kubesphere/ks-installer/issues/16 需要添加对应系统版本软件依赖包，issue：https://github.com/kubesphere/kubesphere/issues/630

5、 rhel 7.4 单节点在线：Failure

6、ubuntu18.04.1单节点在线：OK

7、centos7.6单节点在线：OK 系统pip版本问题，处理方法参考issue：https://github.com/kubesphere/ks-installer/issues/16

8、centos7.6多节点在线：OK 系统pip版本问题，处理方法参考issue：https://github.com/kubesphere/ks-installer/issues/16

9、ubuntu16.04.6多节点在线：OK

10、ubuntu18.04多节点在线：OK

rayzhou2017 commented 5 years ago

阿里云： centos7.5和centos7.6 单节点/多节点offline 替换了centos-7.5-amd64.iso和centos-7.6-amd64.iso 成功 centos7.5和centos7.6 online 成功

ubuntu16.04.6 单节点/多节点offline 增加ubuntu-16.04.6-server-amd64.iso 成功 Ubuntu16.04.6 单节点 online 成功

这个测试我都看不懂，郭峰那个就很容易看懂

rayzhou2017 commented 5 years ago

aws rhel 7.4 online alline 出错 Screen Shot 2019-08-20 at 1 58 56 PM

pixiake commented 5 years ago

rhel 7.4由于官方yum源中没有安装脚本中指定的相关软件，所以导致安装失败

可以更换或者添加centos7的yum源尝试解决
下个版本中会对该问题做相关处理

pixiake commented 5 years ago

aws rhel 7.4 offline alline 出错

该问题为当前系统中没有预装ebtables，而安装包中使用的软件依赖包也缺少ebtables，导致安装失败

临时解决办法： vi ../k8s/roles/kubernetes/preinstall/vars/redhat.yml 注释掉ebtables

ubuntu的话修改ubuntu.yml

或者单独下载ebtables安装

Forest-L commented 5 years ago

vmware和各个平台机器对比： 1、在青云和阿里云centos7.6 离线install版本可以正常部署； 2、而同样的离线安装包在vmware机器上还报有依赖问题； vmware弄的机器，开发者很难有8核以上的机器。

hing007210 commented 5 years ago

TASK [ks-devops/ks-devops : OpenPitrix | Waiting for openpitrix-db] *** Tuesday 10 September 2019 11:29:55 +0800 (0:00:00.396) 0:04:18.141 * FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (15 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (14 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (13 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (12 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (11 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (10 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (9 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (8 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (7 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (6 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (5 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (4 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (3 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (2 retries left). FAILED - RETRYING: OpenPitrix | Waiting for openpitrix-db (1 retries left). fatal: [master1]: FAILED! => {"attempts": 15, "changed": true, "cmd": "/usr/local/bin/kubectl -n openpitrix-system get pod | grep openpitrix-db-deployment | awk '{print $3}'", "delta": "0:00:00.113061", "end": "2019-09-10 11:32:28.753923", "rc": 0, "start": "2019-09-10 11:32:28.640862", "stderr": "", "stderr_lines": [], "stdout": "Pending", "stdout_lines": ["Pending"]}

hing007210 commented 5 years ago

在安装时报错。

Forest-L commented 5 years ago

@hing007210
1、麻烦额外建一个issue哦， 2、kubectl get pod -n openpitrix-system看下该namespace的pod是否正常启动， 3、然后kubectl describe pod -n openpitrix-system kubectl get pod -n openpitrix-system|grep openpitrix-db-deployment|awk '{print $1}' 查看pod的日志

Forest-L commented 5 years ago

经测试，通过以下方法可安装最新版本的pip（pip-19.2.3）可规避pip安装相关问题：

https://github.com/kubesphere/ks-installer/issues/16 https://github.com/kubesphere/ks-installer/issues/19
curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
sudo python get-pip.py

yeyouqun commented 5 years ago

我在安装时，出现这个问题： Wednesday 18 September 2019 01:24:28 -0400 (0:00:00.639) 0:02:32.879 *** fatal: [ks-allinone]: FAILED! => {"changed": true, "cmd": "/usr/local/bin/helm upgrade --install ks-sonarqube /etc/kubesphere/sonarqube/sonarqube-0.13.5.tgz -f /etc/kubesphere/sonarqube/custom-values-sonarqube.yaml --namespace kubesphere-devops-system", "delta": "0:00:00.203654", "end": "2019-09-18 01:24:29.367462", "msg": "non-zero return code", "rc": 1, "start": "2019-09-18 01:24:29.163808", "stderr": "Error: could not find a ready tiller pod", "stderr_lines": ["Error: could not find a ready tiller pod"], "stdout": "", "stdout_lines": []}

PLAY RECAP ** ks-allinone : ok=172 changed=7 unreachable=0 failed=1

superfrankiefan commented 5 years ago

AWS UBUNTU 16.04.6 LTS

install mode: Kubesphere all in one

Error Description:

Stucked in the following task:

TASK [download : Check the local digest sha256 corresponds to the given image tag] ** Wednesday 09 October 2019 03:08:42 +0000 (0:00:00.057) 0:08:20.102 *****

TASK [download : container_download | Download containers if pull is required or told to always pull (delegate)] **** Wednesday 09 October 2019 03:08:42 +0000 (0:00:00.053) 0:08:20.156 *****

TASK [download : container_download | Download containers if pull is required or told to always pull (all nodes)] *** Wednesday 09 October 2019 03:08:42 +0000 (0:00:00.036) 0:08:20.192 * FAILED - RETRYING: container_download | Download containers if pull is required or told to always pull (all nodes) (4 retries left). changed: [ks-allinone]

Cloud someone give some suggetions?

Thanks & Regards, Frankie

sheeplinjie commented 5 years ago

部署了快好几个小时了，镜像还没拉下来呢，太慢了就没有个线上存镜像的国内的仓库吗

inix commented 5 years ago

centos 7.5本地离线安装报错：

Thursday 24 October 2019  12:21:25 +0800 (0:00:00.370)       0:00:53.959 ******
fatal: [ks-allinone]: FAILED! => {"changed": true, "cmd": "/usr/local/bin/helm upgrade --install ks-sonarqube /etc/kubesphere/sonarqube/sonarqube-0.13.5.tgz -f /etc/kubesphere/sonarqube/custom-values-sonarqube.yaml --namespace kubesphere-devops-system", "delta": "0:00:02.181980", "end": "2019-10-24 12:21:28.168526", "msg": "non-zero return code", "rc": 1, "start": "2019-10-24 12:21:25.986546", "stderr": "Error: UPGRADE FAILED: Get https://10.233.0.1:443/api/v1/namespaces/kube-system/configmaps?labelSelector=NAME%!D(MISSING)ks-sonarqube%!C(MISSING)OWNER%!D(MISSING)TILLER%!C(MISSING)STATUS%!D(MISSING)DEPLOYED: dial tcp 10.233.0.1:443: connect: no route to host", "stderr_lines": ["Error: UPGRADE FAILED: Get https://10.233.0.1:443/api/v1/namespaces/kube-system/configmaps?labelSelector=NAME%!D(MISSING)ks-sonarqube%!C(MISSING)OWNER%!D(MISSING)TILLER%!C(MISSING)STATUS%!D(MISSING)DEPLOYED: dial tcp 10.233.0.1:443: connect: no route to host"], "stdout": "", "stdout_lines": []}

还有这个问题：

hursday 24 October 2019  14:23:33 +0800 (0:00:00.821)       0:11:49.833 ******
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (30 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (29 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (28 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (27 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (26 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (25 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (24 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (23 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (22 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (21 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (20 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (19 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (18 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (17 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (16 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (15 retries left).
FAILED - RETRYING: ks-alerting | Waiting for alerting-db-init (14 retries left).

AkideLiu commented 5 years ago

vSphere 6.7u2平台 CentOS7.7： 1.all-in-one在线部署：成功 2.在 Kubernetes 在线部署 KubeSphere ：成功 kubernetes version：v1.13.5

3.在 Kubernetes 在线部署 KubeSphere ：失败> https://github.com/kubesphere/ks-installer/issues/334 kubernetes version：v1.15.3 基于terraform-vsphere-kubespray部署的kubernetes https://github.com/sguyennet/terraform-vsphere-kubespray

4.all-in-one离线部署：失败因为系统是CentOS7.7，离线安装包里面只有CentOS 7.4 7.5 查看了kubespaere提供的镜像，不是官方标准镜像，所以没有继续测试。

xianyuwu commented 5 years ago

百度智能云 ubuntu18.04 在线安装成功。因为BCC配置比较低，所以有很多服务没启用。

focusonit commented 4 years ago

在金山云上安装2.02离线版，操作系统centos7.4，在etcd步骤出错：

TASK [etcd : Gen_certs | target ca-certificate store file] *** Tuesday 17 December 2019 08:50:10 +0800 (0:00:00.119) 0:01:08.026 ** ok: [node1] ok: [node2]

TASK [etcd : Gen_certs | add CA to trusted CA dir] *** Tuesday 17 December 2019 08:50:10 +0800 (0:00:00.104) 0:01:08.130 ** fatal: [node2]: FAILED! => {"changed": false, "msg": "Source /etc/ssl/etcd/ssl/ca.pem not found"} fatal: [node1]: FAILED! => {"changed": false, "msg": "Source /etc/ssl/etcd/ssl/ca.pem not found"}

NO MORE HOSTS LEFT ***

p-null commented 4 years ago

Hi, coming from the english documentation. Would it be better if we can change the issue title and content to be in English?

kubesphere / ks-installer

KubeSphere 2.0.2 云平台安装测试结果 #23