- 23 Oct, 2019 13 commits
-
Travis Nielsen authored
Set the v1.1.3 release tag
-
Travis Nielsen authored
With the v1.1.3 patch release we need to set the new tag in the deployment manifests and documentation.
Signed-off-by: Travis Nielsen <tnielsen@redhat.com>
-
mergify[bot] authored
ceph: osd remove --conf flag (bp #4161)
-
mergify[bot] authored
fix operator reconcile to restart osd daemons (bp #3996)
-
mergify[bot] authored
Enable restoring a cluster after disaster recovery (bp #4021)
-
Sébastien Han authored
Even legacy OSDs were still running with this flag. In my previous patch I tried to minimize the impact of the change because I was afraid the config file held important information, but that appears not to be true. The only option the config could hold is the keyring path, and that is already passed on the CLI startup line.
Closes: https://github.com/rook/rook/issues/4063
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 9f053ca4)
-
mergify[bot] authored
Ceph: Added removeOSDsIfOutAndSafeToRemove to Cluster CR (bp #4116)
-
travisn authored
If mon deployments need to be created, the operator should go ahead and create all of them instead of waiting for quorum in between each mon. In particular, if a cluster is being restored in a disaster recovery situation, none of the mon deployments will exist. All of them should be created before checking for quorum.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 53812c00)
-
travisn authored
Requests to enable or call mgr modules should time out rather than hang. For example, if a request to create a self-signed cert hangs, the mgr should continue with the other actions in the orchestration.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit a72a9816)
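
A rough shell-level analogue of the idea, purely illustrative (the operator applies the timeout internally; module name and bound are assumptions):

    # Bound a mgr module request instead of letting it hang indefinitely
    timeout 150 ceph mgr module enable dashboard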
-
travisn authored
Checking for clean PGs is a recurring event when reconciling nodes for upgrades and fencing. No need to log every request.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 43979747)
-
Santosh Pillai authored
OSD on PVC does not upgrade when the user upgrades the Ceph version in the cluster-on-pvc yaml. The solution includes:
- upgrade the osd prepare and daemon pods on upgrade
- skip the ceph-volume prepare if a filesystem is already present on the PVC device
- skip the lvm release in case of upgrade
Signed-off-by: Santosh Pillai <sapillai@redhat.com>
(cherry picked from commit 8ea693a7)
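
For context, the upgrade that should now roll the OSDs is just a Ceph image bump on the CephCluster CR; a minimal sketch, with an illustrative CR name, namespace, and target image:

    # Bump the Ceph image; the operator then updates the osd prepare jobs
    # and the osd daemon pods backed by PVCs
    kubectl -n rook-ceph patch cephcluster rook-ceph --type merge \
      -p '{"spec":{"cephVersion":{"image":"ceph/ceph:v14.2.4"}}}'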
-
rohan47 authored
OSDs can be removed automatically with the current mechanism if the new setting removeOSDsIfOutAndSafeToRemove is set to true. The default for all new or upgraded clusters should be false.
Signed-off-by: rohan47 <rohgupta@redhat.com>
(cherry picked from commit 7f9611d4)
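
A minimal sketch of opting in, assuming the setting sits at the top level of the cluster spec as in this release's example manifests:

    # Allow the operator to purge OSDs that are out and safe to remove
    kubectl -n rook-ceph patch cephcluster rook-ceph --type merge \
      -p '{"spec":{"removeOSDsIfOutAndSafeToRemove":true}}'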
-
Travis Nielsen authored
ceph: rework csi keys and secrets (bp #4086)
-
- 22 Oct, 2019 4 commits
-
mergify[bot] authored
[OSD on PVC] add kubernetes version check. (bp #4009)
-
Sébastien Han authored
We now create 4 secrets containing 4 Ceph keys, each of which has limited permissions to access the cluster.
Closes: https://github.com/rook/rook/issues/4074
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit fcc2e21f)
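
A quick way to inspect the result once the operator has reconciled, assuming the usual rook-ceph cluster namespace:

    # List the CSI secrets created by the operator
    kubectl -n rook-ceph get secrets | grep csi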
-
Sébastien Han authored
The command `fs new` already enables the application pool settings for the cephfs pools, so we don't need to run that command again.
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit eafc2a47)
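
For reference, a sketch of the redundancy that was removed, with illustrative pool names:

    # `fs new` already tags both pools with the cephfs application...
    ceph fs new myfs myfs-metadata myfs-data0
    # ...so this follow-up call is redundant:
    ceph osd pool application enable myfs-data0 cephfs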
-
Sébastien Han authored
When we give up waiting for the pod to be running, describe it to see what's wrong.
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit eccea9fa)
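
Roughly the equivalent of what the tests do now on timeout, with an illustrative pod name:

    # Dump the pod's status and events to see why it never became Running
    kubectl -n rook-ceph describe pod rook-ceph-osd-0-abc123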
-
- 21 Oct, 2019 3 commits
-
Sébastien Han authored
Backport: ceph: expose prepare pod resource limits
-
Sébastien Han authored
The osd-prepare jobs should not inherit the osd resources, since this might cause scheduling issues. Instead, we can now use the new CRD property `prepareosd` in the resources section. Refer to the documentation for more details.
Closes: https://github.com/rook/rook/issues/2502
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 1765ad65)
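
A sketch of the new property with illustrative sizes (kubectl patch also accepts the patch body as YAML):

    # Give the osd-prepare jobs their own resources instead of the osd ones
    kubectl -n rook-ceph patch cephcluster rook-ceph --type merge -p '
    spec:
      resources:
        prepareosd:
          requests:
            cpu: "200m"
            memory: "200Mi"
          limits:
            cpu: "500m"
            memory: "400Mi"'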
-
Santosh Pillai authored
- OSD on PVC provisioning fails if the k8s version is less than 1.13.0.
- Updated the document to reflect the minimum k8s version required.
- Added a check to skip OSD on PVC provisioning if the minimum k8s version requirement is not met.
Signed-off-by: Santosh Pillai <sapillai@redhat.com>
(cherry picked from commit edf0c7ae)
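
A quick pre-flight check before enabling OSDs on PVC, as a sketch:

    # OSD on PVC requires Kubernetes 1.13.0 or newer
    kubectl version --short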
-
- 18 Oct, 2019 1 commit
-
mergify[bot] authored
Issue 4073 configure rgw non ec pool as metadata (bp #4087)
-
- 17 Oct, 2019 1 commit
-
Owen Tuz authored
This is used for S3 multipart uploads and should be configured in the same way as the other metadata pools.
Signed-off-by: Owen Tuz <owen@segfault.re>
(cherry picked from commit 15e3ecad)
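
For context, a minimal CephObjectStore sketch with illustrative values; the non-ec pool used for multipart uploads now follows the metadataPool spec rather than the erasure-coded dataPool:

    kubectl apply -f - <<EOF
    apiVersion: ceph.rook.io/v1
    kind: CephObjectStore
    metadata:
      name: my-store
      namespace: rook-ceph
    spec:
      metadataPool:
        replicated:
          size: 3
      dataPool:
        erasureCoded:
          dataChunks: 2
          codingChunks: 1
      gateway:
        type: s3
        port: 80
        instances: 1
    EOF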
-
- 16 Oct, 2019 16 commits
-
mergify[bot] authored
ceph: hide wrong error for clusterdisruption controller (bp #4094)
-
mergify[bot] authored
Multiple integration test fixes to improve CI stability (bp #4098)
-
mergify[bot] authored
ceph: detect mount fstype more accurately (bp #4109)
-
mergify[bot] authored
ceph: mgr do not override annotation (bp #4110)
-
Sébastien Han authored
Using `df --type ceph` has proven not to be reliable enough, so let's try findmnt instead.
Closes: https://github.com/rook/rook/issues/4107
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 9cfdc4f5)
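
Roughly the difference, with an illustrative mount point:

    # Old: filter df output by filesystem type, which proved unreliable
    df --type ceph
    # New: ask findmnt directly for the fstype of the mount point
    findmnt --noheadings --output FSTYPE --target /mnt/mypvc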
-
Sébastien Han authored
The current implementation was overriding any previous annotations set on the object meta. Moving the logic to its own method as well as adding unit tests.
Closes: https://github.com/rook/rook/issues/4106
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit f6a5caa1)
-
travisn authored
The timeout of 80 seconds wasn't always sufficient for mon failover. The timeout is now increased to 150 seconds.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 0cca2cde)
-
travisn authored
The two clusters were being created in parallel by the tests, sometimes causing the test to time out. The operator handles the clusters serially, so we don't gain anything by starting them asynchronously. With this change the tests are started serially to match the behavior of the operator and avoid the test timeouts.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 378f209d)
-
travisn authored
The logs are only collected by default if a test failed. Ensure that we collect the logs if the test setup step failed.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 98c21950)
-
travisn authored
The BlockCreateSuite and BlockMountUnmountSuite suites are almost identical for running block tests with different types of mounts. We can consolidate these into a single test suite and eliminate a couple of the tests that are already covered by other tests.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit a178cccd)
-
travisn authored
Testing whether `create` does any better in the tests than `apply`.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit 979b6fd2)
-
travisn authored
Adding more logging until we can track down the file-test pod cleanup issue.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit f5eb11d8)
-
Sébastien Han authored
The clusterdisruption controller starts up before the Ceph cluster is created, so during initialization it displays invalid error messages such as:

    2019-10-14 04:22:21.243668 E | clusterdisruption-controller: could not check cluster health: failed to get status: exit status 1
    2019-10-14 04:22:22.524179 I | exec: Running command: ceph status --connect-timeout=15 --cluster=openshift-storage --conf=/var/lib/rook/openshift-storage/openshift-storage.config --keyring=/var/lib/rook/openshift-storage/client.admin.keyring --format json --out-file /tmp/605071670
    2019-10-14 04:22:22.617321 I | exec: Error initializing cluster client: ObjectNotFound('error calling conf_read_file',)

That error message is invalid since the cluster is not up and running yet, so let's simply log a message instead.
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 05140d7a)
-
mergify[bot] authored
ceph: fix topologyAware support (bp #4099)
-
travisn authored
The unit tests in the backport require a change that was not required in master.
Signed-off-by: travisn <tnielsen@redhat.com>
-
Mateusz Los authored
Add the missing topologyAware and location env variables to the OSD pods.
Signed-off-by: Mateusz Los <los.mateusz@gmail.com>
(cherry picked from commit cb52aea9)
-
- 15 Oct, 2019 1 commit
-
mergify[bot] authored
OwnerReference and test pod cleanup (bp #4090)
-
- 14 Oct, 2019 1 commit
-
travisn authored
The cluster finalizer in some scenarios was not being removed during cluster removal. Specifically, if the cluster CR had been modified, the operator would always fail to remove the finalizer. This could occur when the CR status is updated around the same time that the cluster is deleted. Therefore, we need to remove the finalizer from a freshly retrieved instance of the cluster.
Signed-off-by: travisn <tnielsen@redhat.com>
(cherry picked from commit bc684400)
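
If a cluster still gets stuck this way, the usual manual escape hatch is to clear the finalizer by hand; a sketch with an illustrative CR name:

    # Remove all finalizers so the CR deletion can complete
    kubectl -n rook-ceph patch cephcluster rook-ceph --type merge \
      -p '{"metadata":{"finalizers":[]}}'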
-