  1. 31 Jan, 2022 1 commit
  2. 28 Jan, 2022 9 commits
    • docs: add 1.2.5 to changelog · f0e1938d
      Tim Gross authored
    • docs: missing changelog for #11892 (#11959) · 57517b16
      Tim Gross authored
    • 00df7dda
    • CSI: node unmount from the client before unpublish RPC (#11892) · 707b4b3e
      Tim Gross authored
      When an allocation stops, the `csi_hook` makes an unpublish RPC to the
      servers to unpublish via the CSI RPCs: first to the node plugins and
      then the controller plugins. The controller RPCs must happen after the
      node RPCs so that the node has had a chance to unmount the volume
      before the controller tries to detach the associated device.
      
      But the client has local access to the node plugins and can
      independently determine whether it's safe to send unpublish RPCs to
      those plugins. This will allow the server to treat the node plugin as
      abandoned if a client is disconnected and `stop_on_client_disconnect`
      is set. This will let the server try to send unpublish RPCs to the
      controller plugins, under the assumption that the client will be
      trying to unmount the volume on its end first.
      
      Note that the CSI `NodeUnpublishVolume`/`NodeUnstageVolume` RPCs can 
      return ignorable errors in the case where the volume has already been
      unmounted from the node. Handle all other errors by retrying until we
      succeed, giving operators the opportunity to reschedule a failed node
      plugin (e.g., when they accidentally drained a node without
      `-ignore-system`). Fan out the work for each volume into
      its own goroutine so that we can release a subset of volumes if only
      one is stuck.
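The retry-and-fan-out behavior described above can be sketched as follows. This is a minimal illustration, not Nomad's actual implementation; all names (`unpublishAll`, `errAlreadyUnmounted`) are hypothetical.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// errAlreadyUnmounted stands in for the "volume already unmounted"
// condition the commit message calls ignorable. Hypothetical name.
var errAlreadyUnmounted = errors.New("volume already unmounted")

// unpublishAll fans the unpublish work out into one goroutine per
// volume, so a single stuck volume cannot block releasing the others.
// Each goroutine retries until it succeeds or hits an ignorable error.
func unpublishAll(volumes []string, unpublish func(vol string) error, maxRetries int) map[string]error {
	var mu sync.Mutex
	results := make(map[string]error, len(volumes))
	var wg sync.WaitGroup
	for _, vol := range volumes {
		wg.Add(1)
		go func(vol string) {
			defer wg.Done()
			var err error
			for i := 0; i < maxRetries; i++ {
				err = unpublish(vol)
				if err == nil || errors.Is(err, errAlreadyUnmounted) {
					err = nil // ignorable: already unmounted counts as done
					break
				}
			}
			mu.Lock()
			results[vol] = err
			mu.Unlock()
		}(vol)
	}
	wg.Wait()
	return results
}

func main() {
	var mu sync.Mutex
	attempts := map[string]int{}
	unpublish := func(vol string) error {
		mu.Lock()
		attempts[vol]++
		n := attempts[vol]
		mu.Unlock()
		switch vol {
		case "vol-flaky":
			if n < 3 {
				return errors.New("transient failure") // succeeds on retry
			}
		case "vol-gone":
			return errAlreadyUnmounted // ignorable
		}
		return nil
	}
	res := unpublishAll([]string{"vol-ok", "vol-flaky", "vol-gone"}, unpublish, 5)
	fmt.Println(res["vol-ok"] == nil, res["vol-flaky"] == nil, res["vol-gone"] == nil)
}
```

Because each volume gets its own goroutine, the flaky volume's retries do not delay releasing the others.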
    • CSI: move terminal alloc handling into denormalization (#11931) · 593b8db9
      Tim Gross authored
      * The volume claim GC method and volumewatcher both have logic
      collecting terminal allocations that duplicates most of the logic
      that's now in the state store's `CSIVolumeDenormalize` method. Copy
      this logic into the state store so that all code paths have the same
      view of the past claims.
      * Remove logic in the volume claim GC that now lives in the state
      store's `CSIVolumeDenormalize` method.
      * Remove logic in the volumewatcher that now lives in the state
      store's `CSIVolumeDenormalize` method.
      * Remove logic in the node unpublish RPC that now lives in the state
      store's `CSIVolumeDenormalize` method.
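The consolidation described above can be sketched with a single shared denormalization pass. The types and names here are simplified and hypothetical, not Nomad's real structs.

```go
package main

import "fmt"

// Simplified, hypothetical types for illustration.
type Allocation struct {
	ID       string
	Terminal bool
}

type Claim struct {
	State string
}

type Volume struct {
	ReadClaims map[string]*Claim // keyed by allocation ID
	PastClaims map[string]*Claim
}

// denormalize sketches the consolidation the commit describes: one
// state-store method decides which claims belong to terminal (or
// missing) allocations, so the claim GC, the volumewatcher, and the
// unpublish RPC all share the same view of past claims.
func denormalize(vol *Volume, allocs map[string]*Allocation) {
	for id, claim := range vol.ReadClaims {
		alloc, ok := allocs[id]
		if !ok || alloc.Terminal {
			claim.State = "unpublishing"
			vol.PastClaims[id] = claim
			delete(vol.ReadClaims, id)
		}
	}
}

func main() {
	vol := &Volume{
		ReadClaims: map[string]*Claim{
			"alloc-live": {State: "taken"},
			"alloc-dead": {State: "taken"},
		},
		PastClaims: map[string]*Claim{},
	}
	allocs := map[string]*Allocation{
		"alloc-live": {ID: "alloc-live", Terminal: false},
		// "alloc-dead" no longer exists in the state store
	}
	denormalize(vol, allocs)
	fmt.Println(len(vol.ReadClaims), len(vol.PastClaims)) // 1 1
}
```

Moving this decision into one method is what lets the three duplicated call sites in the commit message be deleted.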
    • csi: ensure that PastClaims are populated with correct mode (#11932) · 7181e965
      Tim Gross authored
      In the client's `(*csiHook) Postrun()` method, we make an unpublish
      RPC that includes a claim in the `CSIVolumeClaimStateUnpublishing`
      state, using the mode from the client. But then in the
      `(*CSIVolume) Unpublish` RPC handler, we query the volume from the
      state store (because we only get an ID from the client). And when we
      make the client RPC for the node unpublish step, we use the _current
      volume's_ view of the mode. If the volume's mode has been changed
      before the old allocations can have their claims released, then we end
      up making a CSI RPC that will never succeed.
      
      Why does this code path get the mode from the volume and not the
      claim? Because the claim written by the GC job in `(*CoreScheduler)
      csiVolumeClaimGC` doesn't have a mode. Instead it just writes a claim
      in the unpublishing state to ensure the volumewatcher detects a "past
      claim" change and reaps all the claims on the volumes.
      
      Fix this by ensuring that...
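The commit message is truncated above, so the following is only a hedged illustration of the direction the title names (backfilling the mode on a GC-written claim from the allocation's existing claim, rather than trusting the volume's current mode). All types and names are hypothetical.

```go
package main

import "fmt"

// Hypothetical, simplified modes and types for illustration.
type Mode int

const (
	ModeUnknown Mode = iota
	ModeWrite
)

type Claim struct {
	AllocID string
	Mode    Mode
	State   string
}

type Volume struct {
	WriteClaims map[string]*Claim // keyed by allocation ID
	PastClaims  map[string]*Claim
}

// addPastClaim backfills the mode from the volume's existing claim for
// this allocation when the incoming (GC-written) claim has none. Using
// the volume's *current* mode instead would break if the mode changed
// before the old allocation's claim was released.
func addPastClaim(vol *Volume, claim *Claim) {
	if claim.Mode == ModeUnknown {
		if old, ok := vol.WriteClaims[claim.AllocID]; ok {
			claim.Mode = old.Mode
		}
	}
	claim.State = "unpublishing"
	vol.PastClaims[claim.AllocID] = claim
}

func main() {
	vol := &Volume{
		WriteClaims: map[string]*Claim{"a1": {AllocID: "a1", Mode: ModeWrite}},
		PastClaims:  map[string]*Claim{},
	}
	addPastClaim(vol, &Claim{AllocID: "a1"}) // GC-written claim: no mode
	fmt.Println(vol.PastClaims["a1"].Mode == ModeWrite) // true
}
```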
    • CSI: resolve invalid claim states (#11890) · cdbb2bcf
      Tim Gross authored
      * csi: resolve invalid claim states on read
      
      It's currently possible for CSI volumes to be claimed by allocations
      that no longer exist. This changeset asserts a reasonable state at
      the state store level by registering these nil allocations as "past
      claims" on any read. This will cause any pass through the periodic GC
      or volumewatcher to trigger the unpublishing workflow for those claims.
      
      * csi: make feasibility check errors more understandable
      
      When the feasibility checker finds we have no free write claims, it
      checks to see if any of those claims are for the job we're currently
      scheduling (so that earlier versions of a job can't block claims for
      new versions) and reports a conflict if the volume can't be scheduled
      so that the user can fix their claims. But when the checker hits a
      claim that has a GC'd allocation, the state is recoverable by the
      server once claim reaping completes and no user intervention is
      required; the blocked eval should complete. Differentiate the
      scheduler error produced by these two conditions.
    • csi: update leader's ACL in volumewatcher (#11891) · debffe24
      Tim Gross authored
      The volumewatcher that runs on the leader needs to make RPC calls
      rather than writing to raft (as we do in the deploymentwatcher)
      because the unpublish workflow needs to make RPC calls to the
      clients. This requires that the volumewatcher has access to the
      leader's ACL token.
      
      But when leadership transitions, the new leader creates a new leader
      ACL token. This ACL token needs to be passed into the volumewatcher
      when we enable it, otherwise the volumewatcher can find itself with a
      stale token.
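The stale-token problem described above can be sketched as a watcher that receives the leader's token at enable time rather than capturing one at construction. Names are hypothetical, not Nomad's actual code.

```go
package main

import "fmt"

// volumeWatcher is a minimal stand-in for the leader-side watcher that
// must make client RPCs with the current leader's ACL token.
type volumeWatcher struct {
	enabled  bool
	aclToken string
}

// SetEnabled receives the leader's token whenever the watcher is
// enabled, so each new leader hands over its own token instead of the
// watcher reusing one from a previous term.
func (w *volumeWatcher) SetEnabled(enabled bool, leaderToken string) {
	w.enabled = enabled
	if enabled {
		w.aclToken = leaderToken
	}
}

func main() {
	w := &volumeWatcher{}
	w.SetEnabled(true, "token-leader-1") // first leader enables the watcher
	w.SetEnabled(false, "")              // leadership lost
	w.SetEnabled(true, "token-leader-2") // new leader re-enables with its token
	fmt.Println(w.aclToken)              // token-leader-2
}
```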
    • Derek Strickland's avatar
  3. 19 Jan, 2022 1 commit
  4. 18 Jan, 2022 5 commits
  5. 17 Jan, 2022 5 commits
  6. 15 Jan, 2022 1 commit
  7. 14 Jan, 2022 15 commits
  8. 13 Jan, 2022 3 commits