  1. 28 Jan, 2022 5 commits
    • CSI: move terminal alloc handling into denormalization (#11931) · 2c6de3e8
      Tim Gross authored
      * The volume claim GC method and volumewatcher both have logic for
      collecting terminal allocations that duplicates most of the logic
      that's now in the state store's `CSIVolumeDenormalize` method. Copy
      this logic into the state store so that all code paths have the same
      view of the past claims.
      * Remove logic in the volume claim GC that now lives in the state
      store's `CSIVolumeDenormalize` method.
      * Remove logic in the volumewatcher that now lives in the state
      store's `CSIVolumeDenormalize` method.
      * Remove logic in the node unpublish RPC that now lives in the state
      store's `CSIVolumeDenormalize` method.
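
      As a rough sketch of the consolidated behavior (the types and helper
      below are illustrative, not Nomad's actual structs), the denormalize
      step records a past claim for every claim whose allocation is terminal
      or missing, so the GC job, the volumewatcher, and the RPC handlers all
      see the same set to reap:

          package main

          import "fmt"

          type ClaimState int

          const (
              ClaimStateTaken ClaimState = iota
              ClaimStateUnpublishing
          )

          type Claim struct {
              AllocID string
              State   ClaimState
          }

          type Volume struct {
              Claims     map[string]*Claim // live claims keyed by allocation ID
              PastClaims map[string]*Claim // claims whose allocation is terminal or gone
          }

          // denormalize moves claims for terminal or missing allocations into
          // PastClaims so every reader shares one view of what must be reaped.
          func denormalize(vol *Volume, allocIsRunning func(allocID string) bool) {
              if vol.PastClaims == nil {
                  vol.PastClaims = map[string]*Claim{}
              }
              for id, claim := range vol.Claims {
                  if !allocIsRunning(id) {
                      claim.State = ClaimStateUnpublishing
                      vol.PastClaims[id] = claim
                  }
              }
          }

          func main() {
              vol := &Volume{Claims: map[string]*Claim{"alloc-1": {AllocID: "alloc-1"}}}
              denormalize(vol, func(string) bool { return false }) // alloc-1 no longer exists
              fmt.Println(len(vol.PastClaims))                     // 1
          }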
    • csi: ensure that PastClaims are populated with correct mode (#11932) · 26b50083
      Tim Gross authored
      In the client's `(*csiHook) Postrun()` method, we make an unpublish
      RPC that includes a claim in the `CSIVolumeClaimStateUnpublishing`
      state, using the mode from the client. But then in the
      `(*CSIVolume) Unpublish` RPC handler, we query the volume from the
      state store (because we only get an ID from the client), and when we
      make the client RPC for the node unpublish step, we use the _current
      volume's_ view of the mode. If the volume's mode has been changed
      before the old allocations can have their claims released, we end
      up making a CSI RPC that will never succeed.
      
      Why does this code path get the mode from the volume and not the
      claim? Because the claim written by the GC job in `(*CoreScheduler)
      csiVolumeClaimGC` doesn't have a mode. Instead it just writes a claim
      in the unpublishing state to ensure the volumewatcher detects a "past
      claim" change and reaps all the claims on the volumes.
      
      Fix this by ensuring that `CSIVolumeDenormalize` creates past
      claims for all nil allocations with the correct access mode set.
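
      Illustrative sketch of the mode handling (field and function names are
      hypothetical): the past claim carries the access mode from the claim
      that was originally written, falling back to the volume's mode only
      when the claim has none:

          package main

          import "fmt"

          type AccessMode string

          type Claim struct {
              AllocID    string
              AccessMode AccessMode
          }

          // pastClaim builds the past claim recorded for a terminal allocation.
          // It copies the mode from the claim that was written at claim time,
          // not from the volume's current mode, which may have changed since.
          func pastClaim(existing *Claim, volumeCurrentMode AccessMode) *Claim {
              mode := existing.AccessMode
              if mode == "" {
                  // Claims written by the GC job carry no mode; fall back to the volume.
                  mode = volumeCurrentMode
              }
              return &Claim{AllocID: existing.AllocID, AccessMode: mode}
          }

          func main() {
              old := &Claim{AllocID: "alloc-1", AccessMode: "single-node-writer"}
              fmt.Println(pastClaim(old, "multi-node-reader").AccessMode) // single-node-writer
          }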
    • CSI: resolve invalid claim states (#11890) · 6e0119de
      Tim Gross authored
      * csi: resolve invalid claim states on read
      
      It's currently possible for CSI volumes to be claimed by allocations
      that no longer exist. This changeset asserts a reasonable state at
      the state store level by registering these nil allocations as "past
      claims" on any read. This will cause any pass through the periodic GC
      or volumewatcher to trigger the unpublishing workflow for those claims.
      
      * csi: make feasibility check errors more understandable
      
      When the feasibility checker finds we have no free write claims, it
      checks to see if any of those claims are for the job we're currently
      scheduling (so that earlier versions of a job can't block claims for
      new versions) and reports a conflict if the volume can't be scheduled
      so that the user can fix their claims. But when the checker hits a
      claim that has a GC'd allocation, the state is recoverable by the
      server once claim reaping completes and no user intervention is
      required; the blocked eval should complete. Differentiate the
      scheduler error produced by these two conditions.
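
      A hedged sketch of how the two cases can be told apart in the
      feasibility check (error values and names are illustrative only):

          package main

          import (
              "errors"
              "fmt"
          )

          var (
              // The volume is genuinely over-claimed; the user must fix their claims.
              errVolumeClaimConflict = errors.New("volume max claims reached")
              // A claim belongs to a GC'd allocation; the server will reap it and
              // the blocked eval should complete without user intervention.
              errVolumeClaimGCPending = errors.New("volume claims are pending garbage collection")
          )

          // reportClaimError picks the scheduler error when no free write claim
          // is available, so the two situations read differently in eval output.
          func reportClaimError(claimHasLiveAlloc bool) error {
              if claimHasLiveAlloc {
                  return errVolumeClaimConflict
              }
              return errVolumeClaimGCPending
          }

          func main() {
              fmt.Println(reportClaimError(false)) // recoverable: claim reaping clears it
              fmt.Println(reportClaimError(true))  // user-visible conflict
          }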
    • csi: update leader's ACL in volumewatcher (#11891) · 41c2daf4
      Tim Gross authored
      The volumewatcher that runs on the leader needs to make RPC calls
      rather than writing to raft (as we do in the deploymentwatcher)
      because the unpublish workflow needs to make RPC calls to the
      clients. This requires that the volumewatcher has access to the
      leader's ACL token.
      
      But when leadership transitions, the new leader creates a new leader
      ACL token. This ACL token needs to be passed into the volumewatcher
      when we enable it, otherwise the volumewatcher can find itself with a
      stale token.
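
      Roughly, the enable path has to carry the freshly minted leader ACL on
      every transition; the names below are a sketch, not Nomad's exact
      signatures:

          package main

          import (
              "fmt"
              "sync"
          )

          // Watcher stands in for the volumewatcher that runs on the leader and
          // makes client RPCs for the unpublish workflow.
          type Watcher struct {
              mu        sync.Mutex
              enabled   bool
              leaderACL string // token used when making RPCs
          }

          // SetEnabled is called on every leadership transition. Passing in the
          // new leader's ACL token prevents the watcher from holding on to a
          // stale token minted by a previous leader.
          func (w *Watcher) SetEnabled(enabled bool, leaderACL string) {
              w.mu.Lock()
              defer w.mu.Unlock()
              w.enabled = enabled
              w.leaderACL = leaderACL
          }

          func main() {
              w := &Watcher{}
              w.SetEnabled(true, "new-leader-acl-token") // on gaining leadership
              fmt.Println(w.enabled, w.leaderACL != "")
          }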
    • csi: reap unused volume claims at leadership transitions (#11776) · ad8166de
      Tim Gross authored
      When `volumewatcher.Watcher` starts on the leader, it starts a watch
      on every volume and triggers a reap of unused claims on any change to
      that volume. But if a reaping is in-flight during leadership
      transitions, it will fail and the event that triggered the reap will
      be dropped. Perform one reap of unused claims at the start of the
      watcher so that leadership transitions don't drop this event.
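
      In sketch form (hypothetical names), the watcher now does one full pass
      over its volumes before settling into the event-driven loop:

          package main

          import (
              "context"
              "fmt"
              "time"
          )

          type Watcher struct {
              volumeEvents chan string // IDs of volumes whose claims changed
          }

          func (w *Watcher) reapUnusedClaims(ctx context.Context, volID string) {
              fmt.Println("reaping unused claims for", volID)
          }

          // run reaps every volume once up front, so an event dropped during a
          // leadership transition is not lost forever, then watches for changes.
          func (w *Watcher) run(ctx context.Context, allVolumes []string) {
              for _, id := range allVolumes {
                  w.reapUnusedClaims(ctx, id) // one-time pass at startup
              }
              for {
                  select {
                  case id := <-w.volumeEvents:
                      w.reapUnusedClaims(ctx, id)
                  case <-ctx.Done():
                      return
                  }
              }
          }

          func main() {
              ctx, cancel := context.WithTimeout(context.Background(), 10*time.Millisecond)
              defer cancel()
              w := &Watcher{volumeEvents: make(chan string)}
              w.run(ctx, []string{"vol-1", "vol-2"})
          }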
  2. 19 Jan, 2022 1 commit
  3. 18 Jan, 2022 18 commits
    • Generate files for 1.1.10 release · 028cef25
      Nomad Release bot authored
    • docs: add 1.1.10 to changelog · d7ae04eb
      Luiz Aoqui authored
    • Merge pull request #11744 from hashicorp/b-node-copy · 5d5bb262
      Michael Schurter authored
      Fix Node.Copy()
    • changelog: add entry for #11793 (#11862) · 2ba4892c
      Luiz Aoqui authored
    • drivers: set world-readable permissions on copied resolv.conf (#11856) · 0d14741d
      Tim Gross authored
      When we copy the system DNS to a task's `resolv.conf`, we should set
      the permissions as world-readable so that unprivileged users within
      the task can read it.
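
      A minimal sketch of the copy, assuming a plain file-copy helper; the
      relevant detail is the world-readable 0644 mode on the destination:

          package main

          import (
              "io"
              "log"
              "os"
          )

          // copyResolvConf copies the host's DNS config into the task directory
          // and makes it world-readable so unprivileged users in the task can
          // read it.
          func copyResolvConf(src, dst string) error {
              in, err := os.Open(src)
              if err != nil {
                  return err
              }
              defer in.Close()

              // 0644: owner read/write, group and others read-only.
              out, err := os.OpenFile(dst, os.O_CREATE|os.O_WRONLY|os.O_TRUNC, 0o644)
              if err != nil {
                  return err
              }
              defer out.Close()

              if _, err := io.Copy(out, in); err != nil {
                  return err
              }
              // Re-apply the mode so a restrictive umask can't narrow it.
              return out.Chmod(0o644)
          }

          func main() {
              if err := copyResolvConf("/etc/resolv.conf", "resolv.conf"); err != nil {
                  log.Fatal(err)
              }
          }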
    • freebsd: build fix for ARM7 32-bit (#11854) · 2fb80225
      Tim Gross authored
      The size of `stat_t` fields is architecture-dependent, which was
      reportedly causing a build failure on FreeBSD ARM7 32-bit
      systems. This changeset matches the behavior we have on Linux.
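
      The usual fix for this class of build error is an explicit integer
      conversion, since the concrete type of a `stat_t` field differs by
      platform; a hedged illustration (the fields touched by the actual
      change may differ):

          package main

          import (
              "fmt"
              "log"
              "syscall"
          )

          func main() {
              var st syscall.Stat_t
              if err := syscall.Stat("/", &st); err != nil {
                  log.Fatal(err)
              }
              // Field types vary between int32 and int64 across OS and
              // architecture combinations; converting explicitly keeps the
              // expression valid everywhere.
              size := int64(st.Size)
              blocks := int64(st.Blocks)
              fmt.Println(size, blocks)
          }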
    • csi: when warning for multiple prefix matches, use full ID (#11853) · 54203561
      Tim Gross authored
      When the `volume deregister` or `volume detach` commands get an ID
      prefix that matches multiple volumes, show the full volume IDs in
      the list of matching volumes so that the user can select the
      correct one.
    • csi: volume deregistration should require exact ID (#11852) · cd0139d1
      Tim Gross authored
      The command line client sends a specific volume ID, but this isn't
      enforced at the API level and we were incorrectly using a prefix match
      for volume deregistration, resulting in cases where a volume with a
      shorter ID that's a prefix of another volume would be deregistered
      instead of the intended volume.
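
      A toy sketch of the stricter lookup at the API level (the lookup helper
      is hypothetical): deregistration resolves only an exact ID rather than
      a prefix:

          package main

          import (
              "errors"
              "fmt"
          )

          var volumes = map[string]string{
              "web":      "volume for the web job",
              "web-logs": "volume for log shipping",
          }

          // volumeByExactID refuses prefix matches, so deregistering "web" can
          // never act on "web-logs" by accident.
          func volumeByExactID(id string) (string, error) {
              vol, ok := volumes[id]
              if !ok {
                  return "", errors.New("volume not found: " + id)
              }
              return vol, nil
          }

          func main() {
              vol, err := volumeByExactID("web")
              fmt.Println(vol, err) // matches only the volume whose ID is exactly "web"
          }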
    • Merge pull request #11849 from hashicorp/b-changelog-11848 · c48d40d1
      James Rasell authored
      changelog: add entry for #11848
    • Merge pull request #11833 from hashicorp/deps-go-getter-v1.5.11 · 10c786a0
      Michael Schurter authored
      deps: update go-getter to v1.5.11
    • Merge pull request #11830 from hashicorp/b-validate-reserved-ports · cc1e4847
      Michael Schurter authored
      agent: validate reserved_ports are valid
    • 8d34f9b4
      Tim Gross authored
    • Luiz Aoqui authored
    • Un-break templates when using vault stanza change_mode noop (#11783) · 14c8bbb5
      grembo authored
      Templates in nomad jobs make use of the vault token defined in
      the vault stanza when issuing credentials like client certificates.
      
      When using change_mode "noop" in the vault stanza, consul-template
      is not informed when a vault token is re-issued (which can
      happen from time to time for various reasons, as described
      in https://www.nomadproject.io/docs/job-specification/vault).
      
      As a result, consul-template will keep using the old vault token
      to renew credentials and - once the token expires - stop renewing
      credentials. The symptom of this problem is a vault_token
      file that is newer than the issued credential (e.g., a TLS certificate)
      in a job's /secrets directory.
      
      This change corrects this, so that h.updater.updatedVaultToken(token)
      is called, which informs stakeholders about the new
      token and makes sure the new token is used by consul-template.
      
      Example job template fragment:
      
          vault {
              policies = ["n...
    • task runner: fix goroutine leak in prestart hook (#11741) · a24bd934
      Tim Gross authored
      The task runner prestart hooks take a `joincontext` so they have the
      option to exit early if either of two contexts are canceled: from
      killing the task or client shutdown. Some tasks exit without being
      shut down by the server, so neither of the joined contexts ever gets
      canceled and we leak the `joincontext` (48 bytes) and its internal
      goroutine. This primarily impacts batch jobs and any task that fails
      or completes early such as non-sidecar prestart lifecycle tasks.
      Cancel the `joincontext` after the prestart call exits to fix the
      leak.
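
      A hedged sketch of the pattern, using a home-grown merged-context helper
      rather than the actual joincontext package API: cancel the joined
      context as soon as the prestart call returns, so its goroutine exits
      even when neither parent context is ever cancelled:

          package main

          import (
              "context"
              "fmt"
          )

          // joinContexts returns a context cancelled when either parent is
          // cancelled. Its goroutine only exits once a parent is done or the
          // returned cancel func is called, so callers must always call cancel.
          func joinContexts(a, b context.Context) (context.Context, context.CancelFunc) {
              ctx, cancel := context.WithCancel(context.Background())
              go func() {
                  select {
                  case <-a.Done():
                  case <-b.Done():
                  case <-ctx.Done():
                  }
                  cancel()
              }()
              return ctx, cancel
          }

          func runPrestartHook(ctx context.Context) error {
              fmt.Println("prestart hook ran")
              return nil
          }

          func main() {
              killCtx := context.Background()     // cancelled when the task is killed
              shutdownCtx := context.Background() // cancelled on client shutdown

              joined, cancel := joinContexts(killCtx, shutdownCtx)
              // Without this cancel, a task that finishes on its own (batch jobs,
              // prestart lifecycle tasks) would leak the goroutine above.
              defer cancel()

              _ = runPrestartHook(joined)
          }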
    • f730bc28
      Luiz Aoqui authored
    • scheduler: fix quadratic performance with spread blocks (#11712) · d63e628a
      Tim Gross authored
      When the scheduler picks a node for each evaluation, the
      `LimitIterator` provides at most 2 eligible nodes for the
      `MaxScoreIterator` to choose from. This keeps scheduling fast while
      producing acceptable results because the results are binpacked.
      
      Jobs with a `spread` block (or node affinity) remove this limit in
      order to produce correct spread scoring. This means that every
      allocation within a job with a `spread` block is evaluated against
      _all_ eligible nodes. Operators of large clusters have reported that
      jobs with `spread` blocks that are eligible on a large number of nodes
      can take longer than the nack timeout (60s) to evaluate. Typical
      evaluations are processed in milliseconds.
      
      In practice, it's not necessary to evaluate every eligible node for
      every allocation on large clusters, because the `RandomIterator` at
      the base of the scheduler stack produces enough variation in each pass
      that the likelihood of an uneven spread is negligible. Note that
      feasibility is checked before the limit, so this only impacts the
      number of _eligible_ nodes available for scoring, not the total number
      of nodes.
      
      This changeset sets the iterator limit for "large" `spread` block and
      node affinity jobs to be equal to the number of desired
      allocations. This brings an example problematic job evaluation down
      from ~3min to ~10s. The included tests ensure that we have acceptable
      spread results across a variety of large cluster topologies.
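
      A simplified sketch of the limit choice (names illustrative): jobs with
      spread or node affinity get a scoring limit equal to the desired
      allocation count instead of scoring every eligible node:

          package main

          import "fmt"

          // scoreLimit returns how many eligible nodes the limit iterator should
          // pass through to the scoring stage for one placement.
          func scoreLimit(hasSpreadOrAffinity bool, desiredAllocs, eligibleNodes int) int {
              const defaultLimit = 2 // binpacking only needs a couple of candidates

              if !hasSpreadOrAffinity {
                  return defaultLimit
              }
              // Spread scoring needs more variety, but scoring every eligible node
              // on a large cluster is what made evaluations take minutes; the
              // desired allocation count is enough because node order is already
              // randomized by the iterator stack.
              if desiredAllocs < eligibleNodes {
                  return desiredAllocs
              }
              return eligibleNodes
          }

          func main() {
              fmt.Println(scoreLimit(false, 100, 5000)) // 2
              fmt.Println(scoreLimit(true, 100, 5000))  // 100
          }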
    • cli: ensure `-stale` flag is respected by `nomad operator debug` (#11678) · 03e5425e
      Tim Gross authored
      When a cluster doesn't have a leader, the `nomad operator debug`
      command can safely use stale queries to gracefully degrade the
      consistency of almost all its queries. The query parameter for these
      API calls was not being set by the command.
      
      Some `api` package queries do not include `QueryOptions` because
      they target a specific agent, but they can potentially be forwarded to
      other agents. If there is no leader, these forwarded queries will
      fail. Provide methods to call these APIs with `QueryOptions`.
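
      For illustration, this is how a caller of the `api` package opts into
      stale reads; `nomad operator debug -stale` needs the equivalent option
      set on the queries it issues (the specific query used here is just an
      example):

          package main

          import (
              "fmt"
              "log"

              "github.com/hashicorp/nomad/api"
          )

          func main() {
              client, err := api.NewClient(api.DefaultConfig())
              if err != nil {
                  log.Fatal(err)
              }

              // AllowStale lets any server answer from its local state, so the
              // query can still succeed while the cluster has no leader.
              q := &api.QueryOptions{AllowStale: true}

              nodes, _, err := client.Nodes().List(q)
              if err != nil {
                  log.Fatal(err)
              }
              fmt.Println("nodes:", len(nodes))
          }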
  4. 17 Jan, 2022 4 commits
  5. 10 Dec, 2021 4 commits
  6. 19 Nov, 2021 4 commits
  7. 15 Nov, 2021 4 commits