This project is mirrored from https://gitee.com/mirrors/nomad.git.
Pull mirroring failed.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
- 03 Feb, 2022 3 commits
-
-
Tim Gross authored
Non-CSI garbage collection tasks on the server only log the cutoff index in the case where it's not a forced GC from `nomad system gc`. Do the same for CSI for consistency.
-
Tim Gross authored
Many of Nomad's Autopilot features require raft protocol version 3. Set the default raft protocol to 3, and improve the upgrade documentation.
-
Seth Hoenig authored
cleanup: prevent leaks from time.After
-
- 02 Feb, 2022 9 commits
-
-
Glen Yu authored
-
ttys3 authored
-
René Moser authored
-
Samantha authored
Update the logic in the Nomad client's alloc health tracker which erroneously marks existing healthy allocations with dead poststart ephemeral tasks as unhealthy even if they were already successful during a previous deployment.
-
Seth Hoenig authored
This PR replaces use of time.After with a safe helper function that creates a time.Timer to use instead. The new function returns both a time.Timer and a Stop function that the caller must handle. Unlike time.NewTimer, the helper function does not panic if the duration set is <= 0.
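A minimal sketch of the kind of helper this commit message describes, not the actual Nomad implementation. The names `SafeTimer`, `StopFunc`, and `waitOrCancel` are hypothetical, and clamping a non-positive duration to the smallest positive value is an assumption about how such a helper might treat `<= 0`.

```go
package main

import "time"

// StopFunc releases the timer created by SafeTimer (hypothetical name).
type StopFunc func()

// SafeTimer is a hypothetical helper in the spirit of the commit message:
// it returns a *time.Timer plus a stop function the caller must invoke.
// Assumption: non-positive durations are clamped to the smallest positive
// duration so the timer fires almost immediately.
func SafeTimer(d time.Duration) (*time.Timer, StopFunc) {
	if d <= 0 {
		d = 1 // one nanosecond
	}
	t := time.NewTimer(d)
	return t, func() { t.Stop() }
}

// waitOrCancel blocks until d elapses or cancel is closed. It reports
// true if the timer fired. The deferred stop releases the timer on the
// cancel path, which a bare time.After in the select would not do.
func waitOrCancel(d time.Duration, cancel <-chan struct{}) bool {
	t, stop := SafeTimer(d)
	defer stop()

	select {
	case <-t.C:
		return true
	case <-cancel:
		return false
	}
}
```

Returning the stop function alongside the timer makes it hard for a caller to forget the cleanup: the usual pattern is to `defer stop()` immediately after creating the timer.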
-
Luiz Aoqui authored
-
Seth Hoenig authored
build: disable semgrep on structs.go for now
-
James Rasell authored
e2e: moved missed volume test stop command to util helper.
-
James Rasell authored
-
- 01 Feb, 2022 8 commits
-
-
Tim Gross authored
prepare for next release
-
Tim Gross authored
-
Seth Hoenig authored
-
Tim Gross authored
-
James Rasell authored
e2e: account for new job stop CLI exit behaviour.
-
Seth Hoenig authored
deps: import libtime the normal way
-
James Rasell authored
PR #11550 changed the job stop exit behaviour when monitoring the deployment. When stopping a job, the deployment becomes cancelled and therefore the CLI now exits with status code 1 as it sees this as an error. This change adds a new utility e2e function that accounts for this behaviour.
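A rough sketch of how a test utility could tolerate one specific exit code, assuming the command is run through `os/exec`. The function name `runAllowingExitCode` is hypothetical and is not the helper added by this commit.

```go
package main

import (
	"errors"
	"os/exec"
)

// runAllowingExitCode runs a command and treats one specific non-zero
// exit code as success. Hypothetical helper: an e2e test could use it to
// tolerate the exit status 1 described above while still surfacing any
// other failure, such as a missing binary or a different exit code.
func runAllowingExitCode(allowed int, name string, args ...string) error {
	err := exec.Command(name, args...).Run()

	var exitErr *exec.ExitError
	if errors.As(err, &exitErr) && exitErr.ExitCode() == allowed {
		return nil
	}
	return err
}
```

Matching on the exact exit code, rather than ignoring every error, keeps unrelated failures visible in the test output.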
-
Michael Schurter authored
Bugfix: auto-promote canary taskgroups when mixed with non-canary taskgroups
-
- 31 Jan, 2022 5 commits
-
-
kainoaseto authored
-
Michael Schurter authored
core: prevent malformed plans from crashing leader
-
Seth Hoenig authored
Previously we copied this library by hand to avoid vendoring a bunch of files related to minimock. Now that we no longer vendor, just import the library normally. Also, we might use more of the library for handling `time.After` uses, for which this library provides a Context-based solution.
-
Michael Schurter authored
-
Michael Schurter authored
The Plan.Submit endpoint assumed PlanRequest.Plan was never nil. While there is no evidence it ever has been nil, we should not panic if a nil plan is ever submitted because that would crash the leader.
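The guard described above can be sketched as follows. The `Plan` and `PlanRequest` types here are simplified stand-ins, and `submitPlan` and `errNilPlan` are hypothetical names, so this only illustrates the shape of the check and not Nomad's actual endpoint.

```go
package main

import "errors"

// Plan and PlanRequest are simplified stand-ins for illustration only.
type Plan struct {
	EvalID string
}

type PlanRequest struct {
	Plan *Plan
}

// errNilPlan is a hypothetical error value for this sketch.
var errNilPlan = errors.New("cannot submit nil plan")

// submitPlan rejects a missing plan with an error instead of
// dereferencing a nil pointer, which would panic the calling process.
func submitPlan(req *PlanRequest) (string, error) {
	if req == nil || req.Plan == nil {
		return "", errNilPlan
	}
	return req.Plan.EvalID, nil
}
```

Returning an error turns a malformed request into a failed RPC for one caller rather than a crash of the process serving every caller.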
-
- 28 Jan, 2022 15 commits
-
-
Tim Gross authored
-
Noel Quiles authored
* chore: Add Demandbase tag to consent manager
* fix: Add services to manager options
-
Jai authored
feat: add evaluations view with table
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
To support pagination on evaluations queries.
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Jai Bhagat authored
-
Tim Gross authored
When an allocation stops, the `csi_hook` sends an unpublish RPC to the servers, which unpublish the volume via the CSI RPCs: first to the node plugins and then to the controller plugins. The controller RPCs must happen after the node RPCs so that the node has had a chance to unmount the volume before the controller tries to detach the associated device.

But the client has local access to the node plugins and can independently determine if it's safe to send unpublish RPCs to those plugins. This will allow the server to treat the node plugin as abandoned if a client is disconnected and `stop_on_client_disconnect` is set. This will let the server try to send unpublish RPCs to the controller plugins, under the assumption that the client will be trying to unmount the volume on its end first.

Note that the CSI `NodeUnpublishVolume`/`NodeUnstageVolume` RPCs can return ignorable errors in the case where the volume has already been unmounted from the node. Handle all other errors by retrying until we get success so as to give operators the opportunity to reschedule a failed node plugin (e.g. in the case where they accidentally drained a node without `-ignore-system`). Fan out the work for each volume into its own goroutine so that we can release a subset of volumes if only one is stuck.
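The per-volume fan-out with retries might look roughly like the sketch below. All names here (`unpublishAll`, `unpublishFunc`, `errAlreadyUnmounted`, `errPluginUnavailable`) are hypothetical. Unlike the behaviour described in the commit message, which retries until success, this sketch bounds the number of attempts so that it always terminates.

```go
package main

import (
	"errors"
	"sync"
	"time"
)

// Hypothetical error values for this sketch.
var (
	errAlreadyUnmounted  = errors.New("volume already unmounted") // ignorable
	errPluginUnavailable = errors.New("node plugin unavailable")  // retryable
)

// unpublishFunc stands in for the node unpublish/unstage call.
type unpublishFunc func(volumeID string) error

// unpublishAll runs one goroutine per volume so that a stuck volume does
// not block the release of the others. Assumption: attempts are capped at
// maxAttempts instead of retrying forever. The returned map holds the
// final error per volume, or nil on success.
func unpublishAll(volumeIDs []string, unpublish unpublishFunc, maxAttempts int, backoff time.Duration) map[string]error {
	var (
		mu sync.Mutex
		wg sync.WaitGroup
	)
	results := make(map[string]error, len(volumeIDs))

	for _, id := range volumeIDs {
		wg.Add(1)
		go func(id string) {
			defer wg.Done()

			var err error
			for attempt := 0; attempt < maxAttempts; attempt++ {
				err = unpublish(id)
				if err == nil || errors.Is(err, errAlreadyUnmounted) {
					err = nil // an already-unmounted volume counts as success
					break
				}
				time.Sleep(backoff)
			}

			mu.Lock()
			results[id] = err
			mu.Unlock()
		}(id)
	}

	wg.Wait()
	return results
}
```

Because each volume has its own goroutine and result slot, volumes that unpublish cleanly report success even while another volume keeps failing against an unavailable plugin.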
-