This project is mirrored from https://gitee.com/mirrors/nomad.git. Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
  1. 11 May, 2022 1 commit
    • Michael Schurter's avatar
      core: emit node evals only for sys jobs in dc · 9f70ede5
      Michael Schurter authored
      Whenever a node joins the cluster, either for the first time or after
      being `down`, we emit a evaluation for every system job to ensure all
      applicable system jobs are running on the node.
      
      This patch adds an optimization to skip creating evaluations for system
      jobs not in the current node's DC. While the scheduler performs the same
      feasability check, skipping the creation of the evaluation altogether
      saves disk, network, and memory.
      9f70ede5
  2. 26 Apr, 2022 4 commits
    • Michael Schurter's avatar
      client: fix waiting on preempted alloc (#12779) · e7924e35
      Michael Schurter authored
      Fixes #10200
      
      **The bug**
      
      A user reported receiving the following error when an alloc was placed
      that needed to preempt existing allocs:
      
      ```
      [ERROR] client.alloc_watcher: error querying previous alloc:
      alloc_id=28... previous_alloc=8e... error="rpc error: alloc lookup
      failed: index error: UUID must be 36 characters"
      ```
      
      The previous alloc (8e) was already complete on the client. This is
      possible if an alloc stops *after* the scheduling decision was made to
      preempt it, but *before* the node running both allocations was able to
      pull and start the preemptor. While that is hopefully a narrow window of
      time, you can expect it to occur in high throughput batch scheduling
      heavy systems.
      
      However the RPC error made no sense! `previous_alloc` in the logs was a
      valid 36 character UUID!
      
      **The fix**
      
      The fix is:
      
      ```
      -		prevAllocID:  c.Alloc.PreviousAllocation,
      +		prevAllocID:  watchedAllocID,
      ```
      
      The alloc watcher new func used for ...
      Unverified
      e7924e35
    • Tim Gross's avatar
      E2E: move volume mounts test to use golang's stdlib test runner (#12788) · 059c89df
      Tim Gross authored
      Part of ongoing work to remove the old E2E framework code.
      Unverified
      059c89df
    • Tim Gross's avatar
      E2E: remove old CLI for driving provisioning (#12787) · 26b0e047
      Tim Gross authored
      We moved off the old provisioning process for nightly E2E to one driven
      entirely by Terraform quite a while back now. We're in the slow
      process of removing the framework code for this test-by-test, but this
      chunk of code no longer has any callers.
      Unverified
      26b0e047
    • Tim Gross's avatar
      CSI: enforce one plugin supervisor loop via `sync.Once` (#12785) · b32722a6
      Tim Gross authored
      We enforce exactly one plugin supervisor loop by checking whether
      `running` is set and returning early. This works but is fairly
      subtle. It can briefly result in two goroutines where one quickly
      exits before doing any work. Clarify the intent by using
      `sync.Once`. The goroutine we've spawned only exits when the entire
      task runner is being torn down, and not when the task driver restarts
      the workload, so it should never be re-run.
      Unverified
      b32722a6
  3. 25 Apr, 2022 3 commits
  4. 22 Apr, 2022 19 commits
  5. 21 Apr, 2022 13 commits