This project is mirrored from https://gitee.com/mirrors/nomad.git. Pull mirroring failed.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
  1. 23 Jun, 2022 3 commits
    • Changelog · ac14be46
      Phil Renaud authored
    • fix deadlock in plan_apply (#13407) · 0213a36e
      Tim Gross authored
      The plan applier has to get a snapshot with a minimum index for the
      plan it's working on in order to ensure consistency. Under heavy raft
      loads, we can exceed the timeout. When this happens, we hit a bug
      where the plan applier blocks waiting on the `indexCh` forever, and
      all schedulers will block in `Plan.Submit`.
      
      Closing the `indexCh` when the `asyncPlanWait` is done with it will
      prevent the deadlock without impacting correctness of the previous
      snapshot index.
      
      This changeset includes a PoC failing test that works by injecting
      a large timeout into the state store. We need to turn this into a test
      we can run normally without breaking the state store before we can
      merge this PR.
      
      Increase `snapshotMinIndex` timeout to 10s.
      This timeout creates backpressure where any concurrent `Plan.Submit`
      RPCs will block waiting for results. This sheds load across all
      servers and gives raft some CPU to catch up, because schedulers won't
      dequeue more work while waiting. Increase it to 10s based on
      observations of large production clusters.
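The channel-closing fix described in the commit message above can be sketched as follows. This is a minimal illustration, not the Nomad source: `waitForIndex`, `nextSnapshotIndex`, `run`, and `errTimeout` are hypothetical names, and the `apply` callback stands in for whatever work produces the new index. The point it shows is that a receiver blocked on a channel is always woken when the sender closes it, and can fall back to the previous index when no value arrived.

```go
package main

import (
	"errors"
	"fmt"
)

// errTimeout is a hypothetical stand-in for a snapshot or raft timeout.
var errTimeout = errors.New("timed out waiting for index")

// waitForIndex (hypothetical) sends the new index only on success, but
// always closes indexCh when it returns, so a receiver cannot block forever.
func waitForIndex(apply func() (uint64, error), indexCh chan<- uint64) {
	defer close(indexCh)
	idx, err := apply()
	if err != nil {
		return // nothing is sent; the deferred close wakes the receiver
	}
	indexCh <- idx
}

// nextSnapshotIndex returns the received index, or prev if the channel
// was closed without a value being sent.
func nextSnapshotIndex(prev uint64, indexCh <-chan uint64) uint64 {
	if idx, ok := <-indexCh; ok {
		return idx
	}
	return prev
}

// run wires the two halves together the way an applier loop might.
func run(prev uint64, apply func() (uint64, error)) uint64 {
	indexCh := make(chan uint64, 1)
	go waitForIndex(apply, indexCh)
	return nextSnapshotIndex(prev, indexCh)
}

func main() {
	ok := run(10, func() (uint64, error) { return 42, nil })
	failed := run(10, func() (uint64, error) { return 0, errTimeout })
	fmt.Println(ok, failed) // prints "42 10"
}
```

Without the deferred `close`, the failing call would leave the receiver waiting on a value that is never sent, which is the shape of the deadlock the commit describes.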
  2. 22 Jun, 2022 2 commits
  3. 21 Jun, 2022 3 commits
  4. 20 Jun, 2022 2 commits
  5. 17 Jun, 2022 1 commit
  6. 16 Jun, 2022 5 commits
  7. 15 Jun, 2022 1 commit
  8. 14 Jun, 2022 7 commits
  9. 13 Jun, 2022 1 commit
  10. 10 Jun, 2022 6 commits
  11. 09 Jun, 2022 4 commits
    • Merge pull request #13308 from hashicorp/f-generic-helpers · 2697e63a
      Seth Hoenig authored
      helpers: provide a few generic helper functions
    • helpers: provide a few generic helper functions · 109f25fe
      Seth Hoenig authored
      This PR deprecates some functions in favor of generic alternatives.
      
      The new functions are compatible only with Nomad v1.4+.
      
      Neither the old functions nor their uses should be removed until Nomad v1.6+.
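As an illustration of the kind of change this commit describes, the sketch below keeps a type-specific helper, marks it deprecated, and adds a generic alternative beside it. The names `StringToPtr` and `PointerOf` are assumptions chosen for the example and are not confirmed by the commit message. Generics require Go 1.18 or later.

```go
package main

import "fmt"

// StringToPtr returns a pointer to a copy of s.
//
// Deprecated: use PointerOf instead. (Illustrative name, assumed for
// this sketch; the old helper is kept so existing callers still compile.)
func StringToPtr(s string) *string { return &s }

// PointerOf returns a pointer to a copy of v for any type T.
// (Illustrative name, assumed for this sketch.)
func PointerOf[T any](v T) *T { return &v }

func main() {
	name := PointerOf("web") // T is inferred as string
	count := PointerOf(3)    // T is inferred as int
	fmt.Println(*name, *count, *StringToPtr("web") == *name) // prints "web 3 true"
}
```

Keeping the deprecated function alongside the generic one lets callers migrate gradually, which matches the commit's note that the old functions stay in place for now.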
    • CSI: skip node unpublish on GC'd or down nodes (#13301) · dd1bbbec
      Tim Gross authored
      If the node has been GC'd or is down, we can't send it a node
      unpublish. The CSI spec requires that we don't send the controller
      unpublish before the node unpublish, but in the case where a node is
      gone we can't know the final fate of the node unpublish step.
      
      The `csi_hook` on the client will unpublish if the allocation has
      stopped, and if the host is terminated, there's no mount for the volume
      anyway. So we'll now assume that the node has unpublished at its
      end. If it hasn't, any controller unpublish will potentially hang or
      error and need to be retried.
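The ordering rule in this commit message can be sketched as a small decision function. This is a simplified model under stated assumptions, not Nomad's implementation: the `node` type, the `statusDown` constant, and both function names are hypothetical, and a `nil` node is used here to model one that has been garbage collected.

```go
package main

import "fmt"

// node is a hypothetical, pared-down view of a client node.
type node struct {
	Status string
}

// statusDown is an assumed status value for this sketch.
const statusDown = "down"

// shouldSendNodeUnpublish reports whether a node unpublish can be sent.
// A nil node models one that has been GC'd.
func shouldSendNodeUnpublish(n *node) bool {
	return n != nil && n.Status != statusDown
}

// unpublishSteps lists the steps in order: node unpublish first when the
// node is reachable, then controller unpublish in every case.
func unpublishSteps(n *node) []string {
	steps := []string{}
	if shouldSendNodeUnpublish(n) {
		steps = append(steps, "node unpublish")
	}
	return append(steps, "controller unpublish")
}

func main() {
	fmt.Println(unpublishSteps(&node{Status: "ready"}))
	fmt.Println(unpublishSteps(&node{Status: statusDown}))
	fmt.Println(unpublishSteps(nil))
}
```

For a GC'd or down node, only the controller unpublish remains. As the commit message notes, that step may hang or error and need a retry if the node had not actually unpublished.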
    • Add `guest_agent` config option for QEMU driver (#12800) · f8774369
      phreakocious authored
      Add boolean 'guest_agent' config option for QEMU driver, which will
      create the socket file for the QEMU Guest Agent in the task dir when
      enabled.
  12. 08 Jun, 2022 5 commits