This project is mirrored from https://gitee.com/mirrors/nomad.git. Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
  1. 06 Sep, 2022 1 commit
    • Seth Hoenig's avatar
      core: connect with client before marking down due to missed heartbeat · 7b6b75d6
      Seth Hoenig authored
      This PR makes the Server agent(*) attempt to make a connection with a
      Client before marking it as NodeStatusDown due to a missed heartbeat.
      
      Manually double-checking responsiveness of the Client before marking
      it down paves over issues where the Client may still be fine, but
      coordination between Client, the server it is reporting status to,
      and the leader managing heartbeat timers is not lined up.
      
      (*) Who _should_ be the leader, but might not be in the immediate
      aftermath of something triggering an election.
      
      Potentially resolved causes of erroneous down-ed nodes:
      
      1. Stale leader w/ expiring timers marking nodes it should not
      2. Overloaded leader not responding to incoming update RPCs before timers expire
      7b6b75d6
  2. 01 Sep, 2022 7 commits
  3. 31 Aug, 2022 5 commits
  4. 30 Aug, 2022 11 commits
  5. 29 Aug, 2022 13 commits
  6. 26 Aug, 2022 3 commits
    • Michael Schurter's avatar
      consul: allow stale namespace results (#12953) · 728f5298
      Michael Schurter authored
      Nomad reconciles services it expects to be registered in Consul with
      what is actually registered in the local Consul agent. This is necessary
      to prevent leaking service registrations if Nomad crashes at certain
      points (or if there are bugs).
      
      When Consul has namespaces enabled, we must iterate over each available
      namespace to be sure no services were leaked into non-default
      namespaces.
      
      Since this reconciliation happens often, there's no need to require
      results from the Consul leader server. In large clusters this creates
      far more load than the "freshness" of the response is worth.
      
      Therefore this patch switches the request to AllowStale=true
      728f5298
    • Tim Gross's avatar
      d1faead3
    • Tim Gross's avatar
      file rename · 91c81ba9
      Tim Gross authored
      91c81ba9