This project is mirrored from https://gitee.com/mirrors/nomad.git.
Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
- 12 Apr, 2022 1 commit
-
-
Tim Gross authored
This changeset fixes two sources of flakiness in the event stream test. First, the stream request gets the event *closest* to the index, not the exact match. Although events are written before raft entries they're written asynchronously, so it's possible to race and get a raft index from this query higher than the current head of the event buffer. Ensure the job is running before we try to get the index, so that we've given the event enough time to land in the buffer. Second, the assertion that the found index is greater than the start index is only true if the `PlanResult` event manages to land before we do the second registration. Although it should now with the first fix above, it's not a correct assertion for what we're testing.
-
- 11 Apr, 2022 10 commits
-
-
Luiz Aoqui authored
-
claire labry authored
-
Seth Hoenig authored
feat: remove dependency to consul/lib
-
Karan Sharma authored
-
Yoan Blanc authored
Signed-off-by:
Yoan Blanc <yoan@dosimple.ch>
-
Tim Gross authored
The oversubscription test expects an output that requires the client has polled the task for stats at least once. Wait long enough to ensure that we've polled the stats before failing the test.
-
Tim Gross authored
-
Tim Gross authored
In the reconciler's filtering for tainted nodes, we use whether the server supports disconnected clients as a gate to a bunch of our logic, but this doesn't account for cases where the job doesn't have `max_client_disconnect`. The only real consequence of this appears to be that allocs on disconnected nodes are marked "complete" instead of "lost".
-
Seth Hoenig authored
fix(plugins): should return when ctx.Done
-
James Rasell authored
Some tests may chose to deregister jobs to check Nomad cleanup logic, however, it is still possible for the test to fail and exit before this is hit. This therefore adds a cancellable cleanup func which can be deferred, using context to control whether it gets run or not.
-
- 09 Apr, 2022 1 commit
-
-
Yoan Blanc authored
Signed-off-by:
Yoan Blanc <yoan@dosimple.ch>
-
- 08 Apr, 2022 12 commits
-
-
Tim Gross authored
-
Luiz Aoqui authored
-
Luiz Aoqui authored
-
fyn authored
-
Seth Hoenig authored
docs: fixup title formatting in upgrade guide
-
Seth Hoenig authored
-
Luiz Aoqui authored
-
Luiz Aoqui authored
-
James Rasell authored
-
Luiz Aoqui authored
-
Seth Hoenig authored
Add custom variable validation to docs
-
Seth Hoenig authored
-
- 07 Apr, 2022 15 commits
-
-
Thomas Wunderlich authored
Custom variable validation is a useful feature that is supported by Nomad and not just Terraform. As such it should be documented on the input variable page. I've cribbed the content from the terraform docs so this should be consistent across projects
-
Luiz Aoqui authored
-
Luiz Aoqui authored
-
Jasmine Dahilig authored
docs: add token_last_renewal and token_next_renewal to server metrics and key metrics #12435 (#12505)
-
hc-github-team-nomad-core authored
-
Luiz Aoqui authored
-
Luiz Aoqui authored
-
Phil Renaud authored
* Capitalize methods * Let ESLint yell at us again * Dasherize
-
Tim Gross authored
We introduced a `pprof-interval` argument to `operator debug` in #11938, and unfortunately this has resulted in a lot of test flakes. The actual command in use is mostly fine (although I've fixed some quirks here), so what's really happened is that the change has revealed some existing issues in the tests. Summary of changes: * Make first pprof collection synchronous to preserve the existing behavior for the common case where the pprof interval matches the duration. * Clamp `operator debug` pprof timing to that of the command. The `pprof-duration` should be no more than `duration` and the `pprof-interval` should be no more than `pprof-duration`. Clamp the values rather than throwing errors, which could change the commands that existing users might already have in debugging scripts * Testing: remove test parallelism The `operator debug` tests that stand up servers can't be run in parallel, because we don't have a way of canceling the API calls for pprof. The agent will still be running the last pprof when we exit, and that breaks the next test that talks to that same agent. (Because you can only run one pprof at a time on any process!) We could split off each subtest into its own server, but this test suite is already very slow. In future work we should fix this "for real" by making the API call cancelable. * Testing: assert against unexpected errors in `operator debug` tests. If we assert there are no unexpected error outputs, it's easier for the developer to debug when something is going wrong with the tests because the error output will be presented as a failing test, rather than just a failing exit code check. Or worse, no failing exit code check! This also forces us to be explicit about which tests will return 0 exit codes but still emit (presumably ignorable) error outputs. Additional minor bug fixes (mostly in tests) and test refactorings: * Fix text alignment on pprof Duration in `operator debug` output * Remove "done" channel from `operator debug` event stream test. The goroutine we're blocking for here already tells us it's done by sending a value, so block on that instead of an extraneous channel * Event stream test timer should start at current time, not zero * Remove noise from `operator debug` test log output. The `t.Logf` calls already are picked out from the rest of the test output by being prefixed with the filename. * Remove explicit pprof args so we use the defaults clamped from duration/interval
-
Seth Hoenig authored
client: set environment variable indicating set of reserved cpu cores
-
James Rasell authored
-
Seth Hoenig authored
-
Lars Lehtonen authored
-
Seth Hoenig authored
This PR injects the 'NOMAD_CPU_CORES' environment variable into tasks that have been allocated reserved cpu cores. The value uses normal cpuset notation, as found in cpuset.cpu cgroup interface files. Note this value is not necessiarly the same as the content of the actual cpuset.cpus interface file, which will also include shared cpu cores when using cgroups v2. This variable is a workaround for users who used to be able to read the reserved cgroup cpuset file, but lose the information about distinct reserved cores when using cgroups v2. Side discussion in: https://github.com/hashicorp/nomad/issues/12374
-
Derek Strickland authored
-
- 06 Apr, 2022 1 commit
-
-
Tim Gross authored
We expect every Nomad API client to use a single connection to any given agent, so take advantage of keep-alive by switching the default transport to `DefaultPooledClient`. Provide a facility to close idle connections for testing purposes. Restores the previously reverted #12409 Co-authored-by:
Ben Buzbee <bbuzbee@cloudflare.com>
-