This project is mirrored from https://gitee.com/mirrors/nomad.git.
Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
- 18 Jun, 2019 2 commits
-
-
Chris Baker authored
-
Chris Baker authored
-
- 17 Jun, 2019 1 commit
-
-
Chris Baker authored
-
- 12 Jun, 2019 2 commits
-
-
Danielle Lancashire authored
-
Danielle Lancashire authored
We currently only run cleanup Service Hooks when a task is either Killed, or Exited. However, due to the implementation of a task runner, tasks are only Exited if they every correctly started running, which is not true when you recieve an error early in the task start flow, such as not being able to pull secrets from Vault. This updates the service hook to also call consul deregistration routines during a task Stop lifecycle event, to ensure that any registered checks and services are cleared in such cases. fixes #5770
-
- 14 May, 2019 3 commits
-
-
Michael Schurter authored
Registration and restoring allocs don't share state or depend on each other in any way (syncing allocs with servers is done outside of registration). Since restoring is synchronous, start the registration goroutine first. For nodes with lots of allocs to restore or close to their heartbeat deadline, this could be the difference between becoming "lost" or not.
-
Michael Schurter authored
Refactoring of 104067bc Switch the MarkLive method for a chan that is closed by the client. Thanks to @notnoop for the idea! The old approach called a method on most existing ARs and TRs on every runAllocs call. The new approach does a once.Do call in runAllocs to accomplish the same thing with less work. Able to remove the gate abstraction that did much more than was needed.
-
Michael Schurter authored
Fixes #1795 Running restored allocations and pulling what allocations to run from the server happen concurrently. This means that if a client is rebooted, and has its allocations rescheduled, it may restart the dead allocations before it contacts the server and determines they should be dead. This commit makes tasks that fail to reattach on restore wait until the server is contacted before restarting.
-
- 02 Apr, 2019 2 commits
-
-
Michael Schurter authored
This reverts commit cb36f453.
-
Michael Schurter authored
-
- 01 Apr, 2019 1 commit
-
-
Michael Schurter authored
Avoid libcontainer.Process trying to lookup the binary via $PATH as the executor has already found where the binary is located.
-
- 25 Mar, 2019 1 commit
-
-
Michael Schurter authored
I chose to make them more of integration tests since there's a lot more plumbing involved. The internal implementation details of how we craft task envs can now change and these tests will still properly assert the task runtime environment is setup properly.
-
- 04 Mar, 2019 1 commit
-
-
Michael Schurter authored
The test is sadly quite complicated and peeks into things (logmon's reattach config) AR doesn't normally have access to. However, I couldn't find another way of asserting logmon got cleaned up without resorting to smaller unit tests. Smaller unit tests risk re-implementing dependencies in an unrealistic way, so I opted for an ugly integration test.
-
- 01 Mar, 2019 1 commit
-
-
Michael Schurter authored
This commit is a significant change. TR.Run is now always executed, even for terminal allocations. This was changed to allow TR.Run to cleanup (run stop hooks) if a handle was recovered. This is intended to handle the case of Nomad receiving a DesiredStatus=Stop allocation update, persisting it, but crashing before stopping AR/TR. The commit also renames task runner hook data as it was very easy to accidently set state on Requests instead of Responses using the old field names.
-
- 22 Feb, 2019 7 commits
-
-
Mahmood Ali authored
When Vault token expires and task is restarted, emit `TaskRestartSignal` similar to v0.8.7
-
Mahmood Ali authored
-
- 21 Feb, 2019 3 commits
-
-
Michael Schurter authored
Fixes restarting on recoverable errors from StartTask. Ports TestTaskRunner_Run_RecoverableStartError from 0.8 which discovered the bug.
-
Michael Schurter authored
-
Michael Schurter authored
-
- 20 Feb, 2019 2 commits
-
-
Michael Schurter authored
0.9.0beta2 contains a regression where artifact download errors would not cause a task restart and instead immediately fail the task. This restores the pre-0.9 behavior of retrying all artifact errors and adds missing tests.
-
Michael Schurter authored
Adds a new helper and removes a duplicated test.
-
- 14 Feb, 2019 2 commits
-
-
Mahmood Ali authored
Co-Authored-By:
schmichael <michael.schurter@gmail.com>
-
Mahmood Ali authored
Co-Authored-By:
schmichael <michael.schurter@gmail.com>
-
- 12 Feb, 2019 4 commits
-
-
Michael Schurter authored
-
Michael Schurter authored
Broke ShutdownDelay but the test was timing dependent so it just appeared flaky. Made the test slower so that it should never incorrectly pass.
-
Michael Schurter authored
-
Michael Schurter authored
Also fix race conditions in the mock vault client.
-
- 22 Jan, 2019 3 commits
-
-
Michael Schurter authored
Added ability to adjust the number of events the TaskRunner keeps as there's no way to observe all events otherwise. Task events differ slightly from 0.8 because 0.9 emits Terminated every time a task exits instead of only when it exits on its own (not due to restart or kill). 0.9 does not emit Killing/Killed for restarts like 0.8 which seems fine as `Restart Signaled/Terminated/Restarting` is more descriptive. Original v0.8 events emitted: ``` expected := []string{ "Received", "Task Setup", "Started", "Restart Signaled", "Killing", "Killed", "Restarting", "Started", "Restart Signaled", "Killing", "Killed", "Restarting", "Started", "Restart Signaled", "Killing", "Killed", "Not Restarting", } ```
-
Michael Schurter authored
-
Michael Schurter authored
Also fix signal error handling in mock_driver.
-
- 18 Jan, 2019 1 commit
-
-
Chris Baker authored
-
- 15 Jan, 2019 1 commit
-
-
Michael Schurter authored
Porting test from 0.8 to 0.9.
-
- 14 Jan, 2019 1 commit
-
-
Michael Schurter authored
Restore a pre-0.9 test that asserts Consul services are deregistered before a task's shutdown delay.
-
- 07 Jan, 2019 1 commit
-
-
Alex Dadgar authored
-
- 19 Dec, 2018 1 commit
-
-
Nick Ethier authored
-