This project is mirrored from https://gitee.com/cowcomic/pixie.git. Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer.
  1. 06 Mar, 2021 1 commit
    • Vihang Mehta's avatar
      Drop already applied updates correctly · b78983e6
      Vihang Mehta authored
      Summary:
      This check wasn't handling the fact that we might see the same update
      multiple times. As a result we were requesting missing updates with equal from
      and to values and holding on to that connection for 20 minutes until it timed
      out. This should get rid of such spurious requests.
      
      Test Plan:
      The async nature of this and double delivery of stan is hard to
      reproduce in a test environment. I think it's doable but will require some
      rewriting of the existing test. Will be added in a followup.
      
      Reviewers: michelle, #engineering
      
      Reviewed By: michelle, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7589
      
      GitOrigin-RevId: 56f52f4538563619458f6ac3d7b1f611f6924f5d
      b78983e6
  2. 05 Mar, 2021 2 commits
  3. 04 Mar, 2021 1 commit
    • James Bartlett's avatar
      Add Request Path Endpoint Clustering. · 555e72ba
      James Bartlett authored
      Summary:
      Adds udfs/udas for clustering request paths into unique endpoints. For instance, "/product/ABCD", "/product/EFGH"... will get clustered into one "/product/*" endpoint.
      
      These should be considered beta functions because there are some performance issues associated with using them.
      
      Separate diff on public github repo for beta scripts to use these udfs/udas.
      
      Test Plan: Added some basic tests, and tested with scripts in github diff.
      
      Reviewers: nserrino, zasgar, #engineering
      
      Reviewed By: nserrino, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7559
      
      GitOrigin-RevId: e21781d05ff16a1fd95b6a3e16db90fd459994f1
      555e72ba
  4. 05 Mar, 2021 4 commits
    • Omid Azizi's avatar
      MMap detection: Reduce overhead with back-off · 914e99ef
      Omid Azizi authored
      Summary: Rescan: exponential back-off per PID
      
      Test Plan: dyn_lib_trace_bpf_test
      
      Reviewers: #engineering, yzhao
      
      Reviewed By: #engineering, yzhao
      
      Subscribers: yzhao
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7303
      
      GitOrigin-RevId: 56ed7224e4657d0642612f13684ae85b03e68c04
      914e99ef
    • Natalie Serrino's avatar
      PP-2486: small query broker cleanup · 5422c1de
      Natalie Serrino authored
      Summary: Vihang left a comment on D7579 after I landed it to simply the return statement in the query result forwarder. This diff addresses that and also ensures that a message that I needed when debugging a New Relic cluster will be printed out.
      
      Test Plan: existing
      
      Reviewers: vihang, michelle, #engineering
      
      Reviewed By: vihang, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7586
      
      GitOrigin-RevId: 6ee0445b07a4e8d5562e82f144e1c9aee635846b
      5422c1de
    • Yaxiong Zhao's avatar
      Do not disable when CIDR blocks are empty · b11183cf
      Yaxiong Zhao authored
      Summary:
      There is a small risk that when CIDR blocks were never updated, such connections would stay open for
      ever. But that should be extremely rare, and if that happens, MDS probably was busted anyway,
      which would be a more severe issue.
      
      Test Plan: Manual testing
      
      Reviewers: oazizi, #engineering
      
      Reviewed By: oazizi, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7560
      
      GitOrigin-RevId: 133089bff1035ad100984703db778c4a8a44af42
      b11183cf
    • Michelle Nguyen's avatar
      Add AKS to CLI allowlist checks · 6cfacefc
      Michelle Nguyen authored
      Summary:
      AKS clusters work with pixie, so we should add it to the cli allowlist. checking for aks clusters is more annoying than gke/eks clusters, since they dont have a special k8s version we can just check.
      instead we have to take the same route that we do with minikube, where we check the azure CLI to see if the cluster is in its list of clusters.
      
      Test Plan: created aks cluster and made sure it allowed me to deploy without warnings
      
      Reviewers: zasgar, oazizi, vihang, #engineering
      
      Reviewed By: oazizi, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7585
      
      GitOrigin-RevId: 97ea84fccfd0f681e79a5d303a72121ed1010ec8
      6cfacefc
  5. 04 Mar, 2021 1 commit
  6. 05 Mar, 2021 2 commits
  7. 04 Mar, 2021 2 commits
    • Yaxiong Zhao's avatar
      Add a step to deploy pixie to stirling-perf cluster · 219728a9
      Yaxiong Zhao authored
      Test Plan:
      Manually tested with this step inside the main diff build, and verify that the deployment was done.
      
      The deployment can fail because of pixie deployment not finished successfully, for example,
      some pods are not healthy. We'll gradually improve the situation in the future.
      
      Reviewers: oazizi, #engineering, vihang
      
      Reviewed By: oazizi, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7504
      
      GitOrigin-RevId: 7abbf2ea29def2f2d5c31b243c83da8973230743
      219728a9
    • Natalie Serrino's avatar
      PP-2486: Propagate correct errors in query broker execution · dd49d643
      Natalie Serrino authored
      Summary:
      When debugging the metadata UDF seg fault, it became evident that even if there is an error on the agent side, the query broker will always respond with an error that says that the client (aka UI or CLI) closed the stream. This was because of the following sequence of action that resulted in the actual error being silenced:
      1. Error on the agent side triggers the signal to close the client stream
      2. Closing the client stream
      3. Returning an error that the client stream has been closed, even though it was only closed as the result of a non-client related error.
      This diff ensures that the errors are actually propagated correctly and not silenced.
      
      Test Plan: edited tests
      
      Reviewers: vihang, michelle, zasgar, #engineering
      
      Reviewed By: michelle, #engineering
      
      JIRA Issues: PP-2486
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7579
      
      GitOrigin-RevId: a86cc746e874a694287a47259c112aac30ff4658
      dd49d643
  8. 05 Mar, 2021 2 commits
    • Vishal Jain's avatar
      Create a statefulset and move etcd operator deployment. · 0f6f6afc
      Vishal Jain authored
      Summary: Fixes bugs with upgrade by changing persistent metadata deployment to a statefulset. Also separates the kustomize file order.
      
      Test Plan: Manual. px deploy with persistent store, then upgraded with skaffold to check it was working.
      
      Reviewers: vihang, nserrino, michelle, #engineering
      
      Reviewed By: nserrino, michelle, #engineering
      
      JIRA Issues: PP-2483
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7574
      
      GitOrigin-RevId: cd1dfc681017fcb34e5966569e2c4feab73d3787
      0f6f6afc
    • Natalie Serrino's avatar
      PP-2486: Fix metadata UDF seg fault · 85b5077f
      Natalie Serrino authored
      Summary: Large clusters are more likely to have lots of k8s metadata which eventually gets expired. We were noticing that large clusters had queries that would never complete. I determined that this was due to a seg fault in Kelvin (problem 1) that was then handled weirdly by the query broker (problem 2). This diff fixes problem 1, the seg faulting in metadata UDFs causing a Kelvin crash when a pod that we didn't have metadata for was queries. Problem 2 will be fixed in a subsequent diff.
      
      Test Plan: added unit tests for missing/invalid inputs.
      
      Reviewers: michelle, vihang, vjain, zasgar, #engineering
      
      Reviewed By: michelle, vihang, #engineering
      
      JIRA Issues: PP-2486
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7578
      
      GitOrigin-RevId: 0228789fd2c44629fc64e676303994f882c574d6
      85b5077f
  9. 04 Mar, 2021 11 commits
  10. 03 Mar, 2021 2 commits
    • Omid Azizi's avatar
      Integrate TaskStructResolver into BCCWrapper · 5fd8a8e3
      Omid Azizi authored
      Summary: This diff enables the TaskStructResolver for BCC instances that can't find Linux headers on the host.
      
      Test Plan: Jenkins is not a good test environment, because it does have host headers. Still thinking of ways of testing this. Will not release until more rigorously tested.
      
      Reviewers: yzhao, jps, rcheng, #engineering
      
      Reviewed By: jps
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7525
      
      GitOrigin-RevId: 5a8feba523770e63d089226d13225dfbab831f72
      5fd8a8e3
    • Omid Azizi's avatar
      Stirlng profiler k8s scripts · a78e127d
      Omid Azizi authored
      Summary: Some scripts for deploying on k8s.
      
      Test Plan: Manual
      
      Reviewers: yzhao, jps, #engineering
      
      Reviewed By: yzhao, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7545
      
      GitOrigin-RevId: 30002a264c655d41878a195ad584a1f0e474ead3
      a78e127d
  11. 04 Mar, 2021 1 commit
    • Omid Azizi's avatar
      Standalone Stirling profiler · fb347078
      Omid Azizi authored
      Summary:
      Made this to investigate the performance issues on gke:prod,
      but good to have around.
      
      Test Plan: None
      
      Reviewers: jps, yzhao, rcheng, #engineering
      
      Reviewed By: yzhao, #engineering
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7537
      
      GitOrigin-RevId: 0146c01421f82dd290409d2f2d98e9cf4761aeee
      fb347078
  12. 03 Mar, 2021 5 commits
  13. 02 Mar, 2021 2 commits
    • Natalie Serrino's avatar
      PP-2445: Add ability to look up UDF source executor to registry info class · 2c010c3c
      Natalie Serrino authored
      Summary: Depends on D7528, which added UDF source executor as a concept to scalar UDFs, re-using the same approach as UDTF. This diff adds the ability to the registry info class to look up which UDF Source executor type a given scalar UDF has.
      
      Test Plan: added unit test
      
      Reviewers: philkuz, zasgar, jamesbartlett, michelle, #engineering
      
      Reviewed By: philkuz, #engineering
      
      JIRA Issues: PP-2445
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7530
      
      GitOrigin-RevId: 8518fd3bbddeeaf6e32ccb5159c41ca5b2e405fc
      2c010c3c
    • Nick Lanam's avatar
      Prevent running a script without specifying required args; warn · c2d045d4
      Nick Lanam authored
      Summary:
      An argument to a script is required if its variable definition omits `defaultValue`. The UI shows this with an asterisk. This diff also refuses to run the script, and warns the user, if they try to run a script with such an argument missing.
      
      In this image, `defaultValue` is omitted in the vis spec, the argument is marked with an asterisk, the value is missing, and the run button was pressed.
      {F115650}
      
      Test Plan: Open a script, remove the `defaultValue` field from one of its variables in the vis spec, and then try to run it with the arg missing. It should refuse, and tell you to specify the arg.
      
      Reviewers: michelle, vihang, #engineering
      
      Reviewed By: michelle, #engineering
      
      Subscribers: nserrino
      
      JIRA Issues: PC-548
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7536
      
      GitOrigin-RevId: 0d9679bafe0c3f250a8e78af86a45d0f1f474977
      c2d045d4
  14. 03 Mar, 2021 1 commit
  15. 01 Mar, 2021 1 commit
    • Natalie Serrino's avatar
      PP-2445: Add source executor to scalar UDF (similar to UDTF) · 6e7726c9
      Natalie Serrino authored
      Summary: Similar to UDTFs, some UDFs should only be executed on Kelvin or PEM Carnot instances. This diff follows the same protocol used for UDTF in order to add that distinction for UDF. This will be used to help plan queries so that Kelvin-only UDFs are scheduled on Kelvin, and also alert the user with a compiler error if they are trying to run a PEM-only UDF after a blocking node (aka on a Kelvin node). While some aspects of how we handle metadata and compilation of distributed plans will change in the future with Carnot ETL, the concept of certain UDFs needing to execute in on a specific type of agent is expected to stick around in some form.
      
      Test Plan: updated unit tests
      
      Reviewers: philkuz, zasgar, jamesbartlett, michelle, #engineering
      
      Reviewed By: philkuz, #engineering
      
      JIRA Issues: PP-2445
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7528
      
      GitOrigin-RevId: 2e8674fd914978926bb3f0e52831f175acb76bcb
      6e7726c9
  16. 25 Feb, 2021 1 commit
  17. 03 Mar, 2021 1 commit
    • Vihang Mehta's avatar
      Add pprof handlers to all of our go servers · a99dca42
      Vihang Mehta authored
      Summary:
      This makes it easier to grab a profile on demand if there are any
      problems with the service.
      Note, if running `go tool pprof` on a port forwarded service, use `https+insecure`
      as the scheme to ignore the TLS cert.
      
      Test Plan:
      Deploy pixie cloud with skaffold. Portforward one of the services.
      Run `go tool pprof`, ensure that we can inspect profiles.
      
      Reviewers: zasgar, michelle, #engineering
      
      Reviewed By: zasgar, #engineering
      
      Subscribers: yzhao
      
      Differential Revision: https://phab.corp.pixielabs.ai/D7538
      
      GitOrigin-RevId: c8db56d7dcb0afe0164bebf2180c52432ed4972b
      a99dca42