layer5io / meshery-smp-action
GitHub Action for pipelining microservices and Kubernetes performance testing with Meshery
Home Page: https://layer5.io/projects/nighthawk
License: Apache License 2.0
Current tests use bash scripts to deploy service meshes and apps, but they should be using `mesheryctl mesh` and `mesheryctl app` instead.
Performance test profiles are currently named `{service-mesh}-{load-generator}-{test-configuration}.yaml`, which is unwieldy and hard to understand.
The goal of this issue is to discuss and decide on a profile naming scheme that makes it clear what is being tested.
Some CNCF runners are not being removed after tests finish, and their number gradually increases over time.
We can delete them manually, but it's better to make sure they are removed automatically.
The same problem occurred with Equinix server deletion.
We should add retries and confirmations to ensure CNCF runners and Equinix machines are removed.
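A minimal sketch of the retry-and-confirm idea: `delete_runner` is a placeholder for the real teardown command (e.g. an Equinix Metal API call or the CNCF runner deregistration step), and `RETRY_DELAY` is a hypothetical knob for the back-off interval.

```shell
# Sketch only: delete_runner stands in for the real teardown command.
delete_with_retry() {
  local target="$1" attempts=0 max_attempts=5
  until delete_runner "$target"; do
    attempts=$((attempts + 1))
    if [ "$attempts" -ge "$max_attempts" ]; then
      echo "failed to remove $target after $max_attempts attempts" >&2
      return 1
    fi
    # back off before retrying; RETRY_DELAY can be overridden in tests
    sleep "${RETRY_DELAY:-30}"
  done
  echo "$target removed"
}
```

The confirmation half would be a follow-up query (list remaining runners/devices) asserting the target no longer appears.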
The project's current issue templates are missing an open invitation link where new contributors can join the community's Figma team and view user interface designs and other UX projects.
Each template that references Figma in its resources section should have the invite link added:
- 🎨 Wireframes and [designs for Meshery UI](https://www.figma.com/file/SMP3zxOjZztdOLtgN4dS2W/Meshery-UI) in Figma [(open invite)](https://www.figma.com/team_invite/redeem/qJy1c95qirjgWQODApilR9)
Acceptance Tests
All references to Figma include the "open invite" link.
The Scheduled Benchmark Tests workflow creates dynamic test names based on the configuration of the test. Current format is shown below.
So, a sample test name now is: istio-fortio-load-test.yaml
Remove the `.yaml` extension from the test name and only include the rest of the file name.
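Stripping the extension is a one-line change with shell parameter expansion; `profile_file` here is an illustrative stand-in for the name the workflow generates.

```shell
# Illustrative; profile_file stands in for the workflow's generated name.
profile_file="istio-fortio-load-test.yaml"
test_name="${profile_file%.yaml}"   # strip the trailing .yaml extension
echo "$test_name"                   # istio-fortio-load-test
```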
Current State:
Template content in the readme.
Desired State:
Project-specific content in the readme.
Contributor Resources
Currently we are
The instructions are mentioned here
Ideally both of these tasks should be automated. We can use Terraform and its existing support for the Equinix APIs to achieve this:
https://github.com/equinix/terraform-provider-equinix
https://github.com/equinix/cloud-provider-equinix-metal
https://github.com/machulav/ec2-github-runner#example
Successful action runs with complete automation would solve this issue
@leecalcote @gyohuangxin would creating a runner on demand (i.e. after starting a workflow) mean that the self-hosted runner itself would not be needed, given that we have to register a self-hosted runner to a repository first?
https://docs.github.com/en/actions/hosting-your-own-runners/adding-self-hosted-runners
The application deployed by the SMP GitHub action is not reachable by Meshery.
Root cause: `minikube tunnel` is easily killed.

Tests currently run on a 24-hour period.
As the project ramps the diversity of testing, it would be good to get these results generated more frequently in order to iterate more quickly on test harness updates and performance test profiles.
Increase frequency of self-hosted performance tests to once per hour
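For reference, the change would be a one-line edit to the workflow's trigger; this is a sketch, and the cron expression `0 * * * *` (top of every hour) is an assumption about the desired cadence:

```yaml
on:
  schedule:
    # hourly instead of daily; adjust once the harness is stable
    - cron: '0 * * * *'
```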
The SMP benchmark tests that run as GitHub Actions are currently failing.
We want to figure out what is causing this issue and fix it so that correct, error-free performance tests are published on the SMP Dashboard.
Currently we use `app onboard` to deploy manifests.
We should use `pattern apply` where possible, as that gives us flexibility and configurability.
We have included Istio, Linkerd, and OSM performance tests.
We should enable performance tests for the other service meshes listed on https://smp-spec.io/dashboard.
Writing bash scripts for each mesh is time-consuming, so we should use `mesheryctl` to deploy them.
We should also use `mesheryctl app onboard` to deploy sample apps once #48 is ready.
We use `mesheryctl pattern apply` to deploy applications and manifests on the Istio mesh, but it fails sometimes, even though we increased the sleep time: https://github.com/layer5io/meshery-smp-action/runs/8251690332?check_suite_focus=true#step:5:44
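Rather than a longer fixed sleep, the step could retry on failure. A sketch: `apply_cmd` wraps the real call so the retry logic can be exercised without a live Meshery server, and the pattern file path is a placeholder.

```shell
# Sketch of a retry wrapper around the flaky step; the file path is a
# placeholder, not the real pattern file.
apply_cmd() { mesheryctl pattern apply -f "$1"; }

retry_apply() {
  local file="$1" tries="${2:-5}" delay="${3:-20}"
  local i=1
  while [ "$i" -le "$tries" ]; do
    if apply_cmd "$file"; then
      return 0
    fi
    echo "pattern apply failed (attempt $i/$tries), retrying in ${delay}s" >&2
    sleep "$delay"
    i=$((i + 1))
  done
  return 1
}
```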
If any job in this workflow fails, an email should be sent to [email protected] with details of the failure - https://github.com/meshery/meshery/blob/master/.github/workflows/build-and-release-stable.yml
Multiple issues of the Scheduled Benchmark Tests on Self Hosted Runner generate different results
In this run just one combination failed: https://github.com/layer5io/meshery-smp-action/runs/7770247406 (linkerd-wrk-soak)
In this run a couple failed: https://github.com/layer5io/meshery-smp-action/actions/runs/2832186044
And in this run the runner startup itself failed: https://github.com/layer5io/meshery-smp-action/actions/runs/2833335073
The behaviour should be consistent across Scheduled Benchmark Test runs on self-hosted runners. Given that we are working with external hardware where connections can fail, ideally we should have a retry mechanism, and we should work to considerably reduce the inconsistency.
In the logs, authentication seems to be a common culprit:
Opening Meshery (http://localhost:31391) in browser.
Failed to open Meshery in browser, please point your browser to http://localhost:31391 to access Meshery.
authentication failed: Get "http://localhost:31391/api/providers": dial tcp [::1]:31391: connect: connection refused
Verifying prerequisites...
Authentication token not found. please supply a valid user token with the --token (or -t) flag. or login with `mesheryctl system login`
Onboarding application... Standby for few minutes...
Error: Authentication token not found. please supply a valid user token with the --token (or -t) flag. or login with `mesheryctl system login`
This is seen for Istio:
Opening Meshery (http://192.168.49.2:32398/) in browser.
Failed to open Meshery in browser, please point your browser to http://192.168.49.2:32398/ to access Meshery.
Verifying prerequisites...
Adapter for required mesh not found
Onboarding application... Standby for few minutes...
rpc error: code = Unknown desc = no matches for kind "Gateway" in version "networking.istio.io/v1alpha3"
Error from server (NotFound): namespaces "istio-system" not found
Error from server (NotFound): namespaces "istio-system" not found
Service Mesh: Istio - ISTIO
Gateway URL: http://192.168.49.2:
Current State:
No newcomers-alert.yml
Out of date slack.yml
Desired State:
Updated newcomers-alert.yml and slack.yml
Contributor Resources
We had an implementation of running SMP on a self-hosted runner (#39), but the configuration of the self-hosted runner is hardcoded as "c3.small.x86".
We should make the self-hosted runner configurable via the workflow's options, e.g. server type, location, etc.
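One way to surface these options is through workflow inputs. This is a sketch only; the input names (`server_type`, `location`) and defaults are assumptions, not the workflow's actual option names:

```yaml
on:
  workflow_dispatch:
    inputs:
      server_type:
        description: 'Equinix Metal server plan'
        default: 'c3.small.x86'
      location:
        description: 'Equinix Metal metro'
        default: 'da'
```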
The Scheduled Benchmark Tests workflow runs performance benchmark tests at regular intervals and captures the test results.
It runs the tests defined in these two test configuration files.
These test configurations are not yet defined properly.
Define proper test configurations to run benchmark tests.
Update the test configuration files linked above.
Description
Provide a sample performance test which would be done using this action so that users can consider what runner types and benchmark configurations should be used in other tests.
The Meshery SMP Action is failing on some CI runs because the Kubernetes cluster in the workflow is inaccessible.
The error must be rectified so that CI runs are free of errors.
Currently only the Istio adapter has a pattern file and hence can use the `pattern apply` construct. For the Linkerd and OSM tests we need to use `app onboard` to apply their demo Kubernetes manifests.
We should run a common `pattern apply` method across all meshes.
Runs showing that `pattern apply` works for all meshes.
As we start using this action, we also need to take note of environment specifications like the test configurations and GitHub runner specs, as well as review the results of the performance benchmarks in this environment.
The scheduled tests that run multiple times a day have faced a few challenges. Notably, one of those challenges is in the cleanup phase once a test is complete. Currently, it is frequently the case that some number of the bare metal servers used for testing are orphaned and not decommissioned at the end of each test. This leaves an inordinate number of bare metal servers unnecessarily unavailable for use by other projects.
@vielmetti has been most helpful in identifying ways to mitigate this from happening.
All resources provisioned for a scheduled test are subsequently decommissioned at the end of that same test.
Recently @vielmetti pointed this out:
You can create servers that will auto-delete themselves at a time certain, perfect for test runs. See https://deploy.equinix.com/developers/docs/metal/deploy/spot-market/#spot-market-request-creation. You want the “end_at” parameter on the API endpoint for device creation
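A sketch of how that could look, following the quoted docs: compute an end time one hour ahead and pass it as `end_at` at device creation. `PROJECT_ID` and `METAL_AUTH_TOKEN` are placeholders, the plan/metro/OS values are illustrative, and the request only fires when a token is set.

```shell
# Compute an auto-delete time one hour from now (GNU date).
end_at="$(date -u -d '+1 hour' '+%Y-%m-%dT%H:%M:%SZ')"

# Placeholder credentials; request is skipped when no token is set.
if [ -n "${METAL_AUTH_TOKEN:-}" ]; then
  curl -s -X POST "https://api.equinix.com/metal/v1/projects/${PROJECT_ID}/devices" \
    -H "X-Auth-Token: ${METAL_AUTH_TOKEN}" \
    -H 'Content-Type: application/json' \
    -d "{\"plan\":\"c3.small.x86\",\"metro\":\"da\",\"operating_system\":\"ubuntu_22_04\",\"end_at\":\"${end_at}\"}"
fi
```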
A panic error occurs when running the benchmark test either on a GitHub runner or a CNCF cluster runner.
Logs for GitHub runner:
https://github.com/layer5io/meshery-smp-action/runs/5493977248?check_suite_focus=true#step:6:1573
Logs for CNCF cluster runner:
https://github.com/gyohuangxin/meshery-smp-action/runs/5462624684?check_suite_focus=true#step:6:1174
Regarding #38, we have implemented running SMP on a self-hosted CNCF cluster, and the code has been merged into the self-hosted branch. We should cherry-pick it to the master branch.
Description
This repository is meant for the Meshery GitHub Action for performing SMP tests.
Using some boilerplate code from https://github.com/layer5io/meshery-smi-conformance-action, initialize this action to use `mesheryctl perf` subcommands for creating a performance test.
Currently we arbitrarily wait 10 minutes for the server to be provisioned and started.
We want to optimise our waiting time by polling the state variable reported by the machine.
The logic for this can be added in the above-mentioned bash script.
A little experimentation might be required to tune the polling interval. (We do not want to cause anything that might look like a DoS attack.)
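A minimal sketch of the polling loop: `get_state` is a placeholder for the real API call (e.g. a curl to the Equinix Metal device endpoint returning the `state` field), and `"active"` is assumed to be the ready state.

```shell
# Sketch only: get_state stands in for the real device-state API call.
wait_until_active() {
  local device="$1" timeout="${2:-600}" interval="${3:-15}" waited=0
  while [ "$waited" -lt "$timeout" ]; do
    if [ "$(get_state "$device")" = "active" ]; then
      return 0
    fi
    sleep "$interval"            # poll gently; don't hammer the API
    waited=$((waited + interval))
  done
  echo "timed out waiting for $device to become active" >&2
  return 1
}
```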
A link to a successful self-hosted workflow run will be key to getting your changes accepted.
These versions are highlighted in the readme as being used by the action currently:
minikube version: 'v1.21.0'
kubernetes version: 'v1.20.7'
The latest versions of these tools are:
minikube version: 'v1.30.1'
kubernetes version: 'v1.27.3'
List this repo's name in the slack.yml workflow.
Currently the tests run with the sample application of each particular service mesh, which differs per mesh.
Add the sample application as a configurable item and run tests across multiple applications for each service mesh.
A new field should be added here, and the scripts need to be changed to take in this dynamic value.
Open Service Mesh is an archived project. Its performance testing here can be removed.