cburgmer / buildviz
Transparency for your build pipeline's results and runtime
Home Page: https://buildviz.cburgmer.space/
License: BSD 2-Clause "Simplified" License
When storing job information, fail fast on unknown parameters so as to provide early feedback.
Currently dots seem to trigger a 400 from the buildviz server.
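The fail-fast idea could look like the following Python sketch. The set of known keys is hypothetical, not buildviz's actual schema:

```python
# Hypothetical sketch: reject unknown top-level keys in a stored job payload
# up front, instead of silently dropping them. KNOWN_KEYS is an assumption.
KNOWN_KEYS = {"start", "end", "outcome", "inputs", "triggered-by"}

def validate_job_payload(payload):
    unknown = set(payload) - KNOWN_KEYS
    if unknown:
        raise ValueError("unknown parameters: %s" % ", ".join(sorted(unknown)))
    return payload
```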
Let the sync jobs be identifiable, to give API providers some way of understanding their traffic.
Showing older test names (or even a temporary rename which will result in a single datapoint and so be prone to outliers) does not provide value. Filtering those out might be a trivial and effective solution.
Sync should stop at the first ongoing build so we can resume later.
transform_tests.clj:7 disallows colons in test names. We should be more lenient, considering that we are guessing anyways.
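A more lenient rule could only reject names that are clearly broken, e.g. those containing control characters, rather than banning colons. A Python sketch of that idea (the actual rule lives in transform_tests.clj):

```python
import re

# Illustrative, more lenient check: allow colons in test names and only
# refuse names containing control characters.
def acceptable_test_name(name):
    return not re.search(r"[\x00-\x1f]", name)
```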
The buildviz.jenkins.sync job fails with a String format exception if a percentage sign shows up in the base URL (say due to basic auth).
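The failure mode mirrors what happens when a URL is passed into a format string. A Python sketch of the problem and one possible fix (escaping the percent sign before formatting); the URL is made up:

```python
# Hypothetical URL with an encoded '@' ('%40') from basic auth credentials.
base_url = "https://user:p%40ss@jenkins.example.org"

# Using the URL itself as the format pattern blows up, because '%40' is
# read as a format directive (mirroring Java's String.format behaviour):
try:
    base_url % ()
    broke = False
except (TypeError, ValueError):
    broke = True
assert broke

# Escaping '%' as '%%' before formatting makes the pattern safe again:
escaped = base_url.replace("%", "%%")
assert escaped % () == base_url
```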
Although http://llg.cubic.org/docs/junit/ describes classname as required, the JSON schema does not require a classname.
For several inputs, the ordering of the revision with its source_id matters: two identical inputs in a different order will not be recognised as the same overall build input.
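One way to make the comparison order-independent is to normalise the inputs before comparing, e.g. by sorting on source_id. A Python sketch under an assumed input shape:

```python
# Sketch: sort build inputs by source_id so the same set of inputs in a
# different order counts as the same build input. The dict shape is assumed.
def normalise_inputs(inputs):
    return sorted(inputs, key=lambda i: str(i["source_id"]))

a = [{"source_id": "git", "revision": "abc"}, {"source_id": "svn", "revision": "1"}]
b = [{"source_id": "svn", "revision": "1"}, {"source_id": "git", "revision": "abc"}]
assert normalise_inputs(a) == normalise_inputs(b)
```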
http://cburgmer.github.io/buildviz/ci.jenkins-ci.org/#graph_averageTestRuntime has one burst per test class, and we can't shine with the hierarchical structure that we normally offer.
Null values are not caught for the required start field. Probably fixed upstream, but not released: bigmlcom/closchema#35
Pipeline triggers are modelled on stage level. Modelling this on job level doesn't seem to be straightforward.
Working with an outdated sync can be tedious and unintuitive, as the build data slowly vanishes from the graphs: newer dates are just empty if no new sync has been run. The examples inside the repo plainly show nothing for historical builds that are, say, a year old.
Rather, let's just show the last x days/months, offset from the latest build synched.
Gosync currently will ask buildviz for the newest job and start from there. However, it will look only on a stage level and compare the stage scheduled-time, which is a few seconds earlier than the actual job building-time.
Given gosync was started at time T, it would pick up all stages scheduled until that time. If this included only jobs building at T+2, then stages scheduled at T+1 would not be included on a subsequent sync. Instead gosync would start at T+2.
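One way to pick a safe resume point is to take the earliest stage that was still running at the last sync, rather than the latest time seen. A Python sketch under an assumed stage shape:

```python
# Sketch: resume from the earliest unfinished stage, falling back to the
# latest finished one. The stage dict shape is an assumption.
def resume_from(stages):
    """stages: list of {"scheduled": int, "finished": bool}."""
    unfinished = [s["scheduled"] for s in stages if not s["finished"]]
    if unfinished:
        return min(unfinished)
    return max(s["scheduled"] for s in stages)
```

With the example above, a stage scheduled at T+1 that is still running wins over one building at T+2, so the next sync does not skip it.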
See e.g. hack in 4eeccf3
.. in Chrome
Workaround for 1575 will leave the job with a nil value for start, and will probably fail at the schema validation, now that start is required.
Only solution is probably to wholly ignore past builds from renamed items.
Given package com.example.a and class a in package com.example, both accumulated test runtimes will collide when rendered in the 'Average test runtime' graph.
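The collision, made concrete in a small Python sketch, together with one hypothetical fix (keeping hierarchy levels separate instead of joining them into one dotted string):

```python
# Both hierarchy nodes flatten to the same key when joined with '.':
package_key = "com.example.a"             # the package com.example.a
class_key = "com.example" + "." + "a"     # class 'a' inside com.example
assert package_key == class_key           # they collide in the graph

# Hypothetical fix: keep the levels as a tuple, so the nodes stay distinct.
assert ("com.example.a",) != ("com.example", "a")
```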
Grouping by package structure will make it easier to find out where time is spent.
Similar to the TeamCity sync process, don't print the password inside the URL to stdout/logs.
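A Python sketch of redacting credentials before logging a URL; the URL is made up:

```python
from urllib.parse import urlsplit, urlunsplit

# Sketch: strip user:password from a URL before printing it to stdout/logs.
def redact_url(url):
    parts = urlsplit(url)
    if parts.username:
        netloc = parts.hostname or ""
        if parts.port:
            netloc += ":%d" % parts.port
        parts = parts._replace(netloc="****@" + netloc)
    return urlunsplit(parts)

assert redact_url("http://admin:secret@go.example.org:8153/go") == \
    "http://****@go.example.org:8153/go"
```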
Currently a 400 error fails the script.
Go.cd (and maybe others as well) first schedules a job after an external event (e.g. a source change), and assigns the job once a build node becomes available.
Including the delay between the two in the job runtime statistics is misleading: an increase there indicates that the build system might be overloaded, rather than the actual step taking more time. However, monitoring the delay is important when planning to reduce cycle time.
When builds are stacking up, the following job might be triggered by multiple builds:
"causes" : [
{
"shortDescription" : "Started by upstream project \"Deploy\" build number 161",
"upstreamBuild" : 161,
"upstreamProject" : "Deploy",
"upstreamUrl" : "job/Deploy/"
},
{
"shortDescription" : "Started by upstream project \"Deploy\" build number 162",
"upstreamBuild" : 162,
"upstreamProject" : "Deploy",
"upstreamUrl" : "job/Deploy/"
}
]
Currently the first cause is selected and any further ones are ignored. This then shows shorter, partial pipeline runs in the pipeline graph.
The upcoming wait time graph will also fail to account for the consequently longer wait times.
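A sketch of collecting every upstream cause instead of only the first, using the field names from the Jenkins JSON above:

```python
# Sketch: extract all upstream (project, build) pairs from a Jenkins
# "causes" list, so stacked-up triggering builds are all linked.
def upstream_builds(causes):
    return [(c["upstreamProject"], c["upstreamBuild"])
            for c in causes if "upstreamBuild" in c]

causes = [
    {"upstreamBuild": 161, "upstreamProject": "Deploy", "upstreamUrl": "job/Deploy/"},
    {"upstreamBuild": 162, "upstreamProject": "Deploy", "upstreamUrl": "job/Deploy/"},
]
assert upstream_builds(causes) == [("Deploy", 161), ("Deploy", 162)]
```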
Offer CSV as alternate output so the statistics can be processed in other tools.
source_id? job (in responses) vs. job-name in triggered-by information
The XML generated by JUnit 5 can be rejected by buildviz with a 400. The offending content looks like this:
<testcase name="my_test" classname="com.example.something" time="14,029.255">
</testcase>
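One tolerant way to handle the grouped time attribute above is to strip the thousands separators before parsing. A Python sketch:

```python
# Sketch: accept a locale-grouped time value like "14,029.255" by removing
# the ',' thousands separators before converting to a float.
def parse_junit_time(value):
    return float(value.replace(",", ""))

assert parse_junit_time("14,029.255") == 14029.255
```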
Investigate how flaky tests are currently found.
Technically the algorithm does not need to rely on the build outcome, but just needs to look at all build pairs with the same input and find tests with different outcomes.
This would make sure to find flaky tests even if the build still fails for other reasons.
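The input-based idea, sketched in Python; the data shapes are assumptions, not buildviz's actual structures:

```python
from collections import defaultdict

# Sketch: group test outcomes by build input and flag tests whose outcome
# differs between builds with the *same* input, ignoring the overall
# build result.
def flaky_tests(builds):
    """builds: list of (inputs_dict, {test_name: "pass" | "fail"})."""
    outcomes = defaultdict(set)
    for inputs, tests in builds:
        key = tuple(sorted(inputs.items()))
        for name, outcome in tests.items():
            outcomes[(key, name)].add(outcome)
    return {name for (_, name), seen in outcomes.items() if len(seen) > 1}

builds = [
    ({"git": "abc"}, {"testA": "pass", "testB": "fail"}),
    ({"git": "abc"}, {"testA": "fail", "testB": "fail"}),  # same input, testA flipped
    ({"git": "def"}, {"testA": "pass", "testB": "pass"}),  # other input, not compared
]
assert flaky_tests(builds) == {"testA"}
```

Note that testA is flagged even though every build on input abc failed overall, which is exactly the improvement over relying on the build outcome.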
The ongoing phase (whether red or green) is not shown. As the end of the phase is unknown, we could take the timestamp of the last synced build as a current value.
Provide fewer items in an overview, then only provide all of them when "zooming in" (whatever that will mean).
There are two fields we can extract that information from:
"snapshot-dependencies": {
"build": [
{
"buildTypeId": "SimpleSetup_Test",
"href": "/httpAuth/app/rest/builds/id:46",
"id": 46,
"number": "14",
"state": "finished",
"status": "SUCCESS",
"webUrl": "http://localhost:8111/viewLog.html?buildId=46&buildTypeId=SimpleSetup_Test"
}
],
"count": 1
}
and
"triggered": {
"date": "20160508T071736+0000",
"details": "##triggeredByBuildType='bt3' triggeredByBuild='14'",
"type": "unknown"
}
snapshot-dependencies seems to have the data in the format we need, but comes with a bunch of problems:
triggered seems to have its own set of issues:
/app/rest/buildTypes/bt3 for example.
The triggered value will only call out the user action (type: user).
Current example has a flaky build with count 1 and a test with > 100 flaky failures.
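Parsing the triggering build out of TeamCity's "triggered" details string shown above could look like this Python sketch:

```python
import re

# Sketch: pull the triggering build type and build number out of a TeamCity
# details string like "##triggeredByBuildType='bt3' triggeredByBuild='14'".
def parse_triggered_details(details):
    m = re.search(
        r"triggeredByBuildType='([^']*)'\s+triggeredByBuild='([^']*)'", details)
    return (m.group(1), m.group(2)) if m else None

assert parse_triggered_details(
    "##triggeredByBuildType='bt3' triggeredByBuild='14'") == ("bt3", "14")
```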
List of user feedback
In Go:
Given two failing jobs in a stage, when rerunning one of the failed ones and it subsequently turns green, the result of the stage will be incorrectly reported as passed.
At least that might be the culprit why curl http://localhost:3000/testsuites | sort -rnt, -k5 returns a ghost entry.
The GoCD sync should not fail on build artifact files that are not JUnit XML. The current heuristic of checking the filename is not good enough.
A simple magic check on top could suffice to weed out other XML files.
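A Python sketch of such a check: parse the artifact and only accept it when the root element looks like JUnit XML, instead of trusting the file name:

```python
import xml.etree.ElementTree as ET

# Sketch: accept an artifact only if it parses as XML and its root element
# is a JUnit-style testsuite(s).
def looks_like_junit_xml(content):
    try:
        root = ET.fromstring(content)
    except ET.ParseError:
        return False
    return root.tag in ("testsuite", "testsuites")

assert looks_like_junit_xml('<testsuite name="s"><testcase name="t"/></testsuite>')
assert not looks_like_junit_xml('<project name="build"/>')  # other XML
assert not looks_like_junit_xml('just a text artifact')     # not XML at all
```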
Wiremock supports recording and replaying previous request patterns. This might enable end2end testing of the sync implementations without starting up the Vagrant boxes. This could also close the gap left when removing the previous sync through Wiremock in 9e70d51.
cctray.xml for Go.cd exports a "build" like pipeline :: stage :: job; we should use the same for consistency. However, that would be a breaking change.
If a job is removed or just renamed (its id changes) and the last build failed, the fail phase goes on indefinitely, as that job never turns green again.
Possible solution:
Fuzzy logic by which a certain interval without new builds means the job has been decommissioned.
Subproject names in TeamCity include colons (e.g. Project :: Subproject), which is invalid for file names under Windows.
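A Python sketch of sanitising such names: replace the characters Windows forbids in file names with an underscore. The replacement character is an arbitrary choice:

```python
import re

# Sketch: replace characters that are invalid in Windows file names
# (including the ':' from "Project :: Subproject") with '_'.
def safe_filename(name):
    return re.sub(r'[<>:"/\\|?*]', "_", name)

assert safe_filename("Project :: Subproject") == "Project __ Subproject"
```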
Right now multiple ones don't accumulate runtime; instead the resulting value is lower, due to averaging.
Although http://llg.cubic.org/docs/junit/ mentions that the testcase time is optional:
As teams will likely end up with multiple independent sets of jobs we should probably support some kind of separation of those sets. This could be a complex setup of multiple pipelines or many simple pipelines of microservices.
Color coding is one important visual aspect, as jobs of one group should be identifiable.
The "fail phases" graph follows a pipeline model, aggregating multiple pipelines in one overview will conflate what's happening below.
Ideas:
/builds/:ns/:job/:build
What we don't want:
buildviz does not want to hold information for multiple teams, rather it should be easy to deploy multiple instances of it.