lightning-ai / ecosystem-ci Goto Github PK

Automate issue discovery for your projects against Lightning nightly and releases.

Home Page: https://devblog.pytorchlightning.ai/stay-ahead-of-breaking-changes-with-the-new-lightning-ecosystem-ci-b7e1cf78a6c7

License: Apache License 2.0

Makefile 5.09% Python 87.78% Dockerfile 7.13%

ci-cd compatibility-testing deep-learning integration-testing lightweight python

ecosystem-ci's People

Contributors

Stargazers

Watchers

Forkers

akihironitta borda ultmaster diffeqml lucmos pykeen aleksanderwww raoakarsha raalsky martinkim0 abhilash1910 avr248 athitten a43501

ecosystem-ci's Issues

enable make for all configs

🚀 Feature

update Makefile to run any and all integrations

Motivation

enable full local testing, to emulate the cloud solution...

Alternatives

Additional context

inspiration/parallel solution as https://github.com/PyTorchLightning/lightning-tutorials/blob/main/Makefile

mention contact person in Slack message

🚀 Feature

Mention contact person in particular slack message

Pitch

The user can safely ignore all that is not related to him...

Alternatives

add in the receipt yaml contact with names and then add these people in message

contact:
    slack:
      - Borda
    email: ...

add a command for parsing with arg --channel slack

Additional context

Test the ecosystem integration against GPU containers

🚀 Feature

Nemo tests it's own CI on a Pytorch container from NGC (versioned as YY.MM) and these are generally available on other cloud providers too. Note that - usually once pytorch has a public release, it takes at least one month for the next container to actually have the public pytorch release. By actually have the released pytorch, I mean that the current container will have an alpha release of pytorch with some cherry-picked changes vs the actual full new release in public.

This can cause cases where improper version checking (using distutils instead of packaging.version.Version) can fail these alpha version comparison tests and cause PTL inside of the container to pick incorrect code paths. So the ecosystem CI will work fine ... but when you run it on a pytorch container released from Nvidia (ie on most cloud providers) it may fail (and not just Nemo, anything that uses PTL and hits that code path).

So maybe on a separate test prior to release, test the ecosystem CI on the latest public NGC pytorch container (or really any cloud container which has pytorch built into it). Ofc this is a big task so it's just a suggestion.

Motivation

For a current example of exactly how we have to patch for such an issue right now (wrt Pytorch 1.10, NGC Container 21.01 and Pytorch Lightning 1.5.9), https://github.com/NVIDIA/NeMo/blob/8e15ba43ba0a17b456d3bfa09444574ef1faa301/Jenkinsfile#L70-L76 due to an issue regarding torchtext.

For an extreme case of exactly how bad things become - we had to adaptively install torch, PTL and nemo dependencies based on whether the install occurred inside a container or not.. https://github.com/NVIDIA/NeMo/blob/r1.0.0rc1/setup.py#L107-L146

Pitch

Maybe test the ecosystem CI (or just even PTL alone) on the latest public NGC pytorch container (or really any cloud container which has pytorch built into it). Ofc this is a big task so it's just a suggestion.

Alternatives

Apart from manual patching of PTL source at install time, we haven't found any better solution than to wait it out for a month or two before the container actually contains the latest code from the latest torch release.

Slack action is limited to Linux only

🐛 Bug

The slack bot does not run on other OS:

Error: Container action is only supported on Linux

To Reproduce

https://github.com/PyTorchLightning/ecosystem-ci/runs/4601419524?check_suite_focus=true

Expected behavior

the Slack integration shall be running anywhere...

Additional context

See alternatives:

isolate tests & extra dependencies

🚀 Feature

isolate tests

copy all tests for a project to _itenration instead of repo root

xtra dependencies

thinking about having

dependencies:
  - name: pytorch-lightning
    HTTPS: https://github.com/PyTorchLightning/pytorch-lightning.git
    checkout: release/1.5.x

and also:

dependencies:
  - name: Cython
    checkout: 3.56

which would do pip install Cython==3.56

Motivation

simple cleaning in make or anywhere

Additional context

Add differential build for PR

🚀 Feature

When a change is just for particular configs we want to build a configuration with includes these configs as in principal al configs shall be completely independent and so isolated...
We may use GH CLI, ask if the change is only the one config and then skip all others...

Our CLI: input PR number and the config path and return bool if the config shall be build
Logic: build particular configs in all changes are only in configs and this is one of the changed
GH action: for execution steps add if: ... with skip if no need for building

Motivation

we want to be as efficient as possible so do not build anything which is not really needed
also taking too many resources with growing ecosystem size may suck pool for other our projects as the pool org-wide shared

Pitch

the targeted build will speed up PR checks and lower the costs

Alternatives

Additional context

Conda environment for specific dependencies

🚀 Feature

While the current container tests are run against bare python, there are many important libraries that require conda to be available to properly install dependencies. Therefore conda support would be the request.

Motivation

A use case for this in NeMo is Numba - we utilize it for custom CUDA kernels for RNNT loss, custom spec augment kernels and in the future for jit compiled CPU code as well. We have plenty of unit tests that are optional (and hence skipped) when these are not installed.

Another is Pynini (for WFST based text normalization and inverse text normalization) - however since the graph building takes significant time if not cached, we do not recommend performing this CPU test during your ecosystem tests. It is just an example.

The reason we prefer Numba to be installed from conda (whereas it does have a pip install path) is that often an increment in numba version is accompanied by an increment in the llvmlite library too - and this causes significant issues. llvmlite is a library that will often throw errors during upgrade, and it can be forced via --force-reinstall (for pip) but that brings about an unstable environment. Conda bypasses all of this.

Pitch

Install miniconda on the fly (or use a base container that supports conda).

Alternatives

Leave conda tests out, and request ecosystem members to make their tests skippable if dependencies are not installed. NeMo can do this (and currently does skip numba tests if its not installed or doesn't support a recent cuda version), it may be cumbersome on a large scale.

Create a GitHub issue if compatibility breaks

🚀 Feature

It would be awesome if we could set the CI up to make an issue on the corresponding CI if something was to break.

Even better would be to have a stack trace showing what broke! Something like below:

Hi! I'm the PyTorchLightning EcoSystem-CI bot.

I've detected an incompatibility with your repository and Lightning's release (1.5.x). I've attached the stack trace below for help in debugging.

Stack Trace (24/01/22)

``` stack_trace ```

Any additional breakages could update the same issue!

Add an option to specify the installation path of target_repository

Hi, I'm a contributor of https://github.com/intel-analytics/BigDL

Our project is using pytorch lightning and we want to use this CI, but there are some issues:

I noticed this CI always run pip install . in the root directory of target_repository:

ecosystem-ci/actions/assistant.py

Line 178 in 2ba4864

pip_install = "."

Our project has multiple packages which can be installed, so it doesn't contain a setup.py in the root directory.
I hope this CI can add an option to control in which directory it will run pip install ..

i.e. change the current commands:

git clone {url}
cd {repo_name}
pip install .

to:

git clone {url}
cd {repo_name}
cd {new option to specify the installation path}
pip install .

Thanks!

expose other build commands

🚀 Feature

lets being open and transparent and have it as an extra item in config:

build_commands:
   - pip install Cython==3.56
   - conda update numba --silent

Motivation

more flexibility on building environment, eventually download some external/public data needed for testing

Alternatives

Additional context

Produce collection of passing integrations

🚀 Feature

see: Lightning-AI/pytorch-lightning#14137 (comment)

Motivation

Showcase all passing integrations on a special PL page

Pitch

Get more visibility of active collaborations

Alternatives

Additional context

enable email notifications

🚀 Feature

enable some external messaging on CI failure to via email

Motivation

Expand the reach of notified users as not everyone is using Slack, and/or there are too many messages...

Alternatives

Additional context

check the following actions as options:

https://github.com/marketplace/actions/send-mailer

catching also PL deprecation warnigs

🚀 Feature

if we want to be progressive incompatibility we shall also report on deprecation warnings and eventually treat them as failer...
as an alternative, we can give a user option to set it as strict 🤔

Pitch

Compatibility for the future

Alternatives

pass pytest argument -W error::DeprecationWarning in
https://github.com/PyTorchLightning/ecosystem-ci/blob/c544b7bc10fbef6f6b498805dd08837eb1ba96a2/actions/_config.yaml#L54

Additional context

https://docs.pytest.org/en/latest/how-to/capture-warnings.html#controlling-warnings

lightning-ai / ecosystem-ci Goto Github PK

ecosystem-ci's People

Contributors

Stargazers

Watchers

Forkers

ecosystem-ci's Issues

🚀 Feature

Motivation

Alternatives

Additional context

🚀 Feature

Pitch

Alternatives

Additional context

🚀 Feature

Motivation

Pitch

Alternatives

🐛 Bug

To Reproduce

Expected behavior

Additional context

🚀 Feature

isolate tests

xtra dependencies

Motivation

Additional context

🚀 Feature

Motivation

Pitch

Alternatives

Additional context

🚀 Feature

Motivation

Pitch

Alternatives

🚀 Feature

🚀 Feature

Motivation

Alternatives

Additional context

🚀 Feature

Motivation

Pitch

Alternatives

Additional context

🚀 Feature

Motivation

Alternatives

Additional context

🚀 Feature

Pitch

Alternatives

Additional context

Recommend Projects

Recommend Topics

Recommend Org