Giter Club home page Giter Club logo

Comments (5)

rjzamora avatar rjzamora commented on June 16, 2024 2

See this CI run for full details

Oh interesting. Thanks for running a test PR @jrbourbeau ! I'll look into a fix.

from dask.

charlesbluca avatar charlesbluca commented on June 16, 2024

Think the larger context here is some breakage that cropped up in cuDF this week around libarrow 16.1.0 that should generally be resolved now:

So wouldn't expect to see this warning in GPU CI on subsequent runs.

Beyond that specific breakage, is there preferable behavior we'd want from pytest here? IMO, I prefer this noisier output when the module is available but broken versus the more obfuscated errors we see when the broken package is silently imported and used in testing.

from dask.

jrbourbeau avatar jrbourbeau commented on June 16, 2024

Thanks @charlesbluca. Running CI over in #11143 just to double check that things look good.

IMO, I prefer this noisier output when the module is available but broken versus the more obfuscated errors we see when the broken package is silently imported and used in testing.

Same. Given that's what pytest is doing now, I don't think we need any code changes on our end

from dask.

jrbourbeau avatar jrbourbeau commented on June 16, 2024

Indeed the libarrow issue has been resolved (πŸŽ‰ ) but there are still a couple of other failures like this

10:26:47 ____________________ test_groupby_grouper_dispatch[tasks-b] ____________________
10:26:47 [gw1] linux -- Python 3.10.14 /opt/conda/envs/dask/bin/python3.10
10:26:47 

10:26:47 key = 'b'
10:26:47 

10:26:47     @pytest.mark.gpu
10:26:47     @pytest.mark.parametrize("key", ["a", "b"])
10:26:47     def test_groupby_grouper_dispatch(key):
10:26:47         cudf = pytest.importorskip("cudf")
10:26:47     
10:26:47         # not directly used but must be imported
10:26:47         pytest.importorskip("dask_cudf")  # noqa: F841
10:26:47     
10:26:47         pdf = pd.DataFrame(
10:26:47             {
10:26:47                 "a": ["a", "b", "c", "d", "e", "f", "g", "h"],
10:26:47                 "b": [1, 2, 3, 4, 5, 6, 7, 8],
10:26:47                 "c": [1.0, 2.0, 3.5, 4.1, 5.5, 6.6, 7.9, 8.8],
10:26:47             }
10:26:47         )
10:26:47         gdf = cudf.from_pandas(pdf)
10:26:47     
10:26:47         pd_grouper = grouper_dispatch(pdf)(key=key)
10:26:47         gd_grouper = grouper_dispatch(gdf)(key=key)
10:26:47     
10:26:47         # cuDF's numeric behavior aligns with numeric_only=True
10:26:47         expect = pdf.groupby(pd_grouper).sum(numeric_only=True)
10:26:47 >       got = gdf.groupby(gd_grouper).sum()
10:26:47 

10:26:47 dask/dataframe/tests/test_groupby.py:2996: 
10:26:47 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
10:26:47 /opt/conda/envs/dask/lib/python3.10/site-packages/cudf/core/mixins/mixin_factory.py:11: in wrapper
10:26:47     return method(self, *args1, *args2, **kwargs1, **kwargs2)
10:26:47 /opt/conda/envs/dask/lib/python3.10/site-packages/cudf/core/groupby/groupby.py:759: in _reduce
10:26:47     return self.agg(op)
10:26:47 /opt/conda/envs/dask/lib/python3.10/site-packages/nvtx/nvtx.py:116: in inner
10:26:47     result = func(*args, **kwargs)
10:26:47 /opt/conda/envs/dask/lib/python3.10/site-packages/cudf/core/groupby/groupby.py:631: in agg
10:26:47     ) = self._groupby.aggregate(columns, normalized_aggs)
10:26:47 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
10:26:47 

10:26:47 >   ???
10:26:47 E   TypeError: function is not supported for this dtype: sum
10:26:47 

10:26:47 groupby.pyx:192: TypeError

See this CI run for full details

from dask.

rjzamora avatar rjzamora commented on June 16, 2024

Hopefully #11144 will resolve the "real" gpuci failure.

from dask.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.