Giter Club home page Giter Club logo

nd0821-c2-build-model-workflow-starter's People

Contributors

giacomov avatar mvirgo avatar sudkul avatar uanjali avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

nd0821-c2-build-model-workflow-starter's Issues

Pip subprocess error when running the project

I am getting the following error:

ERROR: Command errored out with exit status 128: git clone -q https://github.com/udacity/nd0821-c2-build-model-workflow.git

Running mlflow run . -P wandb_api_key=${WANDB_API_KEY} I am getting an error with pip failing to install a package. More specifically when trying to clone https://github.com/udacity/nd0821-c2-build-model-workflow.git. This comes from the components/get_data/conda.yml file. This repo does not exist.

We try to install a package with git under this URL: git+https://github.com/udacity/nd0821-c2-build-model-workflow.git#egg=wandb-utils&subdirectory=project/solution/components

Fail to load model in test_regression_model

In the test_regression_model component, I get and error when trying to load the model:


2021-09-28 00:27:21,210 Downloading artifacts
2021-09-28 00:27:24,128 Loading model and performing inference on test set
Traceback (most recent call last):
  File "/opt/loggi/nd0821-ml-pipeline/src/test_regression_model/run.py", line 73, in <module>
    go(args)
  File "/opt/loggi/nd0821-ml-pipeline/src/test_regression_model/run.py", line 37, in go
    sk_pipe = mlflow.sklearn.load_model(model_local_path)
  File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/sklearn/__init__.py", line 471, in load_model
    local_model_path = _download_artifact_from_uri(artifact_uri=model_uri)
  File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/tracking/artifact_utils.py", line 83, in _download_artifact_from_uri
    return get_artifact_repository(artifact_uri=root_uri).download_artifacts(
  File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/store/artifact/local_artifact_repo.py", line 79, in download_artifacts
    raise IOError("No such file or directory: '{}'".format(local_artifact_path))
OSError: No such file or directory: './artifacts/random_forest_export%3Av3'

Executing it locally, I've find out that there is a problem in the usage of the path ./artifacts/random_forest_export:v3, where it actually tries to load ./artifacts/random_forest_export%3Av3.

Download step not doing anything

After getting my environment running I am getting the following error:

ValueError: Path is not a file: data/sample1.csv

Looking at the run.py file inside of components/get_data I believe that a part of the code is missing. I see only the comments

# We stream the file so that it can be downloaded even if it is bigger
# than the available memory

and then it directly tries to log an artifact.

There is no downloading or fetching of data and there is no sample1.csv in the repository.

What I expected:
See something like in exercise_14

# Download the file streaming and write to open temp file
with requests.get(args.file_url, stream=True) as r:
    for chunk in r.iter_content(chunk_size=8192):
        fp.write(chunk)

# Make sure the file has been written to disk before uploading
# to W&B
fp.flush()

Download step "np.object" error

Greetings,

I forked the repository and when I try to test the download step from the main project, I expected it to work from components/get_data with the script below:

mlflow run . -P steps="download"

On the other hand, I am stuch with these errors:

/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py:49: FutureWarning: In the future `np.object` will be defined as the corresponding NumPy scalar.
  binary = (7, np.dtype("bytes"), "BinaryType", np.object)
Traceback (most recent call last):
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/bin/mlflow", line 7, in <module>
    from mlflow.cli import cli
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/__init__.py", line 52, in <module>
    import mlflow.fastai as fastai  # noqa: E402
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/fastai.py", line 22, in <module>
    from mlflow import pyfunc
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/pyfunc/__init__.py", line 219, in <module>
    import mlflow.pyfunc.model
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/pyfunc/model.py", line 17, in <module>
    from mlflow.models import Model
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/models/__init__.py", line 25, in <module>
    from .signature import ModelSignature, infer_signature
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/models/signature.py", line 12, in <module>
    from mlflow.types.schema import Schema
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/__init__.py", line 6, in <module>
    from .schema import DataType, ColSpec, Schema, TensorSpec
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py", line 20, in <module>
    class DataType(Enum):
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py", line 49, in DataType
    binary = (7, np.dtype("bytes"), "BinaryType", np.object)
  File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/numpy/__init__.py", line 324, in __getattr__
    raise AttributeError(__former_attrs__[attr])
AttributeError: module 'numpy' has no attribute 'object'.
`np.object` was a deprecated alias for the builtin `object`. To avoid this error in existing code, use `object` by itself. Doing this will not modify any behavior and is safe. 
The aliases was originally deprecated in NumPy 1.20; for more details and guidance see the original release note at:
    https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations

I suspect that the dependencies are a bit old, since the last commit is done about two years ago.
Do you have any idea how can we solve this problem?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.