udacity / nd0821-c2-build-model-workflow-starter Goto Github PK
View Code? Open in Web Editor NEWStarter Code for the Course 2 project of the Udacity ML DevOps Nanodegree Program
License: Other
Starter Code for the Course 2 project of the Udacity ML DevOps Nanodegree Program
License: Other
I am getting the following error:
ERROR: Command errored out with exit status 128: git clone -q https://github.com/udacity/nd0821-c2-build-model-workflow.git
Running mlflow run . -P wandb_api_key=${WANDB_API_KEY}
I am getting an error with pip failing to install a package. More specifically when trying to clone https://github.com/udacity/nd0821-c2-build-model-workflow.git. This comes from the components/get_data/conda.yml file. This repo does not exist.
We try to install a package with git under this URL: git+https://github.com/udacity/nd0821-c2-build-model-workflow.git#egg=wandb-utils&subdirectory=project/solution/components
In the test_regression_model
component, I get and error when trying to load the model:
2021-09-28 00:27:21,210 Downloading artifacts
2021-09-28 00:27:24,128 Loading model and performing inference on test set
Traceback (most recent call last):
File "/opt/loggi/nd0821-ml-pipeline/src/test_regression_model/run.py", line 73, in <module>
go(args)
File "/opt/loggi/nd0821-ml-pipeline/src/test_regression_model/run.py", line 37, in go
sk_pipe = mlflow.sklearn.load_model(model_local_path)
File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/sklearn/__init__.py", line 471, in load_model
local_model_path = _download_artifact_from_uri(artifact_uri=model_uri)
File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/tracking/artifact_utils.py", line 83, in _download_artifact_from_uri
return get_artifact_repository(artifact_uri=root_uri).download_artifacts(
File "/home/akio/miniconda3/envs/mlflow-d4cd2edbe762204ae630d5e343c25b496b164e3c/lib/python3.9/site-packages/mlflow/store/artifact/local_artifact_repo.py", line 79, in download_artifacts
raise IOError("No such file or directory: '{}'".format(local_artifact_path))
OSError: No such file or directory: './artifacts/random_forest_export%3Av3'
Executing it locally, I've find out that there is a problem in the usage of the path ./artifacts/random_forest_export:v3
, where it actually tries to load ./artifacts/random_forest_export%3Av3
.
After getting my environment running I am getting the following error:
ValueError: Path is not a file: data/sample1.csv
Looking at the run.py file inside of components/get_data I believe that a part of the code is missing. I see only the comments
# We stream the file so that it can be downloaded even if it is bigger
# than the available memory
and then it directly tries to log an artifact.
There is no downloading or fetching of data and there is no sample1.csv in the repository.
What I expected:
See something like in exercise_14
# Download the file streaming and write to open temp file
with requests.get(args.file_url, stream=True) as r:
for chunk in r.iter_content(chunk_size=8192):
fp.write(chunk)
# Make sure the file has been written to disk before uploading
# to W&B
fp.flush()
Greetings,
I forked the repository and when I try to test the download step from the main project, I expected it to work from components/get_data with the script below:
mlflow run . -P steps="download"
On the other hand, I am stuch with these errors:
/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py:49: FutureWarning: In the future `np.object` will be defined as the corresponding NumPy scalar.
binary = (7, np.dtype("bytes"), "BinaryType", np.object)
Traceback (most recent call last):
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/bin/mlflow", line 7, in <module>
from mlflow.cli import cli
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/__init__.py", line 52, in <module>
import mlflow.fastai as fastai # noqa: E402
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/fastai.py", line 22, in <module>
from mlflow import pyfunc
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/pyfunc/__init__.py", line 219, in <module>
import mlflow.pyfunc.model
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/pyfunc/model.py", line 17, in <module>
from mlflow.models import Model
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/models/__init__.py", line 25, in <module>
from .signature import ModelSignature, infer_signature
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/models/signature.py", line 12, in <module>
from mlflow.types.schema import Schema
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/__init__.py", line 6, in <module>
from .schema import DataType, ColSpec, Schema, TensorSpec
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py", line 20, in <module>
class DataType(Enum):
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/mlflow/types/schema.py", line 49, in DataType
binary = (7, np.dtype("bytes"), "BinaryType", np.object)
File "/home/teamx/miniconda3/envs/nyc_airbnb_dev/lib/python3.9/site-packages/numpy/__init__.py", line 324, in __getattr__
raise AttributeError(__former_attrs__[attr])
AttributeError: module 'numpy' has no attribute 'object'.
`np.object` was a deprecated alias for the builtin `object`. To avoid this error in existing code, use `object` by itself. Doing this will not modify any behavior and is safe.
The aliases was originally deprecated in NumPy 1.20; for more details and guidance see the original release note at:
https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
I suspect that the dependencies are a bit old, since the last commit is done about two years ago.
Do you have any idea how can we solve this problem?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.