Giter Club home page Giter Club logo

foodsam's People

Contributors

jamesjg avatar niuzehai avatar starhiking avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

foodsam's Issues

FoodSAM Colab GPU Version problems mmcv-full

Hello,
I am trying to use FoodSAM in Google Colab but I am encountering a problem. As soon as I attempt to install mmcv-full and allocate GPU resources to the notebook, I receive an error named "building wheels". I did some research and found out that if I install using !pip install torch==1.12.0+cu116 torchvision==0.13.0+cu116 torchaudio==0.12.0 --extra-index-url https://download.pytorch.org/whl/cu116, I can at least install mmcv versions 2.0.0 and 2.1.0. However, unfortunately, these versions do not have the Dictation function... Does anyone know a solution?

error: subprocess-exited-with-error

× python setup.py bdist_wheel did not run successfully.
│ exit code: 1
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.
Building wheel for mmcv-full (setup.py) ... error
ERROR: Failed building wheel for mmcv-full
Running setup.py clean for mmcv-full
Failed to build mmcv-full
ERROR: Could not build wheels for mmcv-full, which is required to install pyproject.toml-based project

ImportError: cannot import name 'DictAction' from 'mmcv.utils'

2 issues

hello i want to get a instance segmentation results.
but this only give the results of semantic segmentation and panoptic segmentation.
can you add the command for instance segmentation please?

And i cannot get the pretrained weight of [FoodSeg103-SETR-MLA].
please add the another link.
Thank you.

FoodSeg103-SETR-MLA error

We're sorry, but **** can't be found in the smu-my.sharepoint.com directory. Please try again later, while we try to automatically fix this for you.

抱歉 我是纯小白 想请教如何开始训练模型

非常感谢您的工作 现在我希望尝试去进行一些简单的修改并进行训练
但是我在readme.md中没有找到如何开始训练的教程 也没有找到train.py
不好意思打扰到您 非常期待您的回复

Compatibility Inquiry: mmsegmentation, mmengine, and mmcv Versions

Hello,

Thank you very much for sharing your work. I would like to inquire about the versions of mmsegmentation and mmengine that you used, in addition to mmcv version 1.3.0. Currently, I am using mmsegmentation version 0.11.0, mmengine version 0.10.3, and mmcv version 1.13.0. Would these versions be compatible for replication purposes? I look forward to your response.
Kind Regards,
Queenie.

pred_mask_path is not correct.

While I faced the problem when using the command metioned in your demo step.

python FoodSAM/semantic.py --img_path <path/to/img> --output <path/to/output>

However, I got this error message:

Traceback (most recent call last):
  File "FoodSAM/semantic.py", line 322, in <module>
    main(args)
  File "FoodSAM/semantic.py", line 309, in main
    enhance_masks(args.output, args.category_txt, args.color_list_path, num_class=args.num_class, area_thr=args.area_thr, ratio_thr=args.ratio_thr, top_k=args.top_k)
  File "/DATA/jyp/FoodSAM/FoodSAM/FoodSAM_tools/enhance_semantic_masks.py", line 100, in enhance_masks
    pred_mask = cv2.imread(pred_mask_path)[:,:,2]
TypeError: 'NoneType' object is not subscriptable

It seems that the code cannot find the correct path to the corresponding mask generated by the model.
Please tell me how to solve this problem.

Additionally, can the output of a food image to be a corresponding mask but not a file within plenty of files such as input.jpg, mask folders, and metadata.

Thanks!

Some questions about mmseg and requirements versions

Hello,

First of all, thank you very much for your hard work and for sharing this project with the community! I've been exploring FoodSAM and am excited about the potential it has. However, I have a few questions regarding its dependencies and compatibility with newer versions of certain libraries. Your insights on these would greatly help me integrate and potentially contribute to the project:

  • Integration of mmseg v0.11.0: I noticed that the project directly clones the mmseg v0.11.0 code. Could you share the reasoning behind this specific version choice? Are there specific features or compatibility issues with newer versions that influenced this decision? Did you make important changes in the code and using it on FoodSAM ?

  • Compatibility with Newer Versions of mmseg: Have there been any attempts to integrate newer versions of mmseg into FoodSAM? If so, were there any notable challenges or findings from those attempts?

  • Use of Older mmcv Versions: The project currently utilizes mmcv-full 1.3.0. Have there been any tests or considerations for upgrading to more recent versions, such as mmcv-full 1.7 or even mmcv 2? If compatibility issues arose, could you detail them?

Understanding these aspects would not only help me in utilizing FoodSAM more effectively but also in potentially contributing to its development, ensuring compatibility and leveraging improvements from these libraries.

Thank you for your time and assistance!

Best regards,
Jules

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.