Giter Club home page Giter Club logo

Comments (4)

melihyilmaz avatar melihyilmaz commented on June 25, 2024

Hi Ben,
We appreciate your interest in Casanovo!

  1. The config file can be user provided but the default is the casanovo/config.py we provide in the repo. You can use it as a template for your own config file and provide the path to your file.
  2. test_data_path denotes the path to the directory where you have the .mgf file you want to sequence.
  3. I added an example output file casanovo_sample_output.csv to the repo.

Let me know if you have other questions, feel free to close the issue otherwise.

from casanovo.

BenSamy2020 avatar BenSamy2020 commented on June 25, 2024

Greeting @melihyilmaz,

This is actually a really amazing tool! I have successfully started the program. Now it is running. Based on the output file you had provided can I request for a program improvement feature? The improvement I would suggest is to allow a proteomics fasta database to be provided in the command itself. Subsequently, the program would match the denovo sequenced peptides onto the fasta protein database provided and append the fasta header to the corresponding peptides. Based on this it would be easier to know from which protein these peptides are derived from. Also if the denovo sequenced peptide is absent from the provided protein database, it should be labeled as missing. I understand this is a huge ask, but this enhancement would improve downstream analysis.

Additionally, I also observed the user warning of:

rank_zero_warn("You are running on single node with no parallelization, so distributed has no effect.")
GPU available: False, used: False
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
c:\users\parth\appdata\local\programs\python\python39\lib\site-packages\pytorch_lightning\trainer\data_loading.py:132: UserWarning: The dataloader, test_dataloader 0, does not have many workers which may be a bottleneck. Consider increasing the value of the num_workers argument(try 24 which is the number of cpus on this machine) in theDataLoader` init to improve performance.
rank_zero_warn(
Testing: 0it [00:00, ?it/s]

  1. Could you advise me on how to dedicate/allocate sufficient CPU for your program (e.g., --cpu 15). Unfortunately, the options of --cpu or --memory is not available.
  2. My computer has GPU (NVIDIA GeForce GTX 1660 SUPER), is there a way to access those using your program?
  3. Also after observing the above message I did not observed any progress for more than 15 mins. By any chance is the program stalled? Is there a way to access if the program is running in the background?

Regards,
Ben

from casanovo.

guhanrv avatar guhanrv commented on June 25, 2024

Hi @BenSamy2020,

I've been using the program recently and think I can help!

  1. You can adjust the number of CPUs in the casanovo/config.py file, which should be found in your /environment/lib/pythonversion/site-packages/casanovo/config.py file, on line 30.
  2. Yes. If you are able to run python3 from the command line, import torch, and type torch.cuda.is_available() and it returns True, that means your environment is configured to recognize your GPU, and so all you need to do is change line 31 in the same file as above to gpus = [0]. Then, when you run Casanovo, you should see GPU available: True, used: True instead.
  3. I think the GPU will help lots there. Also, check the test_batch_size (line 80 in the same config file) - it's by default set to 1024, so your screen will only update after inferring 1024 peptides. On CPU, that takes a while. So try changing that test batch size to something small and see if you see progress.

Hope this helps!

from casanovo.

BenSamy2020 avatar BenSamy2020 commented on June 25, 2024

Greetings @guhanrv,

I am really appreciative of your assistances. With regards to CPUs I will edit line 30 of config.py file.
Unfortunately, pytorch is not available on my PC and I would require to set it up. Additionally, I am a wet lab person. I will have to youtube or google some information on how to set it up before tapping onto my GPUs.

Once again thank alot!

Regards,
Ben

from casanovo.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.