Greetings, Firstly, I would like to thank you for providing this ope

Hi Ben, We appreciate your interest in Casanovo! The confi

Greeting <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Greetings <a class="user-mention notranslate" data-hovercard-type="user" data-hovercar

de novo sequencing without evaluation about casanovo HOT 4 CLOSED

BenSamy2020 commented on June 25, 2024

de novo sequencing without evaluation

from casanovo.

Comments (4)

melihyilmaz commented on June 25, 2024

Hi Ben,
We appreciate your interest in Casanovo!

The config file can be user provided but the default is the casanovo/config.py we provide in the repo. You can use it as a template for your own config file and provide the path to your file.
test_data_path denotes the path to the directory where you have the .mgf file you want to sequence.
I added an example output file casanovo_sample_output.csv to the repo.

Let me know if you have other questions, feel free to close the issue otherwise.

from casanovo.

BenSamy2020 commented on June 25, 2024

Greeting @melihyilmaz,

This is actually a really amazing tool! I have successfully started the program. Now it is running. Based on the output file you had provided can I request for a program improvement feature? The improvement I would suggest is to allow a proteomics fasta database to be provided in the command itself. Subsequently, the program would match the denovo sequenced peptides onto the fasta protein database provided and append the fasta header to the corresponding peptides. Based on this it would be easier to know from which protein these peptides are derived from. Also if the denovo sequenced peptide is absent from the provided protein database, it should be labeled as missing. I understand this is a huge ask, but this enhancement would improve downstream analysis.

Additionally, I also observed the user warning of:

rank_zero_warn("You are running on single node with no parallelization, so distributed has no effect.")
GPU available: False, used: False
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
c:\users\parth\appdata\local\programs\python\python39\lib\site-packages\pytorch_lightning\trainer\data_loading.py:132: UserWarning: The dataloader, test_dataloader 0, does not have many workers which may be a bottleneck. Consider increasing the value of the num_workers argument(try 24 which is the number of cpus on this machine) in theDataLoader` init to improve performance.
rank_zero_warn(
Testing: 0it [00:00, ?it/s]

Could you advise me on how to dedicate/allocate sufficient CPU for your program (e.g., --cpu 15). Unfortunately, the options of --cpu or --memory is not available.
My computer has GPU (NVIDIA GeForce GTX 1660 SUPER), is there a way to access those using your program?
Also after observing the above message I did not observed any progress for more than 15 mins. By any chance is the program stalled? Is there a way to access if the program is running in the background?

Regards,
Ben

from casanovo.

guhanrv commented on June 25, 2024

Hi @BenSamy2020,

I've been using the program recently and think I can help!

You can adjust the number of CPUs in the casanovo/config.py file, which should be found in your /environment/lib/pythonversion/site-packages/casanovo/config.py file, on line 30.
Yes. If you are able to run python3 from the command line, import torch, and type torch.cuda.is_available() and it returns True, that means your environment is configured to recognize your GPU, and so all you need to do is change line 31 in the same file as above to gpus = [0]. Then, when you run Casanovo, you should see GPU available: True, used: True instead.
I think the GPU will help lots there. Also, check the test_batch_size (line 80 in the same config file) - it's by default set to 1024, so your screen will only update after inferring 1024 peptides. On CPU, that takes a while. So try changing that test batch size to something small and see if you see progress.

Hope this helps!

from casanovo.

BenSamy2020 commented on June 25, 2024

Greetings @guhanrv,

I am really appreciative of your assistances. With regards to CPUs I will edit line 30 of config.py file.
Unfortunately, pytorch is not available on my PC and I would require to set it up. Additionally, I am a wet lab person. I will have to youtube or google some information on how to set it up before tapping onto my GPUs.

Once again thank alot!

Regards,
Ben

from casanovo.

de novo sequencing without evaluation about casanovo HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent