Comments (6)
Can you post a sample of one of your mgf files, or even just the first 1000 lines of such a file?
from casanovo.
The problem seems to be that your MGF file is missing some information, in this case the precursor charges of the MS/MS spectra. Unfortunately I'm not sufficiently familiar to know whether it is expected that this information is not available for timsTOF data, but for successful spectrum identification Casanovo does require it.
The most straightforward solution would be to figure out how to include the precursor charges in the MGF export, although unfortunately I can't directly recommend the best tool to do this.
from casanovo.
My guess is that it is charge is called something else in @jonathan-krieger-bruker's file. A sample of the file's content should help us out.
from casanovo.
Hi all - thanks for helping to look into this. I tested 2 MGF files created different ways but they both look quite different then the test file or preprocessed data provided in the repo. I attach a zip file of these 2 MGFs (Top ~1000 lines or so).
Thanks again!
Jonathan
MGF_samples.zip
.
from casanovo.
The Charge state seems to be there and seems to be written in the same way as the test data. However the intensity values are very different - one looks relative and one looks absolute?
from casanovo.
Relative vs absolute intensity values shouldn't make a difference because intensities are normalized by Casanovo anyway.
In fact, I could run Casanovo successfully on the two example files you provided. I'm guessing that not all spectra are incorrect, and the one Casanovo chokes on is further down in the file you're trying to process. Would it be possible to share the full MGF with which you got the error so I can further debug the issue? Thanks.
from casanovo.
Related Issues (20)
- mzTab validation
- Automate mzTab validation HOT 7
- More information about the train/val/test split HOT 2
- WARNING: Skipped spectra with invalid precursor info HOT 1
- Export casanovo to torchscript/onnx HOT 1
- ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all() HOT 3
- Make Casanovo produce Skyline compatible output
- 9-Species Benchmark Set: Data Preprocessing Step? HOT 5
- Migrating PeptideMass, PeptideDecoder, and PeptideEncoder from depthcharge v0.2.3 to casanovo HOT 3
- Is there a way to know which spectras are ITMS instead of FTMS? HOT 3
- Numpy release 2.0.0 breaks depthcharge dependency HOT 2
- Use Fourier embeddings
- Add mzTab validator to github actions HOT 1
- Make Casanovo model compatible with torch.compile
- Designate a filename for the "best" model HOT 7
- PyTorch (GPU) not working on Win 11 box HOT 4
- ValueError: could not broadcast input array from shape (0,) into shape (25714,) HOT 5
- Eliminate the eval command HOT 4
- Greater than or equal to character ≥ causes logging error in end of run report.
- RuntimeError: CUDA error: the launch timed out and was terminated HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from casanovo.