Giter Club home page Giter Club logo

opentyper's Introduction

OpenChrom® Analytics Edition

OpenChrom® is an Open Source tool for the analysis and visualization of mass spectrometric and chromatographic data developed by Lablicate GmbH.

It is based on ChemClipse but offers additional features that cannot be part of the ChemClipse project due to licencing constrains. This is the free base version provided to the community. It does not contain vendor specific file format converters nor commercial extensions.


Please see our Code of Conduct for rules on how to interact on this site.

If you would like to contribute to this project, please have a look at the guideline.

For user- and developer documentation, have a look at our wiki.

Builds for Windows, macOS and Linux can be downloaded including proprietary converter bundles.


YourKit Logo

Performance measurements provided by the YourKit Java Profiler

opentyper's People

Contributors

eselmeister avatar mailaender avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

opentyper's Issues

PCA plot for Mass Spectra

http://sing.ei.uvigo.es/mass-up/ has one which I can't get to work on @openSUSE Linux (white display, no error message) and the 3D nature makes it very hard to grasp in static presentations where you can't rotate. This is a huge feature and requires lot's of processing steps like #3 and #11 and data reduction to be implemented first.

Revert to the simple Mass Spectrum List

c2b5b9a The new mass spectrum edit list contains irrelevant fields such as parent m/z, parent resolution, daughter m/z, daughter resolution and collision energy. TOF-TOF functionality isn't widely used. Also no one edits MALDI spectra manually.

Mass Spectrum Metadata View

To debug the current Bruker flex series .fid file format importer. I am not sure if all fields are set to things developers and users might expect nor if things are missing. Also due to the closed source nature of the plugin, people can't easily inspect it.

Improve usability of the Tree File Viewer

The current tree file viewer is a bit inconvenient. https://bugs.eclipse.org/bugs/show_bug.cgi?id=496774 It requires users to navigate manually to the folder where data is located and for MALDI-TOF MS even hand pick samples and sub-samples manually to open them. While the Bruker Biotyper Offline Classification client has it's own usability flaws, the file open browser is much more convenient. A specialized tree view structure for the MALDI workflow that adheres to a similiar concept:

  • Sample 1
    • Replicate 1
    • Replicate 2
    • Replicate 3
  • Sample 2
    • Replicate 1
    • Replicate 2
    • Replicate 3

etc. makes it less painful to mass import data.

Automatic check for mass shifts

According to the Bruker Biotyper database creation procedures mass spectra of the same technical replicate have to adhere to the following specifications:

mass range max. allowed difference of masses [Da] max. allowed deviation (500 ppm)
3000-4000 1.47 339.79
4000-5000 2.17 230.73
5000-6000 2.74 182.55
6000-7000 3.13 319.28
7000-8000 3.58 418.59
8000-9000 4.05 247.16
9000-10000 4.69 426.80
10000-11000 5.21 286.66

Measurement is done from the highest part of the peak in flexAnalysis. Doing this manually with Microsoft Excel is tedious and screams for automation or at least a nice visualization, which highlights differences of masses in a table or even better the selected peak with color.

Own .product file and standalone release

Configuring the Bruker flex plugin to not filter masses and to display profile mass spectra is a huge technical / usability barrier for most microbiologists who will try the pre-release builds.

Don't force microFlex users to click through large amounts of sub-directories

The Bruker file format consists of several sub-folders, which makes navigation hard. With the introduction of randomly generated GUIDs like 4ec601cc-355a-4346-9253-3a266905d4df from the Compass 4 series this becomes even more of a problem as data is hard to find in the file explorer:

image

The folder structure looks like this:

  • project hash (e.g. 4ec601cc-355a-4346-9253-3a266905d4df)
    • statusInfo.json (contains the status of the measurement for online identification)
    • sample hash (e.g. 8aa8ba7a-e2eb-4e5b-81cc-9bf86533624e)
      • info (file contains the analyte ID)
      • 0_C1 (folder, probably position on the target)
        • 1 (always "1")
          • 1SLin (always "1SLin")
            • acqu (textfile with instrument metadata)
            • acqus (textfile with instrument metadata)
            • fid (contains the binary data)
            • sptype (textfile containing "tof")
            • pdata (another folder)
              • 1 (folder, always "1")
                • 1r (binary data)
                • proc (textfile with instrument metadata, looks redundant)
                • procs (textfile with instrument metadata, looks redundant)

Brukers own Explorer offline identification software automatically skips folder until it sees the data, so it needs less clicks, but it can't read the automation server metadata, which is very inconvenient when working with data generated by Bruker Biotyper Online systems or Bruker Biotyper Satellite systems:

image

with the only workaround being to filter by time and date of the measurement:

image

So we have a chance to make it better than the original. I suggest not making the fid file inside the complex folder structure, but the folders above clickable and maybe even already display the analyte ID. Some kind of generic system would be a good idea so we don't end up with vendor specific file browsers.

auto-detect profile mass spectra

At least the Bruker flex file format #18 contains the information and we also parse and save it:

[...]

// MASS SPECTRUM (CENTROID OR PROFILE)
value = extractValue(ACQU_DATATYPE, line);
if(value != null) {
	if(value.contains("CONTINUOUS")) {
		massSpectrumType = 1; // profile
	} else {
		massSpectrumType = 0; // centroid
	}
}

[...]

It is exposed in the interface IRegularMassSpectrum. So in my eyes the preference can be obsoleted and there is also no need for writing specialized MALDI-TOF MS spectrum views like ProfileMassSpectrumViewTOF.java

Batch processing

Current Freeware tools provided by Bruker Daltonics such as CompassXport are command line based and designed for the Microsoft Windows operating system only. I also haven't managed to get them up and running on my office/home PCs as they might need a connection to flexAnalysis (Setup.exe seems to install weird DLLs and COM connections) or the command and control server of the MALDI operator machine.

An easy configurable GUI to walk you through the data processing and exporting to replace this functionality can be achieved with an Eclipse wizard.

Baseline correction for MALDI-TOF MS

Currently various algorithms are available to chromatograms. For MALDI MS spectra without chromatography none are available for now. As the application is slightly different, re-using existing algorithms may not be enough.

@sgibb proposed using

in a 2011 conference publication.

Depends on https://bugs.eclipse.org/bugs/show_bug.cgi?id=496773 which has been resolved already.

MSP file format support

The MALDI Biotyper system stores Main Spectra Projection (MSP) files which are used as reference with proprietary automatic biomarker matching algorithms. According to the Bruker database creation protocol multiple measurements (8 technical replicates) of a single defined strain (1 biological replicate) is needed to capture the whole biological variability of an organism. MSPs are then added to hierarchical projects/libraries to cover a whole taxonomic species.

MSPs are the sum of at least 21 spectra (8 spots, measured 3x) which undergo manual quality control to detect mass shifts that stem from erroneous sample preparation and are calibrated against 1 BTS (bacterial test standard) spot. The MSP creation itself is unsupervised and automated. It employs de-noising and patented mass corrections to the peak data.

Due to the proprietary nature of the file format and the unknown specifications which are very much built around the original Biotyper algorithm, write support can probably never achieve true compatibility. Read support is probably possible, but may just reveal a very reduced peak list with some metadata (correction factors, matching hints, strain information) that is used by database developers and partly exposed to the Bruker Biotyper Compass UI to end users.

mzIdentML support

In contrast to the proprietary MSP file format #2 the mzIdentML (previously known as analysisXML) may be suitable as an open standard to exchange library entries. It might be able to close the gap between too simplistic CSV peak lists and other raw data missing vital metadata of sample origin and search instructions to standardize matching across applications.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.