Giter Club home page Giter Club logo

nuc_processing's People

Contributors

tjs23 avatar wb104 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

nuc_processing's Issues

nun for hybrid genome

I try to process single cell hic data as following:
nuc_processing/nuc_process -i inbc_CGTCTCGT_oubc_CTGTCATT/inbc_CGTCTCGT_oubc_CTGTCATT.r?.fastq -g genome/B6 -g2 genome/Cast -re1 MboI -s 150-1000 -n 8 -f genome/bowtie2_B6/genome.fa -f2 genome/bowtie2_mask_CAST/genome_mask_CAST.fa -v -a -k -c 2

But it shows error.
nuc_process: error: unrecognized arguments: Cast genome/bowtie2_mask_CAST/genome_mask_CAST.fa

Can you please help with this?

Thanks

Gang

TypeError when creating restriction enzyme fragment file

Using the release_1.0 branch: when I attempt to run nuc_process and create the restriction enzyme track, it fails with the following error

 INFO : Creating restriction enzyme fragment file /path/to/bowtieindex/RE_frag_MboI_GCA_000001405.15_GRCh38_no_alt_analysis_set.txt
    INFO : Calculating MboI fragment locations for chr10 contig chr10
Traceback (most recent call last):
  File "/path/to/.conda/envs/nuc_process/bin/nuc_process", line 11, in <module>
    sys.exit(main())
  File "/path/to/.conda/envs/nuc_process/lib/python3.6/site-packages/nuc_processing/NucProcess.py", line 2866, in main
    lig_junc, zip_files, sam_format, verbose)
  File "/path/to/.conda/envs/nuc_process/lib/python3.6/site-packages/nuc_processing/NucProcess.py", line 2581, in nuc_process
    re1_files = [check_re_frag_file(genome_index, re1, g_fastas, align_exe, num_cpu, remap=remap)]
  File "/path/to/.conda/envs/nuc_process/lib/python3.6/site-packages/nuc_processing/NucProcess.py", line 1594, in check_re_frag_file
    frag_data[contig] = get_chromo_re_fragments(fasta_file_objs, contig, seq.upper(), re_site, cut_pos)
  File "/path/to/.conda/envs/nuc_process/lib/python3.6/site-packages/nuc_processing/NucProcess.py", line 1454, in get_chromo_re_fragments
    sub_seq = frag_seq[a:b]
TypeError: slice indices must be integers or None or have an __index__ method

This happend using different assembly fasta files so I took a look at the NucProcess.py and saw that the a and b variables at line 1454 function seem to take on non-integer values. I was able to get NucProcess to run by modifying line 1434 in the NucProcessfile to step = int(mappability_length/2), forcing step to be rounded in case it takes on a non-integer value.

With this change nuc_process produces results, however I am unsure if this may cause any unwanted effects down the line. While I didn't see how this would be influenced by my input files for how I call the nucProcess function, I may be missing something.
For completeness sake I have added my nuc_process function call.

nuc_process -f /path/to/fastas/grch38_fastas/*.fa -o sample1 -v -a -k -re1 MboI -s 150-2000 -n 4 -g /path/to/bowtieindex/GCA_000001405.15_GRCh38_no_alt_analysis_set -i /path/to/fastq1 /path/to/fastq2

Help

Hello!
I want to convert my hic contact data to NCC format,but I don't know how to get the line "The number of the ambiguity group to which the paired reads belong" and "Whether read pairs are swapped relative to original FASTQ files".Can you help me ? Thank you

Parameter for data without restriction site (e.g. DNase type)

I'm currently working with single-cell data with DNAse type.
Which doesn't contain restriction enzyme.
Is there a way to run nuc_process with that data?
Perhaps a specific way to annotate the enzymes.conf file?

Thanks and hope to hear from you again.
G.V.

Getting contact map for individual chromosomes?

Hi Tim,

Currently the contact_map.svg file shows all chromosomes for a cell. Is it possible to obtain the contact matrix for individual chromosomes in .npy or related formats? using nuc_contact_map function?

Thanks,
Tarak

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.