Giter Club home page Giter Club logo

cellranger-atac's Introduction

Building Cell Ranger ATAC 1.2.0

Build dependencies

This package has been built internally using:

  • Python 2.7.13
  • clang 7.0 (gcc/g++ should also work)
  • go 1.12

Setting up the environment and build dependencies

Example setup of build dependencies on Ubuntu 20.04

# use gcc/g++ in this case to support GLIBCXX_3.4.25 in pre-built ranger package
sudo apt-get install make gcc g++ pkg-config

# Add golang to path
wget https://golang.org/dl/go1.12.17.linux-amd64.tar.gz
tar xf go1.12.17.linux-amd64.tar.gz
export PATH=/path/to/go/bin:$PATH
export GOROOT=/path/to/go

# Source the Ranger package for python, the Martian Environment and binary dependencies
source /path/to/ranger-1.2.0-ATAC/sourceme.bash

Build command

make

Running Cell Ranger ATAC

Setting up the Runtime dependencies for Cell Ranger ATAC

source /path/to/open-source/cellranger-atac/sourceme.bash

Note about Loupe

The binaries required to generate Loupe Cell Browser (.cloupe) are not included in this repository or in the binary dependencies package Ranger. The necessary binaries, including crconverter, bamtofastq, etc. can be obtained from an existing binary version of Cell Ranger ATAC by running:

cp /path/to/cellranger-atac/cellranger-atac-cs/1.2.0/lib/bin/crconverter /path/to/open-source/cellranger-atac/lib/bin/

Support

We do not provide support for building and running this code.

The officially supported release binaries are available at: (https://support.10xgenomics.com/single-cell-atac/software/downloads/latest)

cellranger-atac's People

Contributors

flyingrobin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

cellranger-atac's Issues

About raw_barcode_count in mark_duplicates

Thank you for the development of cellranger-atac pipeline. My colleague @haowenz and I are trying to understand the output from cellranger-atac, especially for the deduplication. During the exploration, we found that some of the most common barcodes in the dup groups did not agree with the frequency in the sample, thus the wrong barcodes became the representatives for the dup groups. After examining the code, we think the barcode_whitelist should be sorted to match the count:

https://github.com/10XGenomics/cellranger-atac/blob/main/mro/atac/stages/processing/mark_duplicates/__init__.py#L415-L421

After adding something like "barcode_whitelist = sorted(list(barcode_whitelist))" we got expected results. We are wondering if this is a valid fix. Is this a bug in CellRanger-atac? Thank you.

Title: CellRanger-ARC for Arbidosis failed with error

I am running CellRanger-ARC for Arabidopsis and I am getting the following error:

2023-06-02 14:40:02 [runtime] (failed)          ID.D3.SC_ATAC_GEX_COUNTER_CS.SC_ATAC_GEX_COUNTER._SC_ATAC_GEX_ANALYZER._PEAK_ANNOTATOR.ANNOTATE_PEAKS

[error] Pipestance failed. Error log at:
D3/SC_ATAC_GEX_COUNTER_CS/SC_ATAC_GEX_COUNTER/_SC_ATAC_GEX_ANALYZER/_PEAK_ANNOTATOR/ANNOTATE_PEAKS/fork0/join-uc5dd7a4574/_errors

Log message:
Traceback (most recent call last):
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/external/martian/adapters/python/martian_shell.py", line 659, in _main
stage.main()
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/external/martian/adapters/python/martian_shell.py", line 629, in main
lambda: self._module.join(args, outs, chunk_defs, chunk_outs)
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/external/martian/adapters/python/martian_shell.py", line 589, in _run
cmd()
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/external/martian/adapters/python/martian_shell.py", line 629, in <lambda>
lambda: self._module.join(args, outs, chunk_defs, chunk_outs)
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/mro/atac/stages/analysis/annotate_peaks/init.py", line 105, in join
gene_id_name_map = build_gene_id_name_map(ref_mgr)
File "/data/rc/apps/rc/software/CellRanger-ARC/2.0.0/mro/atac/stages/analysis/annotate_peaks/init.py", line 60, in build_gene_id_name_map
fields.attrs.get("gene_name", fields.attrs["gene_id"])
File "pybedtools/cbedtools.pyx", line 392, in pybedtools.cbedtools.Interval.attrs.get
File "pybedtools/cbedtools.pyx", line 180, in pybedtools.cbedtools.Attributes.init
ValueError: need more than 1 value to unpack

I have tried the following to resolve the issue:

I have updated CellRanger-ARC to the latest version.
I have checked the input data to make sure that it is in the correct format.
I have checked the software used to run CellRanger-ARC to make sure that it is up to date and that it has all of the necessary dependencies.
I am still unable to resolve the issue.

Please help me to troubleshoot this issue.

Thank you.

Odd fragment.tsv.gz files

Hi!

I inspected my fragment files using gzip -dc fragments.tsv.gz | head and got the following output:

Screen Shot 2024-03-21 at 6 05 02 PM

I believe this is causing problems when reading the file into python's pycistopic package. Other users show that their output for the same line of code is as follows:

image

Any way of removing that information at the start of the fragment file?

Thanks,
Susana

cellranger-atac argg balcklist ratio Error

As this issue suggested, stuart-lab/signac#889:
cellRanger released the new version(2.0.0) to preprocess ATAC-seq data. and it announced that the annotation track 'blacklist.bed' has been removed, so the column "blacklist_region_fragments" in the singlecell.csv output for the corresponding tracks are all zero.

but in the singlecell.csv , the output of cellranger-atac agrr , the blacklist ratio is calculated.

most importantly, the blacklist ratio seems wrong for some reason. I found this problem by running the same sample with cellranger-atac V1.2 and v2.0.0, which reported totally different blacklist ratio.

preflight error UTF-8

Hello all,
I am trying to run the cellranger atac pipeline on SRA derived fastqs. I continuously run into a preflight error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte

Is there a fix for this? I tested multiple fasts to rule out input problems.

Non-nuclear read pairs

I'm using version 2.0.0. In the summary.csv file, there's a value reported for Non-nuclear read pairs. How is this number calculated? I can see in the html that it describes cell barcodes with >Q30 but i'm trying to reproduce this number using the singlecell.csv metrics and can't figure it out.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.