Skip to content

xr-song/ginkgo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

361 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A modified version of Ginkgo copy number calling pipeline

The original version can be found here.

Prerequisites

  • Clone the current repository: git clone https://github.com/xr-song/ginkgo.git
  • Include Java in your PATH
  • Have R installed in a conda environment
  • Make sure you have the following R packages installed:
BiocManager::install("ctc")
BiocManager::install("DNAcopy") 
install.packages("inline")
install.packages("gplots")
install.packages("scales")
install.packages("plyr")
install.packages("gridExtra")
  • Run make to compile the .cpp source files:
cd /path/to/ginkgo
make
  • Prepare the genome reference files including variably sized bins and GC content per bin. The authors of Ginkgo do not provide reference for GRCh38 and T2T-CHM13v2.0.
  • Prepare the config file. See examples. Major choices include bin size, ploidy, and whether to remove bad bins. Make sure to change chosen_genome to the directory containing the reference files.

Usage

Submit a job with the following commands (replace with your actual paths):

export PATH=/path/to/environment/bin:$PATH # environment with R and required packages installed

wd=$(realpath "$(pwd)/..") # your working directory
BARCODES_LIST=$wd/barcode_lists/barcodes.txt # one barcode per line, (partially) matching the corresponding bed file name
SOURCE_DIR=/path/to/bedfile # source directory of all the bed files to consider
OUTPUT_DIR=$wd/ginkgo_output # output directory
CONFIG=/path/to/config # config file
REPO_DIR=/path/to/ginkgo # path to this repository

sh ${REPO_DIR}/scripts/run_ginkgo.sh $BARCODES_LIST $SOURCE_DIR $OUTPUT_DIR $CONFIG $REPO_DIR

Calling run_ginkgo.sh will take you through all the steps of running Ginkgo. It takes ~1 hour with <1G memory for 1k cells.

Check SegCopy file in the output directory for the called copy numbers.

About

Modified version of the single-cell copy-number variation analysis tool

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • PHP 36.3%
  • JavaScript 25.4%
  • Shell 11.5%
  • R 10.9%
  • C++ 6.5%
  • CSS 5.8%
  • Other 3.6%