-
random_dna.py : creates a random sequence of DNA of length K
usage :
python random_dna.py K -
fasta_length.py : computes the length of DNA sequences in a Fasta file
usage :
python fasta_length.py file.fa -
reverse_complement.py : computes the revese complement of a DNA sequence
usage :
python reverse_complement.py CGGGTA -
faStats : compute sequence length of all sequences in a fasta file
usage :
python faStats file.fa -
melting_temp.py : compute melting temperature of a sequence
usage :
melting_temp.py DnaSequence -
entrez_specie.py : returns the specie/organism name given an ENTREZ id.
usage :
python entrez_specie.py 1567 -
fastq_split.py : splits a merged paired-end Illumina
{basename}.fastq(or compressedfastq.gz) file in{basename}.R1.fastqand{basename}.R2.fastqusage :
python fastq_split.py paired_end_file.fastq -
centrifuge2krona : converts a centrifuge output file to a krona visualisation using
centrifuge-kreportandktImportTaxonomy.usage :
centrifuge2krona centrifuge_file.out -
sam_filter : filters a sam file on identity percentage and alignment length.
usage:
sam_filter file.sam -
bam_filter : filters a bam file on identity percentage
usage:
bam_filter file.bam -
filterFastaByLength : filters a fasta file on sequence length (min and max)
usage:
filterFastaByLength -min 1 -max 66 file.fa -
krakenTometaphlan : converts a [Kraken] style report to a [Metaphlan] style report
usage:
krakenTometaphlan -o metaphlan_report.txt kraken_report.txt -
consensusMaker : creates a consensus fasta from a samtools mpileup file
usage :
consensusMaker -o myconsensus.fa infile.mpileup -
bed2coverage : computes the 10th percentile coverage for each feature in a
BEDfileusage :
bed2coverage infile.bed -
filterFastaByName: filters a fasta file given a list of sequence names
usage :
filterFastaByName infile.fasta seqnames_to_keep.txt -o outfile.fa -
eslfasta2fastq: Extracts the headers of fasta file formatted by Easel (hmmer toolkit) and get the matching fastq records
-
usage :
eslfasta2fastq fasta_input forward.fq -fq2 reverse.fq -
parallel_download: Download files from a list of files (on file per line) in a parallel fashion using multiprocessing, and subprocess calling wget
usage:
parallel_download list_of_files.txt -
fasta_split: Splits fasta sequences in shorter sequences, using a negative binomial distribution usage:
python fasta_split.py -m 800 input_sequences.fa -
compare_fasta_seqs: Compare two or more sequences in a fasta file usage:
compare_fasta_seqs multifasta.fa
- Option 1 : use aliases in your
~/.bash_profileor~/.bashrcfile to add each tool one by one (replace/path/to/DNA_tools/with the location of the tool)
example :
echo "alias revcom=python /path/to/DNA_tools/reverse_complement.py" >> ~/.bashrc
source ~/.bashrc
- Option 2 : Add all the DNA tools at once to your
$PATHenvironment variable (replace/path/to/with the location of theDNA_toolsdirectory)
example :
echo "export PATH=$PATH:/path/to/DNA_tools/" >> ~/.bashrc
source ~/.bashrc