annotate ecoli_serotyping/README.md @ 29:5442b9d2e5d1 draft

Uploaded
author jpetteng
date Tue, 09 Jan 2018 11:02:23 -0500
parents 7c687720c9e9
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
1 # ECTyper (an easy typer)
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
2 **ECTyper** wraps a standalone serotyping module.
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
3 Supports _fasta_ and _fastq_ file formats
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
4
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
5 # Dependencies:
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
6 - python 3.6.3.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
7 - pandas 0.21.0.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
8 - samtools 1.5.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
9 - bowtie2 2.3.0.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
10 - mash 1.1.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
11 - bcftools 1.6.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
12 - biopython 1.69.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
13 - blast 2.2.31 .*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
14 - seqtk 1.2.*
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
15
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
16 # Installation
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
17 1. Get `miniconda` if you do not already have `miniconda` or `anaconda`:
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
18 1. `wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
19 1. `bash miniconda.sh -b -p $HOME/miniconda`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
20 1. `export PATH="$HOME/miniconda/bin:$PATH"`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
21 2. Install ectyper
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
22 * Directly via `conda`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
23 1. `conda install -c anaconda -c bioconda -c ubcsamsung ectyper`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
24 * Through `github`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
25 1. Install dependencies
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
26 `conda install pandas samtools bowtie2 mash bcftools biopython nose blast seqtk tqdm python=3.6`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
27 1. Download git repository then unzip
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
28 `wget https://github.com/phac-nml/ecoli_serotyping/archive/master.zip`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
29 1. Install ectyper inside unzipped directory
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
30 `python setup.py install`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
31
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
32 # Basic Usage
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
33 1. Put all of your fasta/fastq files in one folder (concatenate paired files if you want the result to be considered a single entity)
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
34 1. `ectyper -i [file path]`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
35 1. View the results on the console or in `output/[datatime]/output.csv`
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
36
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
37 # Example Usage
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
38 * `ectyper -i ecoliA.fasta` for a single file
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
39 * `ectyper -i ecoliA.fasta,ecoliB.fastq,ecoliC.fna` for multiple files
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
40 * `ectyper -i ecoli_folder` for a folder
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
41
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
42 # Advanced Usage
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
43 ```
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
44 usage: ectyper [-h] -i INPUT [-d PERCENTIDENTITY] [-l PERCENTLENGTH]
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
45 [--verify] [-s] [-v] [-o OUTPUT]
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
46
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
47 optional arguments:
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
48 -h, --help show this help message and exit
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
49 -i INPUT, --input INPUT
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
50 Location of new file(s). Can be a single file or a
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
51 directory
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
52 -d PERCENTIDENTITY, --percentIdentity PERCENTIDENTITY
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
53 Percentage of identity wanted to use against the
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
54 database. From 0 to 100, default is 90%.
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
55 -l PERCENTLENGTH, --percentLength PERCENTLENGTH
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
56 Percentage of length wanted to use against the
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
57 database. From 0 to 100, default is 50%.
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
58 --verify Enable E. Coli. verification
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
59 -s, --species Enable species identification when non-ecoli genome is
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
60 found Note: refseq downloading is required when
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
61 running this option for the first time.
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
62 -v, --verbose Enable detailed output
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
63 -o OUTPUT, --output OUTPUT
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
64 Directory location of output files.
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
65 ```
7c687720c9e9 Uploaded
jpetteng
parents:
diff changeset
66 * The first time species identification is enabled you will need to wait for **ectyper** to download the reference sequence.