52
|
1 # ARGalaxy Immune Repertoire
|
|
2 This is the GitHub repository for the ARGalaxy Immune repertoire pipeline.
|
|
3 The Galaxy tool version can be found [here](https://toolshed.g2.bx.psu.edu/repository/browse_repositories_i_own?sort=name&operation=view_or_manage_repository&id=2e457d63170a4b1c).
|
|
4 The docker version can be found [here](https://github.com/ErasmusMC-Bioinformatics/ARGalaxy-docker).
|
|
5
|
|
6 ## Overview
|
|
7
|
|
8 In execution order:
|
|
9
|
|
10 #### imgt_loader or igblast
|
|
11
|
|
12 ###### imgt_loader (Recommended)
|
|
13 Start the analysis with [IMGT HighV Quest](https://www.imgt.org/HighV-QUEST/) archives.
|
|
14 An IMGT archive file holds [multiple tabular files](http://www.imgt.org/IMGT_vquest/share/textes/imgtvquest.html#output3), this script extracts the specific columns relevant to the analysis from several of these files.
|
|
15
|
|
16 `Rscript imgt_loader.r 1_Summary.txt 3_Nt-sequences.txt 5_AA-sequences.txt 6_Junction.txt 4_IMGT-gapped-AA-sequences.txt /path/to/output.txt`
|
|
17
|
|
18
|
|
19 ###### igblast
|
|
20 Start the analysis with FASTA files that are aligned with [igblast](https://www.ncbi.nlm.nih.gov/igblast/).
|
|
21 Note that this method will provide less information than the IMGT archive.
|
|
22
|
|
23 `sh igblast.sh /path/to/sequences.fasta species locus /path/to/output.txt`
|
|
24
|
|
25 #### experimental_design
|
|
26 This script will merge multiple result files (from the last step) into a single file with an additional ID and Replicate column to differentiate the individual samples during the analysis and to allow for analysis across samples.
|
|
27
|
|
28 `Rscript experimental_design.r /path/to/input_1 id_1 [/path/to/input_2 id_2] [/path/to/input_n id_n] /path/to/output`
|
|
29
|
|
30 #### report_clonality
|
|
31 The R script that creates the analysis result.
|
|
32
|
|
33 `sh r_wrapper.sh /path/to/experimental_design/output.txt /path/to/output_dir/output.html /path/to/output_dir "clonaltype" "species" "locus" "filter_productive" "clonality_method"`
|
|
34
|
|
35 ###### parameters
|
|
36 Clonaltype:
|
|
37 - none
|
|
38 - Top.V.Gene,CDR3.Seq
|
|
39 - Top.V.Gene,CDR3.Seq.DNA
|
|
40 - Top.V.Gene,Top.J.Gene,CDR3.Seq
|
|
41 - Top.V.Gene,Top.J.Gene,CDR3.Seq.DNA
|
|
42 - Top.V.Gene,Top.D.Gene,Top.J.Gene,CDR3.Seq.DNA
|
|
43
|
|
44 Species:
|
|
45 - Homo sapiens functional
|
|
46 - Homo sapiens
|
|
47 - Homo sapiens non-functional
|
|
48 - Bos taurus
|
|
49 - Bos taurus functional
|
|
50 - Bos taurus non-functional
|
|
51 - Camelus dromedarius
|
|
52 - Camelus dromedarius functional
|
|
53 - Camelus dromedarius non-functional
|
|
54 - Canis lupus familiaris
|
|
55 - Canis lupus familiaris functional
|
|
56 - Canis lupus familiaris non-functional
|
|
57 - Danio rerio
|
|
58 - Danio rerio functional
|
|
59 - Danio rerio non-functional
|
|
60 - Macaca mulatta
|
|
61 - Macaca mulatta functional
|
|
62 - Macaca mulatta non-functional
|
|
63 - Mus musculus
|
|
64 - Mus musculus functional
|
|
65 - Mus musculus non-functional
|
|
66 - Mus spretus
|
|
67 - Mus spretus functional
|
|
68 - Mus spretus non-functional
|
|
69 - Oncorhynchus mykiss
|
|
70 - Oncorhynchus mykiss functional
|
|
71 - Oncorhynchus mykiss non-functional
|
|
72 - Ornithorhynchus anatinus
|
|
73 - Ornithorhynchus anatinus functional
|
|
74 - Ornithorhynchus anatinus non-functional
|
|
75 - Oryctolagus cuniculus
|
|
76 - Oryctolagus cuniculus functional
|
|
77 - Oryctolagus cuniculus non-functional
|
|
78 - Rattus norvegicus
|
|
79 - Rattus norvegicus functional
|
|
80 - Rattus norvegicus non-functional
|
|
81 - Sus scrofa
|
|
82 - Sus scrofa functional
|
|
83 - Sus scrofa non-functional
|
|
84
|
|
85 Locus:
|
|
86 - TRA
|
|
87 - TRD
|
|
88 - TRG
|
|
89 - TRB
|
|
90 - IGH
|
|
91 - IGI
|
|
92 - IGK
|
|
93 - IGL
|
|
94
|
|
95 Filter productive:
|
|
96 - yes
|
|
97 - no
|
|
98
|
|
99 Clonality Method:
|
|
100 - none
|
|
101 - old
|
|
102 - boyd
|
|
103
|
|
104 ## complete.sh
|
|
105 This script will run all of the above for you, it will detect if you are using FASTA files or IMGT archives and use the appropriate tools.
|
|
106
|
|
107 `sh complete.sh /path/to/input_1 id_1 [/path/to/input_n id_n] /path/to/out_dir/out.html /path/to/out_dir clonaltype species locus filter_productive clonality_method`
|
|
108 See "report_clonality" for the parameter options.
|
|
109
|
|
110 ## Dependencies
|
|
111 - Linux
|
|
112 - R
|
|
113 - gridExtra
|
|
114 - ggplot2
|
|
115 - plyr
|
|
116 - data.table
|
|
117 - reshape2
|
|
118 - lymphclon
|
|
119
|
|
120 #### optional
|
|
121 - Circos
|
|
122 - IgBlast
|
|
123 - igblastwrp
|