# HG changeset patch # User rhpvorderman # Date 1637077352 0 # Node ID 81b3eb11ed2c9c752e7f80cf56f7ca363fe96a7b # Parent 0ef7f80ea0610169f78d9b410f8ea2273e701a49 "planemo upload commit 6c0195cc4de6a34e2c46d875be4fc2157a21cdf6" diff -r 0ef7f80ea061 -r 81b3eb11ed2c .gitattributes --- a/.gitattributes Thu Feb 25 13:39:50 2021 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,4 +0,0 @@ -# Auto detect text files and perform LF normalization -* text=auto -# Convert to LF line endings on checkout. -*.sh text eol=lf diff -r 0ef7f80ea061 -r 81b3eb11ed2c .gitignore --- a/.gitignore Thu Feb 25 13:39:50 2021 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,2 +0,0 @@ - -argalaxy\.tar\.gz diff -r 0ef7f80ea061 -r 81b3eb11ed2c CHANGES.md --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/CHANGES.md Tue Nov 16 15:42:32 2021 +0000 @@ -0,0 +1,3 @@ +version 1.0.1 +--------------- ++ Quote some variables to prevent ambiguous redirects diff -r 0ef7f80ea061 -r 81b3eb11ed2c LICENSE --- a/LICENSE Thu Feb 25 13:39:50 2021 +0000 +++ b/LICENSE Tue Nov 16 15:42:32 2021 +0000 @@ -1,21 +1,21 @@ -MIT License - -Copyright (c) 2018 David van Zessen - -Permission is hereby granted, free of charge, to any person obtaining a copy -of this software and associated documentation files (the "Software"), to deal -in the Software without restriction, including without limitation the rights -to use, copy, modify, merge, publish, distribute, sublicense, and/or sell -copies of the Software, and to permit persons to whom the Software is -furnished to do so, subject to the following conditions: - -The above copyright notice and this permission notice shall be included in all -copies or substantial portions of the Software. - -THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR -IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, -FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE -AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER -LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, -OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE +MIT License + +Copyright (c) 2018 David van Zessen + +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in all +copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. \ No newline at end of file diff -r 0ef7f80ea061 -r 81b3eb11ed2c README.md --- a/README.md Thu Feb 25 13:39:50 2021 +0000 +++ b/README.md Tue Nov 16 15:42:32 2021 +0000 @@ -1,123 +1,123 @@ -# ARGalaxy Immune Repertoire -This is the GitHub repository for the ARGalaxy Immune repertoire pipeline. -The Galaxy tool version can be found [here](https://toolshed.g2.bx.psu.edu/repository/browse_repositories_i_own?sort=name&operation=view_or_manage_repository&id=2e457d63170a4b1c). -The docker version can be found [here](https://github.com/ErasmusMC-Bioinformatics/ARGalaxy-docker). - -## Overview - -In execution order: - -#### imgt_loader or igblast - -###### imgt_loader (Recommended) -Start the analysis with [IMGT HighV Quest](https://www.imgt.org/HighV-QUEST/) archives. -An IMGT archive file holds [multiple tabular files](http://www.imgt.org/IMGT_vquest/share/textes/imgtvquest.html#output3), this script extracts the specific columns relevant to the analysis from several of these files. - -`Rscript imgt_loader.r 1_Summary.txt 3_Nt-sequences.txt 5_AA-sequences.txt 6_Junction.txt 4_IMGT-gapped-AA-sequences.txt /path/to/output.txt` - - -###### igblast -Start the analysis with FASTA files that are aligned with [igblast](https://www.ncbi.nlm.nih.gov/igblast/). -Note that this method will provide less information than the IMGT archive. - -`sh igblast.sh /path/to/sequences.fasta species locus /path/to/output.txt` - -#### experimental_design -This script will merge multiple result files (from the last step) into a single file with an additional ID and Replicate column to differentiate the individual samples during the analysis and to allow for analysis across samples. - -`Rscript experimental_design.r /path/to/input_1 id_1 [/path/to/input_2 id_2] [/path/to/input_n id_n] /path/to/output` - -#### report_clonality -The R script that creates the analysis result. - -`sh r_wrapper.sh /path/to/experimental_design/output.txt /path/to/output_dir/output.html /path/to/output_dir "clonaltype" "species" "locus" "filter_productive" "clonality_method"` - -###### parameters -Clonaltype: -- none -- Top.V.Gene,CDR3.Seq -- Top.V.Gene,CDR3.Seq.DNA -- Top.V.Gene,Top.J.Gene,CDR3.Seq -- Top.V.Gene,Top.J.Gene,CDR3.Seq.DNA -- Top.V.Gene,Top.D.Gene,Top.J.Gene,CDR3.Seq.DNA - -Species: -- Homo sapiens functional -- Homo sapiens -- Homo sapiens non-functional -- Bos taurus -- Bos taurus functional -- Bos taurus non-functional -- Camelus dromedarius -- Camelus dromedarius functional -- Camelus dromedarius non-functional -- Canis lupus familiaris -- Canis lupus familiaris functional -- Canis lupus familiaris non-functional -- Danio rerio -- Danio rerio functional -- Danio rerio non-functional -- Macaca mulatta -- Macaca mulatta functional -- Macaca mulatta non-functional -- Mus musculus -- Mus musculus functional -- Mus musculus non-functional -- Mus spretus -- Mus spretus functional -- Mus spretus non-functional -- Oncorhynchus mykiss -- Oncorhynchus mykiss functional -- Oncorhynchus mykiss non-functional -- Ornithorhynchus anatinus -- Ornithorhynchus anatinus functional -- Ornithorhynchus anatinus non-functional -- Oryctolagus cuniculus -- Oryctolagus cuniculus functional -- Oryctolagus cuniculus non-functional -- Rattus norvegicus -- Rattus norvegicus functional -- Rattus norvegicus non-functional -- Sus scrofa -- Sus scrofa functional -- Sus scrofa non-functional - -Locus: -- TRA -- TRD -- TRG -- TRB -- IGH -- IGI -- IGK -- IGL - -Filter productive: -- yes -- no - -Clonality Method: -- none -- old -- boyd - -## complete.sh -This script will run all of the above for you, it will detect if you are using FASTA files or IMGT archives and use the appropriate tools. - -`sh complete.sh /path/to/input_1 id_1 [/path/to/input_n id_n] /path/to/out_dir/out.html /path/to/out_dir clonaltype species locus filter_productive clonality_method` -See "report_clonality" for the parameter options. - -## Dependencies -- Linux -- R - - gridExtra - - ggplot2 - - plyr - - data.table - - reshape2 - - lymphclon - -#### optional -- Circos -- IgBlast -- igblastwrp +# ARGalaxy Immune Repertoire +This is the GitHub repository for the ARGalaxy Immune repertoire pipeline. +The Galaxy tool version can be found [here](https://toolshed.g2.bx.psu.edu/repository/browse_repositories_i_own?sort=name&operation=view_or_manage_repository&id=2e457d63170a4b1c). +The docker version can be found [here](https://github.com/ErasmusMC-Bioinformatics/ARGalaxy-docker). + +## Overview + +In execution order: + +#### imgt_loader or igblast + +###### imgt_loader (Recommended) +Start the analysis with [IMGT HighV Quest](https://www.imgt.org/HighV-QUEST/) archives. +An IMGT archive file holds [multiple tabular files](http://www.imgt.org/IMGT_vquest/share/textes/imgtvquest.html#output3), this script extracts the specific columns relevant to the analysis from several of these files. + +`Rscript imgt_loader.r 1_Summary.txt 3_Nt-sequences.txt 5_AA-sequences.txt 6_Junction.txt 4_IMGT-gapped-AA-sequences.txt /path/to/output.txt` + + +###### igblast +Start the analysis with FASTA files that are aligned with [igblast](https://www.ncbi.nlm.nih.gov/igblast/). +Note that this method will provide less information than the IMGT archive. + +`sh igblast.sh /path/to/sequences.fasta species locus /path/to/output.txt` + +#### experimental_design +This script will merge multiple result files (from the last step) into a single file with an additional ID and Replicate column to differentiate the individual samples during the analysis and to allow for analysis across samples. + +`Rscript experimental_design.r /path/to/input_1 id_1 [/path/to/input_2 id_2] [/path/to/input_n id_n] /path/to/output` + +#### report_clonality +The R script that creates the analysis result. + +`sh r_wrapper.sh /path/to/experimental_design/output.txt /path/to/output_dir/output.html /path/to/output_dir "clonaltype" "species" "locus" "filter_productive" "clonality_method"` + +###### parameters +Clonaltype: +- none +- Top.V.Gene,CDR3.Seq +- Top.V.Gene,CDR3.Seq.DNA +- Top.V.Gene,Top.J.Gene,CDR3.Seq +- Top.V.Gene,Top.J.Gene,CDR3.Seq.DNA +- Top.V.Gene,Top.D.Gene,Top.J.Gene,CDR3.Seq.DNA + +Species: +- Homo sapiens functional +- Homo sapiens +- Homo sapiens non-functional +- Bos taurus +- Bos taurus functional +- Bos taurus non-functional +- Camelus dromedarius +- Camelus dromedarius functional +- Camelus dromedarius non-functional +- Canis lupus familiaris +- Canis lupus familiaris functional +- Canis lupus familiaris non-functional +- Danio rerio +- Danio rerio functional +- Danio rerio non-functional +- Macaca mulatta +- Macaca mulatta functional +- Macaca mulatta non-functional +- Mus musculus +- Mus musculus functional +- Mus musculus non-functional +- Mus spretus +- Mus spretus functional +- Mus spretus non-functional +- Oncorhynchus mykiss +- Oncorhynchus mykiss functional +- Oncorhynchus mykiss non-functional +- Ornithorhynchus anatinus +- Ornithorhynchus anatinus functional +- Ornithorhynchus anatinus non-functional +- Oryctolagus cuniculus +- Oryctolagus cuniculus functional +- Oryctolagus cuniculus non-functional +- Rattus norvegicus +- Rattus norvegicus functional +- Rattus norvegicus non-functional +- Sus scrofa +- Sus scrofa functional +- Sus scrofa non-functional + +Locus: +- TRA +- TRD +- TRG +- TRB +- IGH +- IGI +- IGK +- IGL + +Filter productive: +- yes +- no + +Clonality Method: +- none +- old +- boyd + +## complete.sh +This script will run all of the above for you, it will detect if you are using FASTA files or IMGT archives and use the appropriate tools. + +`sh complete.sh /path/to/input_1 id_1 [/path/to/input_n id_n] /path/to/out_dir/out.html /path/to/out_dir clonaltype species locus filter_productive clonality_method` +See "report_clonality" for the parameter options. + +## Dependencies +- Linux +- R + - gridExtra + - ggplot2 + - plyr + - data.table + - reshape2 + - lymphclon + +#### optional +- Circos +- IgBlast +- igblastwrp diff -r 0ef7f80ea061 -r 81b3eb11ed2c complete.sh --- a/complete.sh Thu Feb 25 13:39:50 2021 +0000 +++ b/complete.sh Tue Nov 16 15:42:32 2021 +0000 @@ -1,6 +1,6 @@ #!/bin/bash set -e -inputFiles=($1) +inputFiles=("$1") outputDir=$3 outputFile=$3/index.html #$1 clonalType=$4 @@ -12,16 +12,16 @@ html=$2 dir="$(cd "$(dirname "$0")" && pwd)" array=("$@") -echo "
info | |
----------------------------------- |
info | |
----------------------------------- | |
Sample $count of patient $id is not a zip file so assuming fasta/fastq, using igBLASTn |
t |
t |
CDR3 characteristics tab
- -This tab gives information on the -different characteristics of the CDR3.
- -Graphs
- -length distribution of CDR3
- -Graph visualizing for each donor the -distribution of the CDR3 length in amino acids between all sequences.
- -Amino acid composition of CDR3
- -Bar graph visualizing for each donor how -frequent each amino acid is used in the CDR3 of all sequences. The red colour -in the background indicate positively charged amino acids, and the blue colour -in the background indicate negatively charged amino acids. The amino acids are -ordered from hydrophilic (left) to hydrophobic (right).
- -Table
- -Table showing for each donor the median -CDR3 length.
- -- -
CDR3 characteristics tab
+ +This tab gives information on the +different characteristics of the CDR3.
+ +Graphs
+ +length distribution of CDR3
+ +Graph visualizing for each donor the +distribution of the CDR3 length in amino acids between all sequences.
+ +Amino acid composition of CDR3
+ +Bar graph visualizing for each donor how +frequent each amino acid is used in the CDR3 of all sequences. The red colour +in the background indicate positively charged amino acids, and the blue colour +in the background indicate negatively charged amino acids. The amino acids are +ordered from hydrophilic (left) to hydrophobic (right).
+ +Table
+ +Table showing for each donor the median +CDR3 length.
+ ++ +
Martin Krzywinski, Jacqueline Schein, İnan -Birol, Joseph Connors, Randy Gascoyne, Doug Horsman, Steven J. Jones and Marco -A. Marra (2009). Circos: An information aesthetic for comparative genomics. In Genome -Research 10.1101/gr.092759.109.
- -Circos plot showing the V-D, V-J, and D-J gen usage. -The thicker the line the more frequent this gene combination is used.
- -Martin Krzywinski, Jacqueline Schein, İnan +Birol, Joseph Connors, Randy Gascoyne, Doug Horsman, Steven J. Jones and Marco +A. Marra (2009). Circos: An information aesthetic for comparative genomics. In Genome +Research 10.1101/gr.092759.109.
+ +Circos plot showing the V-D, V-J, and D-J gen usage. +The thicker the line the more frequent this gene combination is used.
+ +For each donor a -subtab is provided containing information on the sequence overlap between -replicates of the different replicates of one donor. Dependent on the settings -of the Shared clonal types / clonality filter a clonality score is given. The clonality -score is calculated according to the formula described by Boyd -et al, PMID: 20161664.
- -For each -individual replicate the number of reads is reported. In addition, the number -of coincidences between the different replicates of the same donor are reported. -
- -- -
For each donor a +subtab is provided containing information on the sequence overlap between +replicates of the different replicates of one donor. Dependent on the settings +of the Shared clonal types / clonality filter a clonality score is given. The clonality +score is calculated according to the formula described by Boyd +et al, PMID: 20161664.
+ +For each +individual replicate the number of reads is reported. In addition, the number +of coincidences between the different replicates of the same donor are reported. +
+ ++ +
The tab allows -you to compare the VD, VJ and DJ heatmaps of multiple donors to each other.
- -By ticking the include box of a donor, the three -heatmaps of this donor is visualized underneath eachother. By clicking the -include box of multiple replicates the heatmaps of these replicates are visualized -next to each other allowing easy comparison of heatmaps.
- -- -
The tab allows +you to compare the VD, VJ and DJ heatmaps of multiple donors to each other.
+ +By ticking the include box of a donor, the three +heatmaps of this donor is visualized underneath eachother. By clicking the +include box of multiple replicates the heatmaps of these replicates are visualized +next to each other allowing easy comparison of heatmaps.
+ ++ +
Downloads -tab
- -Description
- -The -filtered dataset: Allows you to download information on V D and J gene usage, -the junctions and the amino acid sequence of the sequences that passed the chosen -filter settings. Note: this file contains only 1 sequences of a replicate -sequence.
- -Gene -frequencies
- -The dataset used to generate the distribution -of V gene families graph: Downloads a table with information used to generate -the distribution of V gene families graph.
- -The dataset used to generate the -distribution of D gene families graph: Downloads a table with information used -to generate the distribution of D gene families graph.
- -The -dataset used to generate the relative frequency of V gene usage graph: -Downloads the data used to generate the relative frequency of V gene usage -graph. In addition information on the chromosomal order of the V genes can be -found.
- -The -dataset used to generate the relative frequency of D gene usage graph: -Downloads the data used to generate the relative frequency of D gene usage -graph. In addition information on the chromosomal order of the D genes can be -found.
- -The -dataset used to generate the relative frequency of J gene usage graph: Downloads -the data used to generate the relative frequency of J gene usage graph. In -addition information on the chromosomal order of the J genes can be found.
- -The dataset used to generate the relative frequency of the D -reading frame graph: Downloads -the data used to generate the relative frequency of the D reading frame.
- -CDR3 characteristics
- -The dataset used to generate the CDR3 -length frequency graph: Downloads a table with information used for the -length distribution of CDR3 plot.
- -The -dataset used to generate the Amino Acid Composition in the CDR3 graph: -Downloads the frequency of the different amino acids in the CDR3 of all -filtered sequences. This data set is used for the generation of the Amino Acid -Composition in the CDR3 graph. In addition the order of amino acids based on hydrophobicity -in included.
- -Heatmaps
- -The -data used to generate the VD heatmap for donor name: -Downloads the data set used for the generation of the VD heatmap. For each -uploaded donor a separate download is generated.
- -The -data used to generate the VJ heatmap for donor name: -Downloads the data set used for the generation of the VJ heatmap. For each -uploaded donor a separate download is generated.
- -The -data used to generate the DJ heatmap for sample name: -Downloads the data set used for the generation of the DJ heatmap. For each -uploaded donor a separate download is generated.
- -Circos
- -The -data used to generate the VD circus plots for donor name: -Downloads the data set used for the generation of the VD heatmap. For each -uploaded donor a separate download is generated.
- -The -data used to generate the VJ circus plots for donor name: -Downloads the data set used for the generation of the VJ heatmap. For each -uploaded donor a separate download is generated.
- -The -data used to generate the DJ circus plots for sample name: -Downloads the data set used for the generation of the DJ heatmap. For each -uploaded donor a separate download is generated.
- -Clonality
- -The -dataset used to calculate clonality score: Allows you to -download information on V D and J gene usage, the junctions and the amino acid -sequence of the sequences that passed the chosen filter settings. Note: this -file contains all the replicate sequences form the different input files.
- -Sequences -that are present in more than one replicate: Allows you to -download a table containing all the sequences that were present in more than -one replicate. It provides the concidence type, clonaltype, sequence name, -replicate number, and the nucleotide sequence
- -- -
Downloads +tab
+ +Description
+ +The +filtered dataset: Allows you to download information on V D and J gene usage, +the junctions and the amino acid sequence of the sequences that passed the chosen +filter settings. Note: this file contains only 1 sequences of a replicate +sequence.
+ +Gene +frequencies
+ +The dataset used to generate the distribution +of V gene families graph: Downloads a table with information used to generate +the distribution of V gene families graph.
+ +The dataset used to generate the +distribution of D gene families graph: Downloads a table with information used +to generate the distribution of D gene families graph.
+ +The +dataset used to generate the relative frequency of V gene usage graph: +Downloads the data used to generate the relative frequency of V gene usage +graph. In addition information on the chromosomal order of the V genes can be +found.
+ +The +dataset used to generate the relative frequency of D gene usage graph: +Downloads the data used to generate the relative frequency of D gene usage +graph. In addition information on the chromosomal order of the D genes can be +found.
+ +The +dataset used to generate the relative frequency of J gene usage graph: Downloads +the data used to generate the relative frequency of J gene usage graph. In +addition information on the chromosomal order of the J genes can be found.
+ +The dataset used to generate the relative frequency of the D +reading frame graph: Downloads +the data used to generate the relative frequency of the D reading frame.
+ +CDR3 characteristics
+ +The dataset used to generate the CDR3 +length frequency graph: Downloads a table with information used for the +length distribution of CDR3 plot.
+ +The +dataset used to generate the Amino Acid Composition in the CDR3 graph: +Downloads the frequency of the different amino acids in the CDR3 of all +filtered sequences. This data set is used for the generation of the Amino Acid +Composition in the CDR3 graph. In addition the order of amino acids based on hydrophobicity +in included.
+ +Heatmaps
+ +The +data used to generate the VD heatmap for donor name: +Downloads the data set used for the generation of the VD heatmap. For each +uploaded donor a separate download is generated.
+ +The +data used to generate the VJ heatmap for donor name: +Downloads the data set used for the generation of the VJ heatmap. For each +uploaded donor a separate download is generated.
+ +The +data used to generate the DJ heatmap for sample name: +Downloads the data set used for the generation of the DJ heatmap. For each +uploaded donor a separate download is generated.
+ +Circos
+ +The +data used to generate the VD circus plots for donor name: +Downloads the data set used for the generation of the VD heatmap. For each +uploaded donor a separate download is generated.
+ +The +data used to generate the VJ circus plots for donor name: +Downloads the data set used for the generation of the VJ heatmap. For each +uploaded donor a separate download is generated.
+ +The +data used to generate the DJ circus plots for sample name: +Downloads the data set used for the generation of the DJ heatmap. For each +uploaded donor a separate download is generated.
+ +Clonality
+ +The +dataset used to calculate clonality score: Allows you to +download information on V D and J gene usage, the junctions and the amino acid +sequence of the sequences that passed the chosen filter settings. Note: this +file contains all the replicate sequences form the different input files.
+ +Sequences +that are present in more than one replicate: Allows you to +download a table containing all the sequences that were present in more than +one replicate. It provides the concidence type, clonaltype, sequence name, +replicate number, and the nucleotide sequence
+ ++ +
Gene frequencies tab
- -This tab gives
-information on the frequency of V, D and J gene usage.
Graphs
- -Distribution of
-V gene families
Bar graph showing the frequency of V gene family usage
-in all uploaded donors.
Relative
-frequency of V gene usage
Bar graph showing the frequency of V genes in all
-uploaded donors. The order of the V genes on the x-axis can be altered by
-altering the Order of V(D)J genes
-in graphs filter on the main page of the Immune repertoire pipeline.
Relative
-frequency of D gene usage
Bar graph showing the frequency of D genes in all uploaded donors. The
-order of the D genes on the x-axis can be altered by altering the Order of V(D)J genes in graphs filter on
-the main page of the Immune repertoire pipeline.
Relative
-frequency of J gene usage
Bar graph showing the frequency of J genes in all uploaded donors. The
-order of the J genes on the x-axis can be altered by altering the Order of V(D)J genes in graphs filter on
-the main page of the Immune repertoire pipeline.
D
-reading frame
Bar graph visualizing the frequency by which each D reading frame is -used in each donor.
- -Gene frequencies tab
+ +This tab gives
+information on the frequency of V, D and J gene usage.
Graphs
+ +Distribution of
+V gene families
Bar graph showing the frequency of V gene family usage
+in all uploaded donors.
Relative
+frequency of V gene usage
Bar graph showing the frequency of V genes in all
+uploaded donors. The order of the V genes on the x-axis can be altered by
+altering the Order of V(D)J genes
+in graphs filter on the main page of the Immune repertoire pipeline.
Relative
+frequency of D gene usage
Bar graph showing the frequency of D genes in all uploaded donors. The
+order of the D genes on the x-axis can be altered by altering the Order of V(D)J genes in graphs filter on
+the main page of the Immune repertoire pipeline.
Relative
+frequency of J gene usage
Bar graph showing the frequency of J genes in all uploaded donors. The
+order of the J genes on the x-axis can be altered by altering the Order of V(D)J genes in graphs filter on
+the main page of the Immune repertoire pipeline.
D
+reading frame
Bar graph visualizing the frequency by which each D reading frame is +used in each donor.
+ +In this tab -heatmaps are provided showing the frequency of different combinations V and D, -D and J and V and J genes according together. For each donor a subtab with all -three heatmaps is provided.
- -- -
In this tab +heatmaps are provided showing the frequency of different combinations V and D, +D and J and V and J genes according together. For each donor a subtab with all +three heatmaps is provided.
+ ++ +
Junction -analysis tab
- -Tables:
- -The unique -rearrangements are separated in sequences that contain both a -V, D and J gene and sequences that contain only a V and J gene.
- -Productive mean
-tables
Reports -for each donor the number of productive sequences and the mean number of -deletions, N-nucleotides and P-nucleotides. -For explanation about the different deletions and N and P nucleotides see -figure above.
- -Unproductive
-mean table
Reports -for each donor the number of unproductive sequences and the mean number of -deletions, N-nucleotides and P-nucleotides. -For explanation about the different deletions and N and P nucleotides see -figure above.
- -Productive
-median table
Reports -for each donor the number of productive sequences and the median number of -deletions, N-nucleotides and P-nucleotides. -For explanation about the different deletions and N and P nucleotides see -figure above.
- -Unproductive
-median table
Reports -for each donor the number of unproductive sequences and the median number of -deletions, N-nucleotides and P-nucleotides. -For explanation about the different deletions and N and P nucleotides see -figure above.
- -- -
Junction +analysis tab
+ +Tables:
+ +The unique +rearrangements are separated in sequences that contain both a +V, D and J gene and sequences that contain only a V and J gene.
+ +Productive mean
+tables
Reports +for each donor the number of productive sequences and the mean number of +deletions, N-nucleotides and P-nucleotides. +For explanation about the different deletions and N and P nucleotides see +figure above.
+ +Unproductive
+mean table
Reports +for each donor the number of unproductive sequences and the mean number of +deletions, N-nucleotides and P-nucleotides. +For explanation about the different deletions and N and P nucleotides see +figure above.
+ +Productive
+median table
Reports +for each donor the number of productive sequences and the median number of +deletions, N-nucleotides and P-nucleotides. +For explanation about the different deletions and N and P nucleotides see +figure above.
+ +Unproductive
+median table
Reports +for each donor the number of unproductive sequences and the median number of +deletions, N-nucleotides and P-nucleotides. +For explanation about the different deletions and N and P nucleotides see +figure above.
+ ++ +
Table showing the number and percentage of -(unique) productive and unproductive sequences per sample and per replicate.
- -- -
The definition -of unique sequences is based on the clonal type definition filter setting -chosen.
- -- -
Table showing the number and percentage of +(unique) productive and unproductive sequences per sample and per replicate.
+ ++ +
The definition +of unique sequences is based on the clonal type definition filter setting +chosen.
+ ++ +