0
|
1 Installs VirtualNormal Correction Tool
|
|
2
|
|
3 After installing this tool via admin panel, manually configure the following:
|
|
4
|
|
5 1) edit virtual_normal_correction.loc file
|
|
6
|
|
7 - change "/path/to/hg18.crr" to the location of the Complete Genomics reference crr file on your system
|
|
8 (can be downloaded from ftp://ftp.completegenomics.com/ReferenceFiles/ )
|
|
9
|
2
|
10 - change "/path/to/VN_genomes_varfiles_lists_hg18" to the location of the directory containing files with the locations of all the Complete Genomics
|
0
|
11 varfiles to be used as a virtual normal. This file should contain 1 file location per line, e.g.
|
|
12
|
3
|
13 /path/to/normal-varfile-1
|
|
14 /path/to/normal-varfile-2
|
|
15 /path/to/normal-varfile-3
|
|
16 /path/to/normal-varfile-4
|
|
17 /path/to/normal-varfile-5
|
|
18 /path/to/normal-varfile-6
|
|
19 /path/to/normal-varfile-7
|
|
20 /path/to/normal-varfile-8
|
0
|
21 ...
|
|
22
|
3
|
23 - edit the tool xml file to offer one or more different sets of virtual normals:
|
2
|
24
|
|
25 [..]
|
|
26 <!-- edit these options to reflect sets of normal you have available. The values must name files within the directories specified in data_table_conf.xml file -->
|
|
27 <param name="VNset" type="select" label="Select Virtual Normal set to use" help="1000Genomes set can only be used for hg19 samples, for hg18 54 genomes will be used.">
|
|
28 <option value="46_diversity.txt" > CG Diversity Panel and trios (54 Genomes) </option>
|
|
29 <option value="433_1000g.txt" > CG 1000G project genomes (433 Genomes) (hg19 only) </option>
|
|
30 <option value="479_diversity_1000g.txt" > Diversity and 1000G (479 genomes) (hg19 only) </option>
|
|
31 <option value="10_tutorial.txt" > Small VN for tutorial (10 Genomes) </option>
|
|
32 </param>
|
|
33 [..]
|
|
34
|
|
35 the values indicate files expected to be at the location configured in the loc file,
|
|
36
|
|
37
|
|
38 So if your .loc file looks like this:
|
|
39
|
|
40
|
3
|
41 #loc file for virtual normal tool
|
|
42 # <columns>value, dbkey, name, VN_genomes_varfiles_lists_location, VN_genomes_junctionfile_lists_location, reference_crr_cgatools</columns>
|
2
|
43
|
3
|
44 hg18 hg18 Virtual Normal hg18 /path/to/VN_genomes_varfiles_lists_hg18/ /path/to/VN_genomes_junctionfiles_lists_hg18/ /path/to/hg18.crr
|
|
45 hg19 hg19 Virtual_Normal hg19 /path/to/VN_genomes_varfiles_lists_hg19/ /path/to/VN_genomes_junctionfiles_lists_hg19/ /path/to/hg19.crr
|
2
|
46
|
|
47 And your xml file like the example above, then the tool expects the following files to exist:
|
|
48 /path/to/VN_genomes/varfiles_list_hg18/46_diversity.txt
|
|
49 /path/to/VN_genomes/varfiles_list_hg18/433_1000g.txt
|
3
|
50 ...etc..
|
2
|
51
|
3
|
52 and each of these files must contain a 1-per-line list of locations of the varfiles of the normal genomes (or junction files in case of SV analysis).
|
2
|
53
|
|
54 Varfiles can be in compressed or uncompressed form. For example, Complete Genomics' Diversity panel can be used.
|
0
|
55 (can be downloaded from ftp://ftp2.completegenomics.com/)
|
3
|
56
|
|
57
|
0
|
58
|
3
|
59 2) restart Galaxy for changes to take effect (alternatively, in the admin panel these setting can be reloaded without the need to restart your galaxy instance)
|
0
|
60
|
|
61 After this initial setup, additional normals can be added to the lists without having to restart Galaxy.
|