0
|
1 Installs VirtualNormal Correction Tool
|
|
2
|
|
3 After installing this tool via admin panel, manually configure the following:
|
|
4
|
|
5 1) edit virtual_normal_correction.loc file
|
|
6
|
|
7 - change "/path/to/hg18.crr" to the location of the Complete Genomics reference crr file on your system
|
|
8 (can be downloaded from ftp://ftp.completegenomics.com/ReferenceFiles/ )
|
|
9
|
2
|
10 - change "/path/to/VN_genomes_varfiles_lists_hg18" to the location of the directory containing files with the locations of all the Complete Genomics
|
0
|
11 varfiles to be used as a virtual normal. This file should contain 1 file location per line, e.g.
|
|
12
|
|
13 /path/to/normal-varfile-1
|
|
14 /path/to/normal-varfile-2
|
|
15 /path/to/normal-varfile-3
|
|
16 /path/to/normal-varfile-4
|
|
17 /path/to/normal-varfile-5
|
|
18 /path/to/normal-varfile-6
|
|
19 /path/to/normal-varfile-7
|
|
20 /path/to/normal-varfile-8
|
|
21 ...
|
|
22
|
2
|
23 - edit the tool xml file to offer sets of virtual normals
|
|
24
|
|
25 [..]
|
|
26 <!-- edit these options to reflect sets of normal you have available. The values must name files within the directories specified in data_table_conf.xml file -->
|
|
27 <param name="VNset" type="select" label="Select Virtual Normal set to use" help="1000Genomes set can only be used for hg19 samples, for hg18 54 genomes will be used.">
|
|
28 <option value="46_diversity.txt" > CG Diversity Panel and trios (54 Genomes) </option>
|
|
29 <option value="433_1000g.txt" > CG 1000G project genomes (433 Genomes) (hg19 only) </option>
|
|
30 <option value="479_diversity_1000g.txt" > Diversity and 1000G (479 genomes) (hg19 only) </option>
|
|
31 <option value="10_tutorial.txt" > Small VN for tutorial (10 Genomes) </option>
|
|
32 </param>
|
|
33 [..]
|
|
34
|
|
35 the values indicate files expected to be at the location configured in the loc file,
|
|
36
|
|
37
|
|
38 So if your .loc file looks like this:
|
|
39
|
|
40
|
|
41 #loc file for annovar tool
|
|
42
|
|
43 # <columns>value, dbkey, name, VN_genomes_varfiles_list, VN_genomes_junctionfile_list, reference_crr_cgatools</columns>
|
|
44
|
|
45 hg18 hg18 Virtual Normal hg18 /path/to/VN_genomes_varfiles_lists_hg18 /path/to/VN_genomes_junctionfiles_lists_hg18 /path/to/hg18.crr
|
|
46 hg19 hg19 Virtual_Normal hg19 /path/to/VN_genomes_varfiles_lists_hg19 /path/to/VN_genomes_junctionfiles_lists_hg19 /path/to/hg19.crr
|
|
47
|
|
48 And your xml file like the example above, then the tool expects the following files to exist:
|
|
49 /path/to/VN_genomes/varfiles_list_hg18/46_diversity.txt
|
|
50 /path/to/VN_genomes/varfiles_list_hg18/433_1000g.txt
|
|
51 etc
|
|
52 and containing a 1-per-line list of locations of the varfiles of the normal genomes.
|
|
53
|
|
54
|
|
55
|
|
56
|
|
57 Varfiles can be in compressed or uncompressed form. For example, Complete Genomics' Diversity panel can be used.
|
0
|
58 (can be downloaded from ftp://ftp2.completegenomics.com/)
|
|
59
|
2
|
60 - change "/path/to/VN_genomes_junctionfiles_lists_hg18" to the location of the file containing the locations of all the Complete Genomics
|
0
|
61 junctionfiles to be used as a virtual normal. This file should contain 1 file location per line. For example, Complete Genomics'
|
|
62 Diversity panel can be used. (can be downloaded from ftp://ftp2.completegenomics.com/)
|
|
63
|
|
64 2) restart Galaxy for changes to take effect
|
|
65
|
|
66 After this initial setup, additional normals can be added to the lists without having to restart Galaxy.
|