annotate README @ 5:3bd1087db05e draft

Add dependecies for bowtie, blat, and faToTwoBit
author Jim Johnson <jj@umn.edu>
date Fri, 04 Jan 2013 15:01:19 -0600
parents 679a5c7b1294
children 1ea6445491d4
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
4
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
1 The DeFuse galaxy tool is based on DeFuse_Version_0.5.0
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
2 http://sourceforge.net/apps/mediawiki/defuse/index.php?title=Main_Page
1
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
3
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
4 DeFuse is a software package for gene fusion discovery using RNA-Seq data. The software uses clusters of discordant paired end alignments to inform a split read alignment analysis for finding fusion boundaries. The software also employs a number of heuristic filters in an attempt to reduce the number of false positives and produces a fully annotated output for each predicted fusion.
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
5
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
6
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
7 Manual:
4
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
8 http://sourceforge.net/apps/mediawiki/defuse/index.php?title=DeFuse_Version_0.4.2
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
9
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
10 The included tool_dependencies.xml will download and install the defuse code.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
11 It will set the environment variable: "DEFUSE_PATH" to the location of the defuse install.
5
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
12 The tool_dependencies.xml also has the download for bowtie.
1
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
13
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
14
4
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
15 The defuse.pl command relies on a configuration file to specifiy options, the location of reference data, and other applications that it depends upon: bowtie, bowtie-build, samtools, blat, fatotwobit, R, and Rscript.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
16
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
17 The DeFuse galaxy tool can either construct the config.txt file that is mentioned in the defuse manual, or select an existing config.txt file in the users history.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
18 When constructing the config.txt file, the DeFuse tool uses the values selected in: tool-data/defuse.loc
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
19 The dictionary field in the tool-data/defuse.loc can be used to set fields in the config.txt file, including the site specific location of reference data and the paths to the other application binaries.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
20 The "Defuse parameter settings" are used to alter options in the config.txt file.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
21
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
22 The DeFuse galaxy tool also generates a bash script to run defuse.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
23 That script will attempt to edit the config.txt file to specifiy any unset paths to applications that defuse relies upon:
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
24 bowtie, bowtie-build, samtools, blat, fatotwobit, R, and Rscript
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
25 The script uses the using the shell "which" command to discover the application path, so the required applications should in PATH environment variable.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
26
1
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
27
2
4245c2b047de Changes for defuse-0.4.3, modifications for non-human genomes no longer required, defuse.xml searches for location of scripts/defuse.pl
Jim Johnson <jj@umn.edu>
parents: 1
diff changeset
28 Generate Reference Datasets as described in the Manual:
4245c2b047de Changes for defuse-0.4.3, modifications for non-human genomes no longer required, defuse.xml searches for location of scripts/defuse.pl
Jim Johnson <jj@umn.edu>
parents: 1
diff changeset
29
4
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
30 The manual has detailed instructions on how to set up reference datasets for Human hg19 and hg18.
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
31 We were able to follow the same basic procedures to set up a reference for Mouse mm9.
2
4245c2b047de Changes for defuse-0.4.3, modifications for non-human genomes no longer required, defuse.xml searches for location of scripts/defuse.pl
Jim Johnson <jj@umn.edu>
parents: 1
diff changeset
32
4
679a5c7b1294 deFuse version 0.5.0 - Use tool_dependencies.xml
Jim Johnson <jj@umn.edu>
parents: 2
diff changeset
33 These datasets should be referenced in the tool-data/defuse.loc file.
1
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
34
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
35
5
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
36 External Tools
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
37 deFuse relies on other publically available tools as part of its pipeline. Some of these tools are not included with the deFuse download. Obtain these tools as detailed below.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
38 Download bowtie:
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
39 The latest version of bowtie can be downloaded from sourceforge: http://sourceforge.net/projects/bowtie-bio/files/bowtie/. deFuse has been tested on version 0.12.5.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
40 Set the bowtie_bin and bowtie_build_bin entries in config.txt to the fully qualified paths of the bowtie and bowtie-build binaries.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
41 Download blat and faToTwoBit
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
42 The latest blat tool suite can be downloaded from the ucsc website: http://hgdownload.cse.ucsc.edu/admin/exe/. Download blat and faToTwoBit and set the blat_bin and fatotwobit_bin entries in config.txt to the fully qualified paths of the blat and faToTwoBit binaries.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
43 Download R
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
44 The latest version of R can be downloaded from the R project website: http://www.r-project.org/. Install R and then locate the R and Rscript executables, and set the r_bin and rscript_bin entries in config.txt to the path of those executables.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
45 Install the kernlab package. Run R, then at the prompt type install.packages("kernlab")
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
46 Creating required derivative files
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
47 Once the required files and tools have been downloaded, the create_reference_dataset.pl script will build any derivative files including bowtie indices and 2bit files. Run the following command. Expect this step to take at least 12 hours.
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
48 create_reference_dataset.pl -c config.txt
1
36306d8086fa Remove defuse dir
Jim Johnson <jj@umn.edu>
parents:
diff changeset
49
5
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
50
3bd1087db05e Add dependecies for bowtie, blat, and faToTwoBit
Jim Johnson <jj@umn.edu>
parents: 4
diff changeset
51