0
|
1 NGSrich - Version 0.5.4
|
|
2 Target enrichment performance for next-generation sequencing.
|
|
3
|
|
4
|
|
5 Description:
|
|
6 NGSrich is a tool for researchers concerned with target enrichment issues in next-generation sequencing.
|
|
7 It evaluates the target enrichment performance of target regions given in BED format and outputs
|
|
8 summary statistics, visualizations, and some additional files giving details on the analysis.
|
|
9
|
|
10
|
|
11 ---------------------------------------------------------------------------------------------------------
|
|
12
|
|
13
|
|
14 Usage:
|
|
15 java NGSrich evaluate -r <readsFile> -u <genomeName> -t <target>
|
|
16 [(-a|-g) <annotation>] [ -s <sName> ] [-T <tmpDir>] [ -o <outDir>] [-p <poor> -h <high>][--no-details]
|
|
17
|
|
18
|
|
19 The program should be started from the bin directory of the software.
|
|
20 Required arguments:
|
|
21 <readsFile>: Path to the read alignment file in SAM or BAM alignment format
|
|
22 (http://samtools.sourceforge.net/SAM-1.3.pdf)
|
|
23 <genomeName>: UCSC genome version name (e.g. 'hg19').
|
|
24 <target>: Path to the target file in BED annotation format
|
|
25 (http://genome.ucsc.edu/FAQ/FAQformat.html#format1, 3 columns required)
|
|
26 Optional arguments:
|
|
27 <annotation>: the local path of the annotation of the genome
|
|
28 [default: genome annotation is downloaded from the site of the ucsc genome
|
|
29 browser and placed into the temporary directory of the sample.]
|
|
30 <sName>: Name of the sample being processed - defaults to prefix of <readsFile>.
|
|
31 <tmpDir>: Temporary directory - defaults to '/tmp'.
|
|
32 <outDir>: Output directory - defaults to '<pathToReadsFile>/enrichment'.
|
|
33 <poor>: Coverage cutoff to define poorly covered genes - defaults to 2.
|
|
34 <high>: Coverage cutoff to define highly covered genes - defaults to 200.
|
|
35 --no-details: Represses the computation of the evaluation details.
|
|
36
|
|
37
|
|
38 Output:
|
|
39 1. The HTML file contains a report of the target enrichment results including
|
|
40 tables of poorly and highly covered genes (defined by cutoffs <poor> and <high>.
|
|
41 For vizualisation, plots are embedded in this document. The contents of this report
|
|
42 are actually based on an evaluation of the XML and BED files.
|
|
43 2. The XML file contains the summary statistics for the enrichment performance.
|
|
44 3. The BED file contains detailed coverage information for each single region specified in
|
|
45 the input target file.
|
|
46 4. The WIG files (http://genome.ucsc.edu/goldenPath/help/wiggle.html) contain a per-base
|
|
47 description of the coverage on the target regions, one for the target regions specified in
|
|
48 the input target file and one for the whole genome (gaps are skipped). These files can be
|
|
49 used for visualization in a Genome Browser, e.g. at http://genome.ucsc.edu/cgi-bin/hgGateway.
|
|
50
|
|
51
|
|
52 ---------------------------------------------------------------------------------------------------------
|
|
53
|
|
54
|
|
55 Several enrichment performance reports can be summarized by the command NGSrich-summarize.
|
|
56 This should also be started from the bin directory of the software.
|
|
57
|
|
58 Usage: java NGSrich summarize -r <inputIndexFile> -o <outDir> [ -p <poor> -h <high> ]
|
|
59
|
|
60 Required arguments:
|
|
61 <inputIndexFile>: Text file with each line containing the output directory of
|
|
62 the report for one of the samples to be summarized.
|
|
63 <outDir>: Output directory.
|
|
64
|
|
65 Optional arguments:
|
|
66 <poor>: Coverage cutoff to define poorly covered genes - defaults to 2.
|
|
67 <high>: Coverage cutoff to define highly covered genes - defaults to 200.
|
|
68
|
|
69
|
|
70 Output:
|
|
71 The HTML file contains a summary report and summarizing performance statistics for several
|
|
72 target enrichment experiments. For each sample, the detailed performance report can be
|
|
73 accessed by a hyperlink.
|
|
74
|
|
75
|
|
76 ---------------------------------------------------------------------------------------------------------
|
|
77
|
|
78
|
|
79 Requirements:
|
|
80 The NGSrich software runs only on 64-bit Linux. It requires an installation of R and the Java
|
|
81 Runtime Environment (JRE), which are preinstalled on most Linux distributions. If not,
|
|
82 please visit the official sites for detailed installation instructions.
|
|
83
|
|
84
|
|
85 Contact:
|
|
86 You can send an e-mail to the NGSrich mailing list at <NGSrich-users@lists.sourceforge.net>.
|
|
87 Please tell us your experiences with the software itself and the documentation. We particularly welcome new
|
|
88 bug reports and suggestions for new or enhanced features.
|
|
89
|