Repository ngsrich
Name: ngsrich
Owner: pfrommolt
Synopsis: Evaluate enrichment performance in targeted resequencing
This is a software to create detailed statistics on the target enrichment performance of libraries created with sequence capture methods, e.g. in whole-exome resequencing studies.
Type: unrestricted
Revision: 0:89ad0a9cca52
This revision can be installed: False
Times cloned / installed: 725

Repository README files - may contain important installation or license information

NGSrich - Version 0.5.4 Target enrichment performance for next-generation sequencing. Description: NGSrich is a tool for researchers concerned with target enrichment issues in next-generation sequencing. It evaluates the target enrichment performance of target regions given in BED format and outputs summary statistics, visualizations, and some additional files giving details on the analysis. --------------------------------------------------------------------------------------------------------- Usage: java NGSrich evaluate -r -u -t [(-a|-g) ] [ -s ] [-T ] [ -o ] [-p -h ][--no-details] The program should be started from the bin directory of the software. Required arguments: : Path to the read alignment file in SAM or BAM alignment format (http://samtools.sourceforge.net/SAM-1.3.pdf) : UCSC genome version name (e.g. 'hg19'). : Path to the target file in BED annotation format (http://genome.ucsc.edu/FAQ/FAQformat.html#format1, 3 columns required) Optional arguments: : the local path of the annotation of the genome [default: genome annotation is downloaded from the site of the ucsc genome browser and placed into the temporary directory of the sample.] : Name of the sample being processed - defaults to prefix of . : Temporary directory - defaults to '/tmp'. : Output directory - defaults to '/enrichment'. : Coverage cutoff to define poorly covered genes - defaults to 2. : Coverage cutoff to define highly covered genes - defaults to 200. --no-details: Represses the computation of the evaluation details. Output: 1. The HTML file contains a report of the target enrichment results including tables of poorly and highly covered genes (defined by cutoffs and . For vizualisation, plots are embedded in this document. The contents of this report are actually based on an evaluation of the XML and BED files. 2. The XML file contains the summary statistics for the enrichment performance. 3. The BED file contains detailed coverage information for each single region specified in the input target file. 4. The WIG files (http://genome.ucsc.edu/goldenPath/help/wiggle.html) contain a per-base description of the coverage on the target regions, one for the target regions specified in the input target file and one for the whole genome (gaps are skipped). These files can be used for visualization in a Genome Browser, e.g. at http://genome.ucsc.edu/cgi-bin/hgGateway. --------------------------------------------------------------------------------------------------------- Several enrichment performance reports can be summarized by the command NGSrich-summarize. This should also be started from the bin directory of the software. Usage: java NGSrich summarize -r -o [ -p -h ] Required arguments: : Text file with each line containing the output directory of the report for one of the samples to be summarized. : Output directory. Optional arguments: : Coverage cutoff to define poorly covered genes - defaults to 2. : Coverage cutoff to define highly covered genes - defaults to 200. Output: The HTML file contains a summary report and summarizing performance statistics for several target enrichment experiments. For each sample, the detailed performance report can be accessed by a hyperlink. --------------------------------------------------------------------------------------------------------- Requirements: The NGSrich software runs only on 64-bit Linux. It requires an installation of R and the Java Runtime Environment (JRE), which are preinstalled on most Linux distributions. If not, please visit the official sites for detailed installation instructions. Contact: You can send an e-mail to the NGSrich mailing list at . Please tell us your experiences with the software itself and the documentation. We particularly welcome new bug reports and suggestions for new or enhanced features.

Categories
Statistics - Tools for generating statistics