# HG changeset patch # User nml # Date 1570564125 14400 # Node ID e79a8dad83b4053989b1b8d3105f7eafcc93cc95 # Parent b02c775b27c84e8ef8695bebff2e9d21ebac83a3 "planemo upload for repository https://github.com/phac-nml/ecoli_serotyping commit 42aaf40ea7e6658cf115edd8f2014ed0db9dbe38" diff -r b02c775b27c8 -r e79a8dad83b4 README.md --- a/README.md Thu Jan 31 09:39:04 2019 -0500 +++ b/README.md Tue Oct 08 15:48:45 2019 -0400 @@ -1,4 +1,4 @@ -# EC-Typer - in silico serotyping of *Escherichia coli* species -EC-Typer is a standalone serotyping module for *Escherichia coli* typing. It supports fasta and fastq file formats. Designed by Dr. Chad Liang et al. Wrapped into Galaxy wrapper by Dr. Kyrylo Bessonov +# EC-Typer - in silico serotyping of *Escherichia coli* and other cryptic *Escherichia* species +EC-Typer is a standalone serotyping module for *Escherichia coli* typing with automatic non-E.coli species identification. It supports fasta and fastq file formats (i.e. assembled and raw reads). Designed by Dr. Chad Liang et al. Maintained by Dr. Kyrylo Bessonov. This wrapper is designed and produced by Dr. Kyrylo Bessonov For more information please refer to https://github.com/phac-nml/ecoli_serotyping diff -r b02c775b27c8 -r e79a8dad83b4 ectyper.xml --- a/ectyper.xml Thu Jan 31 09:39:04 2019 -0500 +++ b/ectyper.xml Tue Oct 08 15:48:45 2019 -0400 @@ -1,29 +1,29 @@ - + ectyper is a standalone serotyping module for Escherichia coli. It supports fasta and fastq file formats. - ectyper + ectyper 0 - #set $genomes = $genomes + ',' + str($i.name) + #set $genomes = $genomes + ',' + str($i.element_identifier) #else - #set $genomes = str($i.name) + #set $genomes = str($i.element_identifier) #end if #end for #else - ln -s "${input}" "${input.name}" && - #set $genomes = $input.name + ln -s "${input}" "${input.element_identifier}" && + #set $genomes = $input.element_identifier #end if #if $mash_input ln -s "${mash_input}" mash_sketch.msh && #end if - + ectyper --cores \${GALAXY_SLOTS:-4} --input "${genomes}" --percentIdentity '$adv_param.min_percentIdentity' @@ -41,19 +41,19 @@ ]]> - - + +
- +
- - + + adv_param['logging']==True @@ -75,7 +75,11 @@ **Syntax** -This tool identifies the serotype of Escherichia coli genome sequences based on a set of *wzm/wzt*, *wzx/wzy* and *fliC/flkA/flmA* alleles corresponding to O and H antigens, respectively. + +This tool identifies the serotype of assembled or assembly-free Escherichia coli genome sample based on a set of either *wzm/wzt* or *wzx/wzy* and *fliC/flkA/flmA* alleles corresponding to O and H antigens, respectively. +The non-E.coli genomes and other Escherichia genus species are successfully identified and well handled. The 0.9.0 version improves tool sensitivy when target alleles are truncated or +poorly covered by raw reads. + For more information please visit https://github.com/phac-nml/ecoli_serotyping. @@ -83,9 +87,12 @@ **Input:** -Accepts a variety of inputs including single or multiple FASTQ and/or FASTA file(s). Inputs might contain pure raw reads, but for more accurate results draft assemblies are recommended. +Accepts a variety of inputs including both single and/or multiple FASTQ and/or FASTA file(s). Inputs might contain pure raw reads, but for more accurate results, draft assemblies are recommended. -Optionally select a MASH RefSeq genome sketch (version 2.0 and above) for cases when O/H typing would fail. Download RefSeq genome sketch containing 91,283 genomes with 1000 hashes each directly from https://gembox.cbcb.umd.edu/mash/refseq.genomes.k21s1000.msh . +The default MASH RefSeq genome sketch is included and updated every 6 months, but one can supply custom sketch file for species identification. + +One can download RefSeq genome sketch containing approximately 91,283 genomes from https://gembox.cbcb.umd.edu/mash/refseq.genomes.k21s1000.msh. + **Output:** @@ -110,4 +117,4 @@ url = {https://github.com/phac-nml/ecoli_serotyping} } -
\ No newline at end of file +