Mercurial > repos > padge > gtf_to_bed_script
view README.rst @ 0:ed0d0eda36a9 draft default tip
"planemo upload for repository https://github.com/usegalaxy-be/galaxytools/tree/main/gtf_to_bed commit 66fba7c9dccfddadce13aad591f441c66c3c309b-dirty"
author | padge |
---|---|
date | Wed, 29 Sep 2021 13:50:53 +0000 |
parents | |
children |
line wrap: on
line source
bwakit ======= Written by Guy Bottu for the GenePattern server of VIB BioinforlmaticsCore, takes as input a GTF file and writes a BED file in 12 column format with information about transcripts, for use with RSeqC. The "thick" information is about the coding region, ideally it goes from start codon to stop codon, but is information is lacking (e.g. because of missing sequence or missing annotation), we use the CDS information. For some transcripts there are multiple start or stop codons. We amways choose the "thick" so that is has maximum length. If there is no CDS information (as for ncRNA) the "thick" will have just a repeat of the transcript start position, as per BED convention. modified for integration under GenePattern usage : perl gtf_to_bed.pl <GTF file> <output file>