annotate README.rst @ 0:ed0d0eda36a9 draft default tip
"planemo upload for repository https://github.com/usegalaxy-be/galaxytools/tree/main/gtf_to_bed commit 66fba7c9dccfddadce13aad591f441c66c3c309b-dirty"
author | padge |
---|---|
date | Wed, 29 Sep 2021 13:50:53 +0000 |
parents | |
children |
bwakit
=======

This script, written by Guy Bottu for the GenePattern server of VIB Bioinformatics Core,
takes as input a GTF file and writes a BED file in 12 column format
with information about transcripts, for use with RSeQC.

The "thick" information is about the coding region. Ideally it goes from
start codon to stop codon, but if information is lacking (e.g. because
of missing sequence or missing annotation), we use the CDS information.
For some transcripts there are multiple start or stop codons. We always
choose the "thick" so that it has maximum length.

If there is no CDS information (as for ncRNA) the "thick" will have just a
repeat of the transcript start position, as per BED convention.

Modified for integration under GenePattern.

usage : perl gtf_to_bed.pl <GTF file> <output file>
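
To illustrate the "thick" rules described above, here is a minimal Python sketch. It is not the code of gtf_to_bed.pl; the function name ``bed12_line`` and its arguments are hypothetical. It assumes that "maximum length" can be obtained by pooling the CDS, start_codon and stop_codon intervals of one transcript and taking the widest span, and it writes fixed placeholder values for the score and itemRgb columns. Input coordinates are GTF style (1-based, end inclusive); output is BED style (0-based start, exclusive end).

```python
def bed12_line(chrom, name, strand, exons, coding=()):
    """Build one 12-column BED line for a single transcript.

    exons and coding are lists of (start, end) pairs in GTF coordinates
    (1-based, end inclusive). coding pools the CDS, start_codon and
    stop_codon features of the transcript; leave it empty for ncRNA.
    Assumption: this pooling is only one way to get a maximum-length thick.
    """
    exons = sorted(exons)
    chrom_start = exons[0][0] - 1              # BED starts are 0-based
    chrom_end = max(end for _, end in exons)   # BED ends are exclusive

    if coding:
        # Widest span over every coding feature gives the longest "thick".
        thick_start = min(start for start, _ in coding) - 1
        thick_end = max(end for _, end in coding)
    else:
        # No coding information: repeat the transcript start position.
        thick_start = thick_end = chrom_start

    block_sizes = [end - start + 1 for start, end in exons]
    block_starts = [start - 1 - chrom_start for start, _ in exons]

    fields = [
        chrom, chrom_start, chrom_end, name,
        0,                                     # score (placeholder)
        strand, thick_start, thick_end,
        0,                                     # itemRgb (placeholder)
        len(exons),
        ",".join(map(str, block_sizes)),
        ",".join(map(str, block_starts)),
    ]
    return "\t".join(map(str, fields))


if __name__ == "__main__":
    # Coding transcript: two CDS segments plus a separate stop codon.
    print(bed12_line("chr1", "tx1", "+",
                     exons=[(101, 200), (301, 400)],
                     coding=[(151, 200), (301, 350), (351, 353)]))
    # Non-coding transcript: thickStart and thickEnd repeat the start.
    print(bed12_line("chr1", "nc1", "-", exons=[(11, 20)]))
```

Pooling all coding features means the stop codon, which GTF annotations commonly list outside the CDS, still extends the thick region when it is present, while a transcript with only CDS records falls back to the CDS span.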