annotate microsatbed/.shed.yml @ 6:c5324bf8a52c draft

Uploaded
author fubar
date Sun, 14 Jul 2024 04:49:00 +0000
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
6
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
1 name: microsatbedfubar
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
2 owner: fubar
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
3 categories:
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
4 - Sequence Analysis
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
5 description: Select microsatellites for a bed file
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
6 homepage_url: https://github.com/lmdu/pytrf
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
7 long_description: |
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
8 See https://pytrf.readthedocs.io/en/latest/ for the pytrf documentation.
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
9 A Tandem repeat (TR) in genomic sequence is a set of adjacent short DNA sequence repeated consecutively. The core sequence or repeat unit is generally called motif.
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
10 According to the motif length, tandem repeats can be classified as microsatellites and minisatellites. Microsatellites are also known as simple sequence repeats (SSRs)
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
11 or short tandem repeats (STRs) with motif length of 1-6 bp. Minisatellites are also sometimes referred to as variable number of tandem repeats (VNTRs) has longer
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
12 motif length than microsatellites. Pytrf is a lightweight Python C extension for identification of tandem repeats, for both exact or perfect SSRs.
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
13 It also can find generic tandem repeats with any size of motif, such as with maximum motif length of 100 bp. Additionally, it has capability of finding approximate or imperfect tandem repeats.
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
14 A fasta file must be supplied for processing. Different subsets of STR may be selected for output. Perfect STRs are the default, but any combination
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
15 with one or more of pefect, approxinate and generic. Designed to build some of the microsatellite tracks from https://github.com/arangrhie/T2T-Polish/tree/master/pattern for the VGP.
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
16 remote_repository_url: https://github.com/fubar2/microsatbed
c5324bf8a52c Uploaded
fubar
parents:
diff changeset
17 type: unrestricted