annotate all_fasta.loc.sample @ 2:ba370cca3857 draft
planemo upload commit c2e2760ae56ed7d73f7ada10c105bf0e9bd80480
author | cpt |
---|---|
date | Sun, 23 Jul 2023 20:59:40 +0000 |
#This file lists the locations and dbkeys of all the fasta files
#under the "genome" directory (a directory that contains a directory
#for each build). The script extract_fasta.py will generate the file
#all_fasta.loc. This file has the format (white space characters are
#TAB characters):
#
#<unique_build_id>	<dbkey>	<display_name>	<file_path>
#
#So, all_fasta.loc could look something like this:
#
#apiMel3	apiMel3	Honeybee (Apis mellifera): apiMel3	/path/to/genome/apiMel3/apiMel3.fa
#hg19canon	hg19	Human (Homo sapiens): hg19 Canonical	/path/to/genome/hg19/hg19canon.fa
#hg19full	hg19	Human (Homo sapiens): hg19 Full	/path/to/genome/hg19/hg19full.fa
#
#Your all_fasta.loc file should contain an entry for each individual
#fasta file. So there may be multiple fasta files for a single build,
#such as with hg19 above.
#
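
The four-column, TAB-separated layout described in the comments above can be read with a few lines of Python. The sketch below is only an illustration, not Galaxy's own loader: the names `FastaEntry`, `parse_all_fasta_loc`, and `paths_by_dbkey` are hypothetical, and it assumes exactly one TAB between fields, with `#` lines and blank lines ignored.

```python
from typing import Dict, Iterable, List, NamedTuple


class FastaEntry(NamedTuple):
    """One data row of all_fasta.loc (hypothetical helper type)."""
    unique_build_id: str
    dbkey: str
    display_name: str
    file_path: str


def parse_all_fasta_loc(lines: Iterable[str]) -> List[FastaEntry]:
    """Parse TAB-separated rows, skipping comments and blank lines.

    Assumption: fields are separated by exactly one TAB character.
    """
    entries = []
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # comment or empty line
        fields = [field.strip() for field in line.split("\t")]
        if len(fields) != 4:
            raise ValueError(
                f"line {number}: expected 4 TAB-separated fields, got {len(fields)}"
            )
        entries.append(FastaEntry(*fields))
    return entries


def paths_by_dbkey(entries: Iterable[FastaEntry]) -> Dict[str, List[str]]:
    """Group fasta file paths by dbkey; one build may have several files."""
    grouped: Dict[str, List[str]] = {}
    for entry in entries:
        grouped.setdefault(entry.dbkey, []).append(entry.file_path)
    return grouped


if __name__ == "__main__":
    sample = [
        "#<unique_build_id>\t<dbkey>\t<display_name>\t<file_path>",
        "apiMel3\tapiMel3\tHoneybee (Apis mellifera): apiMel3\t/path/to/genome/apiMel3/apiMel3.fa",
        "hg19canon\thg19\tHuman (Homo sapiens): hg19 Canonical\t/path/to/genome/hg19/hg19canon.fa",
        "hg19full\thg19\tHuman (Homo sapiens): hg19 Full\t/path/to/genome/hg19/hg19full.fa",
    ]
    for dbkey, paths in paths_by_dbkey(parse_all_fasta_loc(sample)).items():
        print(dbkey, paths)
```

Splitting on TAB alone, rather than on any whitespace, matters here because the display name itself contains spaces, as in "Human (Homo sapiens): hg19 Full".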