annotate test-data/testout @ 0:4e8e2f836d0f draft default tip

planemo upload commit 232ce39054ce38be27c436a4cabec2800e14f988-dirty
author itaxotools
date Sun, 29 Jan 2023 16:25:48 +0000
parents
children
rev   line source
1 <h4>########################## PARAMETERS ######################</h4>
2 <p> input file: test-data/Pontohedyle_COI.fas </p>
3 <p> Coding gaps as characters: False </p>
4 <p> Maximum undetermined nucleotides allowed: 5 </p>
5 <p> Length of the alignment: 655 -> 655 </p>
6 <p> Indexing reference: Not set </p>
7 <p> Read in 27 sequences </p>
8 <p> query taxa: 2 - brasilensis, joni </p>
9 <p> Cutoff set as: 100 </p>
10 <p> Number of iterations of MolD set as: 10000 </p>
11 <p> Maximum length of raw mDNCs set as: 12 </p>
12 <p> Maximum length of refined mDNCs set as: 7 </p>
13 <p> simulated sequences up to 1 percent divergent from original ones </p>
14 <p> Maximum number of sequences modified per clade: 10 </p>
15 <p> scoring of the rDNCs; threshold in two consecutive runs: 75 </p>
16 <h4>########################### RESULTS ##########################</h4>
17 <h4>************** brasilensis **************</h4>
18 <p> Sequences analyzed: 4 </p>
19 <p> single nucleotide mDNCs*: 45 - 4: 'G', 16: 'C', 40: 'C', 44: 'G', 46: 'G', 68: 'G', 97: 'C', 101: 'C', 102: 'C', 167: 'G', 169: 'C', 170: 'T', 197: 'A', 202: 'G', 217: 'A', 227: 'G', 228: 'C', 239: 'T', 272: 'G', 287: 'A', 295: 'G', 310: 'C', 332: 'T', 357: 'A', 358: 'G', 365: 'T', 372: 'T', 387: 'C', 434: 'G', 456: 'G', 457: 'G', 467: 'G', 482: 'T', 483: 'G', 497: 'C', 499: 'T', 512: 'T', 518: 'A', 529: 'A', 535: 'G', 542: 'T', 543: 'C', 566: 'C', 619: 'G', 635: 'G' </p>
20 <p> mDNCs* retrieved: 1048; Sites involved: 100; Independent mDNCs**: 71 </p>
21 <p> Shortest retrieved mDNC*: [4: 'G'] </p>
22 <p> 1 rDNC_score (100): [4] - 52 </p>
23 <p> 2 rDNC_score (100): [4, 16] - 86 </p>
24 <p> 3 rDNC_score (100): [4, 16, 40] - 93 </p>
25 <p> Final rDNC***: [4: 'G', 16: 'C', 40: 'C'] </p>
26 <p> The DNA diagnosis for the taxon brasilensis is: 'G' in the site 4, 'C' in the site 16, 'C' in the site 40. </p>
27 <h4>************** joni **************</h4>
28 <p> Sequences analyzed: 3 </p>
29 <p> single nucleotide mDNCs*: 10 - 31: 'A', 85: 'G', 160: 'G', 283: 'G', 298: 'G', 451: 'G', 523: 'C', 526: 'A', 578: 'C', 580: 'T' </p>
30 <p> mDNCs* retrieved: 2662; Sites involved: 100; Independent mDNCs**: 50 </p>
31 <p> Shortest retrieved mDNC*: [31: 'A'] </p>
32 <p> 1 rDNC_score (100): [31] - 65 </p>
33 <p> 2 rDNC_score (100): [31, 85] - 97 </p>
34 <p> 3 rDNC_score (100): [31, 85, 160] - 99 </p>
35 <p> Final rDNC***: [31: 'A', 85: 'G', 160: 'G'] </p>
36 <p> The DNA diagnosis for the taxon joni is: 'A' in the site 31, 'G' in the site 85, 'G' in the site 160. </p>
37 <h4> ################################# EXPLANATIONS #################################### </h4>
38 <p> * mDNC (= minimal diagnostic nucleotide combination) is a combination of nucleotides at specified sites of the alignment, </p>
39 <p> unique for a query taxon. Therefore it is sufficient to differentiate a query taxon from all reference taxa in a dataset. </p>
40 <p> Because it comprises the minimal number of nucleotide sites necessary to differentiate a query taxon, any mutation in the mDNC in </p>
41 <p> a single specimen of the query taxon will automatically disqualify it as a diagnostic combination. </p>
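The mDNC definition above can be illustrated with a minimal sketch. It is not MolD's implementation: the function names `is_diagnostic` and `is_minimal`, the toy sequences, and the choice to treat sites as 1-based alignment columns are all assumptions made for illustration.

```python
from typing import Dict, List


def is_diagnostic(combination: Dict[int, str],
                  query_seqs: List[str],
                  reference_seqs: List[str]) -> bool:
    """True if every query sequence carries the combination and no reference does."""
    def carries(seq: str) -> bool:
        # Assumption: sites are 1-based alignment columns.
        return all(seq[site - 1] == base for site, base in combination.items())

    return (all(carries(s) for s in query_seqs)
            and not any(carries(s) for s in reference_seqs))


def is_minimal(combination: Dict[int, str],
               query_seqs: List[str],
               reference_seqs: List[str]) -> bool:
    """True if the combination is diagnostic and dropping any one site breaks it."""
    if not is_diagnostic(combination, query_seqs, reference_seqs):
        return False
    for site in combination:
        reduced = {s: b for s, b in combination.items() if s != site}
        if reduced and is_diagnostic(reduced, query_seqs, reference_seqs):
            return False
    return True


# Toy alignment (hypothetical data, not taken from Pontohedyle_COI.fas).
query = ["GATC", "GATT"]
reference = ["AATC", "GCTC"]
combo = {1: "G", 2: "A"}

print(is_diagnostic(combo, query, reference))  # both sites together separate the query
print(is_minimal(combo, query, reference))     # neither site alone is sufficient

# One mutated query specimen (site 2: A -> T) disqualifies the combination.
mutated_query = ["GATC", "GTTT"]
print(is_diagnostic(combo, mutated_query, reference))
```

The last call shows the fragility described above: a single substitution in one query specimen is enough to invalidate a minimal combination.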
42 <p> </p>
43 <p> ** two or more mDNCs are INDEPENDENT if they constitute non-overlapping sets of nucleotide sites. </p>
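The independence criterion above amounts to a pairwise disjointness test on site sets. The sketch below is illustrative only: `are_independent` and `greedy_independent_subset` are hypothetical helpers, and the greedy selection is one possible way to pick a non-overlapping subset, not necessarily how MolD arrives at its "Independent mDNCs" count.

```python
from itertools import combinations
from typing import Dict, Iterable, List


def are_independent(mdncs: Iterable[Dict[int, str]]) -> bool:
    """True if no two combinations share a nucleotide site."""
    site_sets = [set(m) for m in mdncs]  # set(dict) yields the site numbers
    return all(a.isdisjoint(b) for a, b in combinations(site_sets, 2))


def greedy_independent_subset(mdncs: List[Dict[int, str]]) -> List[Dict[int, str]]:
    """Pick combinations shortest-first, skipping any that reuse a site (assumed strategy)."""
    used_sites = set()
    chosen = []
    for mdnc in sorted(mdncs, key=len):
        if used_sites.isdisjoint(mdnc):
            chosen.append(mdnc)
            used_sites.update(mdnc)
    return chosen


# Hypothetical combinations, written in the same site: 'base' form as the report.
candidates = [{4: "G"}, {16: "C"}, {4: "G", 40: "C"}, {40: "C", 44: "G"}]

print(are_independent([{4: "G"}, {16: "C"}]))           # no shared site
print(are_independent([{4: "G"}, {4: "G", 40: "C"}]))   # both use site 4
print(greedy_independent_subset(candidates))
```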
44 <p> </p>
45 <p> *** rDNC (= robust/redundant diagnostic nucleotide combination) is a combination of nucleotides at specified sites of the alignment, </p>
46 <p> unique for a query taxon and (like an mDNC) sufficient to differentiate a query taxon from all reference taxa in a dataset. </p>
47 <p> However, an rDNC comprises more than the minimal necessary number of diagnostic sites, and is therefore robust to single nucleotide </p>
48 <p> replacements. Even if a mutation arises in one of the rDNC sites, the remaining ones will (with high probability) remain sufficient </p>
49 <p> to diagnose the query taxon. </p>
50 <h4> Final diagnosis corresponds to rDNC </h4>
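The redundancy that the rDNC explanation describes can be sketched as a deterministic leave-one-site-out check. This is a deliberate simplification and an assumption: the report's parameters indicate that MolD scores rDNCs against simulated, slightly diverged sequences, which this sketch does not attempt to reproduce. The names `is_diagnostic` and `tolerates_single_site_loss` and the toy sequences are hypothetical.

```python
from typing import Dict, List


def is_diagnostic(combination: Dict[int, str],
                  query_seqs: List[str],
                  reference_seqs: List[str]) -> bool:
    """True if every query sequence carries the combination and no reference does."""
    def carries(seq: str) -> bool:
        # Assumption: sites are 1-based alignment columns.
        return all(seq[site - 1] == base for site, base in combination.items())

    return (all(carries(s) for s in query_seqs)
            and not any(carries(s) for s in reference_seqs))


def tolerates_single_site_loss(combination: Dict[int, str],
                               query_seqs: List[str],
                               reference_seqs: List[str]) -> bool:
    """True if the combination stays diagnostic after ignoring any one of its sites."""
    if len(combination) < 2:
        return False  # a single site has no redundancy to fall back on
    if not is_diagnostic(combination, query_seqs, reference_seqs):
        return False
    for site in combination:
        remaining = {s: b for s, b in combination.items() if s != site}
        if not is_diagnostic(remaining, query_seqs, reference_seqs):
            return False
    return True


# Toy data (hypothetical): sites 1-3 each separate the query from the references.
query = ["GCCA", "GCCT"]
reference = ["ATTA", "ATTT"]
redundant = {1: "G", 2: "C", 3: "C"}
print(tolerates_single_site_loss(redundant, query, reference))

# A minimal two-site combination fails: neither site suffices on its own.
query_min = ["GATC", "GATT"]
reference_min = ["AATC", "GCTC"]
print(tolerates_single_site_loss({1: "G", 2: "A"}, query_min, reference_min))
```

A combination such as the reported `[4: 'G', 16: 'C', 40: 'C']` would pass a check of this kind if every two-site subset is still diagnostic in the alignment, but the scores in the report come from MolD's own scoring, not from this test.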