Mercurial > repos > iuc > meme_meme
comparison test-data/meme_output_test1.txt @ 13:57e5d9382f36 draft
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/meme commit e2cf796f991cbe8c96e0cc5a0056b7255ac3ad6b
author | iuc |
---|---|
date | Thu, 17 May 2018 14:10:48 -0400 |
parents | |
children | 3f0dd362b755 |
comparison
equal
deleted
inserted
replaced
12:5585f04eb317 | 13:57e5d9382f36 |
---|---|
1 ******************************************************************************** | |
2 MEME - Motif discovery tool | |
3 ******************************************************************************** | |
4 MEME version 4.12.0 (Release date: Tue Jun 27 16:22:50 2017 -0700) | |
5 | |
6 For further information on how to interpret these results or to get | |
7 a copy of the MEME software please access http://meme-suite.org . | |
8 | |
9 This file may be used as input to the MAST algorithm for searching | |
10 sequence databases for matches to groups of motifs. MAST is available | |
11 for interactive use and downloading at http://meme-suite.org . | |
12 ******************************************************************************** | |
13 | |
14 | |
15 ******************************************************************************** | |
16 REFERENCE | |
17 ******************************************************************************** | |
18 If you use this program in your research, please cite: | |
19 | |
20 Timothy L. Bailey and Charles Elkan, | |
21 "Fitting a mixture model by expectation maximization to discover | |
22 motifs in biopolymers", Proceedings of the Second International | |
23 Conference on Intelligent Systems for Molecular Biology, pp. 28-36, | |
24 AAAI Press, Menlo Park, California, 1994. | |
25 ******************************************************************************** | |
26 | |
27 | |
28 ******************************************************************************** | |
29 TRAINING SET | |
30 ******************************************************************************** | |
31 DATAFILE= meme_input_1.fasta | |
32 ALPHABET= ACDEFGHIKLMNPQRSTVWY | |
33 Sequence name Weight Length Sequence name Weight Length | |
34 ------------- ------ ------ ------------- ------ ------ | |
35 chr21_19617074_19617124_ 1.0000 50 chr21_26934381_26934431_ 1.0000 50 | |
36 chr21_28217753_28217803_ 1.0000 50 chr21_31710037_31710087_ 1.0000 50 | |
37 chr21_31744582_31744632_ 1.0000 50 chr21_31768316_31768366_ 1.0000 50 | |
38 chr21_31914206_31914256_ 1.0000 50 chr21_31933633_31933683_ 1.0000 50 | |
39 chr21_31962741_31962791_ 1.0000 50 chr21_31964683_31964733_ 1.0000 50 | |
40 chr21_31973364_31973414_ 1.0000 50 chr21_31992870_31992920_ 1.0000 50 | |
41 chr21_32185595_32185645_ 1.0000 50 chr21_32202076_32202126_ 1.0000 50 | |
42 chr21_32253899_32253949_ 1.0000 50 chr21_32410820_32410870_ 1.0000 50 | |
43 chr21_36411748_36411798_ 1.0000 50 chr21_37838750_37838800_ 1.0000 50 | |
44 chr21_45705687_45705737_ 1.0000 50 chr21_45971413_45971463_ 1.0000 50 | |
45 chr21_45978668_45978718_ 1.0000 50 chr21_45993530_45993580_ 1.0000 50 | |
46 chr21_46020421_46020471_ 1.0000 50 chr21_46031920_46031970_ 1.0000 50 | |
47 chr21_46046964_46047014_ 1.0000 50 chr21_46057197_46057247_ 1.0000 50 | |
48 chr21_46086869_46086919_ 1.0000 50 chr21_46102103_46102153_ 1.0000 50 | |
49 chr21_47517957_47518007_ 1.0000 50 chr21_47575506_47575556_ 1.0000 50 | |
50 ******************************************************************************** | |
51 | |
52 ******************************************************************************** | |
53 COMMAND LINE SUMMARY | |
54 ******************************************************************************** | |
55 This information can also be useful in the event you wish to report a | |
56 problem with the MEME software. | |
57 | |
58 command: meme meme_input_1.fasta -o meme_test1_out -nostatus -maxsize 1000000 | |
59 | |
60 model: mod= zoops nmotifs= 1 evt= inf | |
61 object function= E-value of product of p-values | |
62 width: minw= 8 maxw= 50 | |
63 width: wg= 11 ws= 1 endgaps= yes | |
64 nsites: minsites= 2 maxsites= 30 wnsites= 0.8 | |
65 theta: spmap= pam spfuzz= 120 | |
66 global: substring= yes branching= no wbranch= no | |
67 em: prior= megap b= 7500 maxiter= 50 | |
68 distance= 1e-05 | |
69 data: n= 1500 N= 30 shuffle= -1 | |
70 | |
71 sample: seed= 0 ctfrac= -1 maxwords= -1 | |
72 Dirichlet mixture priors file: prior30.plib | |
73 Letter frequencies in dataset: | |
74 A 0.294 C 0.231 D 0.000 E 0.000 F 0.000 G 0.257 H 0.000 I 0.000 K 0.000 | |
75 L 0.000 M 0.000 N 0.000 P 0.000 Q 0.000 R 0.000 S 0.000 T 0.217 V 0.000 | |
76 W 0.000 Y 0.000 | |
77 Background letter frequencies (from dataset with add-one prior applied): | |
78 A 0.291 C 0.229 D 0.001 E 0.001 F 0.001 G 0.255 H 0.001 I 0.001 K 0.001 | |
79 L 0.001 M 0.001 N 0.001 P 0.001 Q 0.001 R 0.001 S 0.001 T 0.215 V 0.001 | |
80 W 0.001 Y 0.001 | |
81 ******************************************************************************** | |
82 | |
83 | |
84 ******************************************************************************** | |
85 MOTIF GGGGTATAAAA MEME-1 width = 11 sites = 25 llr = 239 E-value = 2.4e-011 | |
86 ******************************************************************************** | |
87 -------------------------------------------------------------------------------- | |
88 Motif GGGGTATAAAA MEME-1 Description | |
89 -------------------------------------------------------------------------------- | |
90 Simplified A 2323:a:a8a8 | |
91 pos.-specific C ::3:::::::: | |
92 probability D ::::::::::: | |
93 matrix E ::::::::::: | |
94 F ::::::::::: | |
95 G 7746::::::1 | |
96 H ::::::::::: | |
97 I ::::::::::: | |
98 K ::::::::::: | |
99 L ::::::::::: | |
100 M ::::::::::: | |
101 N ::::::::::: | |
102 P ::::::::::: | |
103 Q ::::::::::: | |
104 R ::::::::::: | |
105 S ::::::::::: | |
106 T 1:2:a:a:2:: | |
107 V ::::::::::: | |
108 W ::::::::::: | |
109 Y ::::::::::: | |
110 | |
111 bits 10.6 | |
112 9.5 | |
113 8.5 | |
114 7.4 | |
115 Relative 6.3 | |
116 Entropy 5.3 | |
117 (13.8 bits) 4.2 | |
118 3.2 | |
119 2.1 * ** | |
120 1.1 ** ******** | |
121 0.0 ----------- | |
122 | |
123 Multilevel GGGGTATAAAA | |
124 consensus AACA T | |
125 sequence | |
126 | |
127 | |
128 -------------------------------------------------------------------------------- | |
129 | |
130 -------------------------------------------------------------------------------- | |
131 Motif GGGGTATAAAA MEME-1 sites sorted by position p-value | |
132 -------------------------------------------------------------------------------- | |
133 Sequence name Start P-value Site | |
134 ------------- ----- --------- ----------- | |
135 chr21_46046964_46047014_ 13 1.06e-06 AAGGCCAGGA GGGGTATAAAA GCCTGAGAGC | |
136 chr21_46057197_46057247_ 37 3.41e-06 ACAGGCCCTG GGCATATAAAA GCC | |
137 chr21_45971413_45971463_ 10 3.41e-06 CAGGCCCTG GGCATATAAAA GCCCCAGCAG | |
138 chr21_31964683_31964733_ 14 3.41e-06 GATTCACTGA GGCATATAAAA GGCCCTCTGC | |
139 chr21_45993530_45993580_ 8 4.00e-06 CCAAGGA GGAGTATAAAA GCCCCACAAA | |
140 chr21_32202076_32202126_ 14 5.01e-06 CCACCAGCTT GAGGTATAAAA AGCCCTGTAC | |
141 chr21_46031920_46031970_ 16 6.06e-06 ATACCCAGGG AGGGTATAAAA CCTCAGCAGC | |
142 chr21_32410820_32410870_ 22 8.67e-06 AATCACTGAG GATGTATAAAA GTCCCAGGGA | |
143 chr21_32185595_32185645_ 19 8.67e-06 CACCAGAGCT GGGATATATAA AGAAGGTTCT | |
144 chr21_31992870_31992920_ 17 8.67e-06 CACTATTGAA GATGTATAAAA TTTCATTTGC | |
145 chr21_46020421_46020471_ 3 1.21e-05 GA GACATATAAAA GCCAACATCC | |
146 chr21_47517957_47518007_ 33 1.59e-05 CCGGCGGGGC GGGGTATAAAG GGGGCGG | |
147 chr21_45978668_45978718_ 5 1.59e-05 CAGA GGGGTATAAAG GTTCCGACCA | |
148 chr21_31914206_31914256_ 16 1.68e-05 CCCACTACTT AGAGTATAAAA TCATTCTGAG | |
149 chr21_32253899_32253949_ 20 2.03e-05 CACCAGCAAG GATATATAAAA GCTCAGGAGT | |
150 chr21_31744582_31744632_ 13 3.06e-05 CAGGTCTAAG AGCATATATAA CTTGGAGTCC | |
151 chr21_19617074_19617124_ 40 3.06e-05 CCTCGGGACG TGGGTATATAA | |
152 chr21_45705687_45705737_ 38 3.82e-05 CGTGGTCGCG GGGGTATAACA GC | |
153 chr21_31768316_31768366_ 1 3.82e-05 . AACGTATATAA ATGGTCCTGT | |
154 chr21_47575506_47575556_ 31 4.02e-05 GCTGCCGGTG AGCGTATAAAG GCCCTGGCG | |
155 chr21_26934381_26934431_ 28 5.52e-05 AGTCACAAGT GAGTTATAAAA GGGTCGCACG | |
156 chr21_31710037_31710087_ 15 5.94e-05 CCCAGGTTTC TGAGTATATAA TCGCCGCACC | |
157 chr21_36411748_36411798_ 23 6.78e-05 AGTTTCAGTT GGCATCtaaaa attatataac | |
158 chr21_31933633_31933683_ 3 2.08e-04 TC AGAGTATATAT AAATGTTCCT | |
159 chr21_31962741_31962791_ 14 4.05e-04 TATAACTCAG GTTGGATAAAA TAATTTGTAC | |
160 -------------------------------------------------------------------------------- | |
161 | |
162 -------------------------------------------------------------------------------- | |
163 Motif GGGGTATAAAA MEME-1 block diagrams | |
164 -------------------------------------------------------------------------------- | |
165 SEQUENCE NAME POSITION P-VALUE MOTIF DIAGRAM | |
166 ------------- ---------------- ------------- | |
167 chr21_46046964_46047014_ 1.1e-06 12_[1]_27 | |
168 chr21_46057197_46057247_ 3.4e-06 36_[1]_3 | |
169 chr21_45971413_45971463_ 3.4e-06 9_[1]_30 | |
170 chr21_31964683_31964733_ 3.4e-06 13_[1]_26 | |
171 chr21_45993530_45993580_ 4e-06 7_[1]_32 | |
172 chr21_32202076_32202126_ 5e-06 13_[1]_26 | |
173 chr21_46031920_46031970_ 6.1e-06 15_[1]_24 | |
174 chr21_32410820_32410870_ 8.7e-06 21_[1]_18 | |
175 chr21_32185595_32185645_ 8.7e-06 18_[1]_21 | |
176 chr21_31992870_31992920_ 8.7e-06 16_[1]_23 | |
177 chr21_46020421_46020471_ 1.2e-05 2_[1]_37 | |
178 chr21_47517957_47518007_ 1.6e-05 32_[1]_7 | |
179 chr21_45978668_45978718_ 1.6e-05 4_[1]_35 | |
180 chr21_31914206_31914256_ 1.7e-05 15_[1]_24 | |
181 chr21_32253899_32253949_ 2e-05 19_[1]_20 | |
182 chr21_31744582_31744632_ 3.1e-05 12_[1]_27 | |
183 chr21_19617074_19617124_ 3.1e-05 39_[1] | |
184 chr21_45705687_45705737_ 3.8e-05 37_[1]_2 | |
185 chr21_31768316_31768366_ 3.8e-05 [1]_39 | |
186 chr21_47575506_47575556_ 4e-05 30_[1]_9 | |
187 chr21_26934381_26934431_ 5.5e-05 27_[1]_12 | |
188 chr21_31710037_31710087_ 5.9e-05 14_[1]_25 | |
189 chr21_36411748_36411798_ 6.8e-05 22_[1]_17 | |
190 chr21_31933633_31933683_ 0.00021 2_[1]_37 | |
191 chr21_31962741_31962791_ 0.0004 13_[1]_26 | |
192 -------------------------------------------------------------------------------- | |
193 | |
194 -------------------------------------------------------------------------------- | |
195 Motif GGGGTATAAAA MEME-1 in BLOCKS format | |
196 -------------------------------------------------------------------------------- | |
197 BL MOTIF GGGGTATAAAA width=11 seqs=25 | |
198 chr21_46046964_46047014_ ( 13) GGGGTATAAAA 1 | |
199 chr21_46057197_46057247_ ( 37) GGCATATAAAA 1 | |
200 chr21_45971413_45971463_ ( 10) GGCATATAAAA 1 | |
201 chr21_31964683_31964733_ ( 14) GGCATATAAAA 1 | |
202 chr21_45993530_45993580_ ( 8) GGAGTATAAAA 1 | |
203 chr21_32202076_32202126_ ( 14) GAGGTATAAAA 1 | |
204 chr21_46031920_46031970_ ( 16) AGGGTATAAAA 1 | |
205 chr21_32410820_32410870_ ( 22) GATGTATAAAA 1 | |
206 chr21_32185595_32185645_ ( 19) GGGATATATAA 1 | |
207 chr21_31992870_31992920_ ( 17) GATGTATAAAA 1 | |
208 chr21_46020421_46020471_ ( 3) GACATATAAAA 1 | |
209 chr21_47517957_47518007_ ( 33) GGGGTATAAAG 1 | |
210 chr21_45978668_45978718_ ( 5) GGGGTATAAAG 1 | |
211 chr21_31914206_31914256_ ( 16) AGAGTATAAAA 1 | |
212 chr21_32253899_32253949_ ( 20) GATATATAAAA 1 | |
213 chr21_31744582_31744632_ ( 13) AGCATATATAA 1 | |
214 chr21_19617074_19617124_ ( 40) TGGGTATATAA 1 | |
215 chr21_45705687_45705737_ ( 38) GGGGTATAACA 1 | |
216 chr21_31768316_31768366_ ( 1) AACGTATATAA 1 | |
217 chr21_47575506_47575556_ ( 31) AGCGTATAAAG 1 | |
218 chr21_26934381_26934431_ ( 28) GAGTTATAAAA 1 | |
219 chr21_31710037_31710087_ ( 15) TGAGTATATAA 1 | |
220 chr21_36411748_36411798_ ( 23) GGCATCTAAAA 1 | |
221 chr21_31933633_31933683_ ( 3) AGAGTATATAT 1 | |
222 chr21_31962741_31962791_ ( 14) GTTGGATAAAA 1 | |
223 // | |
224 | |
225 -------------------------------------------------------------------------------- | |
226 | |
227 -------------------------------------------------------------------------------- | |
228 Motif GGGGTATAAAA MEME-1 position-specific scoring matrix | |
229 -------------------------------------------------------------------------------- | |
230 log-odds matrix: alength= 20 w= 11 n= 1200 bayes= 5.33554 E= 2.4e-011 | |
231 -32 -680 91 77 7 138 -20 55 64 107 11 150 142 72 87 396 -148 221 -140 -36 | |
232 -11 -680 89 76 7 137 -21 55 63 107 10 149 141 71 87 396 -239 220 -140 -36 | |
233 -79 41 4 21 -7 44 -62 42 -5 99 0 99 138 52 42 399 -46 223 -173 -68 | |
234 11 -677 48 47 -2 127 -43 46 27 101 3 124 138 60 62 397 -235 220 -160 -55 | |
235 -596 -820 12 -21 -53 -267 -74 37 16 44 -37 98 31 9 19 319 212 127 -193 -95 | |
236 165 -261 70 110 77 -521 -4 147 95 201 90 121 124 91 107 425 -527 314 -95 8 | |
237 -838 -990 -89 -149 -151 -841 -161 -117 -113 -66 -209 -68 -69 -129 -91 111 221 -55 -255 -173 | |
238 176 -858 -79 -103 -115 -717 -148 -95 -108 -17 -162 -61 -12 -95 -69 193 -737 52 -240 -153 | |
239 134 -686 0 16 -12 -553 -68 44 -8 96 -9 88 124 41 36 384 11 216 -177 -71 | |
240 165 -261 70 110 77 -521 -4 147 95 201 90 121 124 91 107 425 -527 314 -95 8 | |
241 147 -614 89 129 93 -121 12 160 113 217 108 144 144 111 125 447 -241 332 -81 22 | |
242 -------------------------------------------------------------------------------- | |
243 | |
244 -------------------------------------------------------------------------------- | |
245 Motif GGGGTATAAAA MEME-1 position-specific probability matrix | |
246 -------------------------------------------------------------------------------- | |
247 letter-probability matrix: alength= 20 w= 11 nsites= 25 E= 2.4e-011 | |
248 0.240000 0.000000 0.000000 0.000000 0.000000 0.680000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.080000 0.000000 0.000000 0.000000 | |
249 0.280000 0.000000 0.000000 0.000000 0.000000 0.680000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.040000 0.000000 0.000000 0.000000 | |
250 0.160000 0.320000 0.000000 0.000000 0.000000 0.360000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.160000 0.000000 0.000000 0.000000 | |
251 0.320000 0.000000 0.000000 0.000000 0.000000 0.640000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.040000 0.000000 0.000000 0.000000 | |
252 0.000000 0.000000 0.000000 0.000000 0.000000 0.040000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.960000 0.000000 0.000000 0.000000 | |
253 0.960000 0.040000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 | |
254 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 | |
255 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 | |
256 0.760000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.240000 0.000000 0.000000 0.000000 | |
257 0.960000 0.040000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 | |
258 0.840000 0.000000 0.000000 0.000000 0.000000 0.120000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.040000 0.000000 0.000000 0.000000 | |
259 -------------------------------------------------------------------------------- | |
260 | |
261 -------------------------------------------------------------------------------- | |
262 Motif GGGGTATAAAA MEME-1 regular expression | |
263 -------------------------------------------------------------------------------- | |
264 [GA][GA][GC][GA]TATA[AT]AA | |
265 -------------------------------------------------------------------------------- | |
266 | |
267 | |
268 | |
269 | |
270 Time 0.77 secs. | |
271 | |
272 ******************************************************************************** | |
273 | |
274 | |
275 ******************************************************************************** | |
276 SUMMARY OF MOTIFS | |
277 ******************************************************************************** | |
278 | |
279 -------------------------------------------------------------------------------- | |
280 Combined block diagrams: non-overlapping sites with p-value < 0.0001 | |
281 -------------------------------------------------------------------------------- | |
282 SEQUENCE NAME COMBINED P-VALUE MOTIF DIAGRAM | |
283 ------------- ---------------- ------------- | |
284 chr21_19617074_19617124_ 1.22e-03 39_[1(3.06e-05)] | |
285 chr21_26934381_26934431_ 2.21e-03 27_[1(5.52e-05)]_12 | |
286 chr21_28217753_28217803_ 7.29e-01 50 | |
287 chr21_31710037_31710087_ 2.37e-03 14_[1(5.94e-05)]_25 | |
288 chr21_31744582_31744632_ 1.22e-03 12_[1(3.06e-05)]_27 | |
289 chr21_31768316_31768366_ 1.53e-03 [1(3.82e-05)]_39 | |
290 chr21_31914206_31914256_ 6.70e-04 15_[1(1.68e-05)]_24 | |
291 chr21_31933633_31933683_ 1.81e-03 4_[1(4.54e-05)]_35 | |
292 chr21_31962741_31962791_ 1.61e-02 50 | |
293 chr21_31964683_31964733_ 1.36e-04 13_[1(3.41e-06)]_26 | |
294 chr21_31973364_31973414_ 1.99e-01 50 | |
295 chr21_31992870_31992920_ 3.47e-04 16_[1(8.67e-06)]_23 | |
296 chr21_32185595_32185645_ 3.47e-04 18_[1(8.67e-06)]_21 | |
297 chr21_32202076_32202126_ 2.01e-04 13_[1(5.01e-06)]_26 | |
298 chr21_32253899_32253949_ 8.11e-04 19_[1(2.03e-05)]_20 | |
299 chr21_32410820_32410870_ 3.47e-04 21_[1(8.67e-06)]_18 | |
300 chr21_36411748_36411798_ 2.71e-03 22_[1(6.78e-05)]_17 | |
301 chr21_37838750_37838800_ 8.23e-02 50 | |
302 chr21_45705687_45705737_ 1.53e-03 37_[1(3.82e-05)]_2 | |
303 chr21_45971413_45971463_ 1.36e-04 9_[1(3.41e-06)]_30 | |
304 chr21_45978668_45978718_ 6.37e-04 4_[1(1.59e-05)]_35 | |
305 chr21_45993530_45993580_ 1.60e-04 7_[1(4.00e-06)]_32 | |
306 chr21_46020421_46020471_ 4.83e-04 2_[1(1.21e-05)]_37 | |
307 chr21_46031920_46031970_ 2.43e-04 15_[1(6.06e-06)]_24 | |
308 chr21_46046964_46047014_ 4.26e-05 12_[1(1.06e-06)]_27 | |
309 chr21_46057197_46057247_ 1.36e-04 36_[1(3.41e-06)]_3 | |
310 chr21_46086869_46086919_ 4.30e-02 50 | |
311 chr21_46102103_46102153_ 4.30e-02 50 | |
312 chr21_47517957_47518007_ 6.37e-04 32_[1(1.59e-05)]_7 | |
313 chr21_47575506_47575556_ 1.61e-03 30_[1(4.02e-05)]_9 | |
314 -------------------------------------------------------------------------------- | |
315 | |
316 ******************************************************************************** | |
317 | |
318 | |
319 ******************************************************************************** | |
320 Stopped because requested number of motifs (1) found. | |
321 ******************************************************************************** | |
322 | |
323 CPU: ThinkPad-T450s | |
324 | |
325 ******************************************************************************** |