comparison ETn_example/DESCRIPTION.txt @ 18:565118df598a draft

Uploaded 20170531
author fabio
date Wed, 31 May 2017 11:07:22 -0400
parents 4c8a31fb202a
children
comparison
equal deleted inserted replaced
17:81faa1567178 18:565118df598a
1 This example contains two region datasets "ETn fixed", "Control" and one feature "Recombination hotspots content".
2 In particular, the region dataset "ETn fixed" contains 1296 genomic regions of 64 kb surrounding
3 fixed ETns elements (32-kb flanking sequences upstream and 32-kb flanking sequences downstream
4 of each element). The region dataset "Control" contains 1142 regions of 64 kb without elements,
5 used as control in the test. The regions are aligned around their center (i.e. around the ETn integration
6 sites).
7 Recombination hotspots measurements are associated to each "ETn fixed" and "Control" region. In
8 particular, this feature is measured in 1-kb windows, so that each region is associated to a recombination
9 hotspots curve made of 64 values. The measurement used is the feature content, i.e. the
10 fraction of the 1-kb window that is covered by recombination hotspots
11
12 Data have been collected and pre-processed by: R Campos-Sanchez, MA Cremona, A Pini, F
13 Chiaromonte and KD Makova (2016). Integration and fixation preferences of human and mouse
14 endogenous retroviruses uncovered with Functional Data Analysis. PLoS Computational Biology.
15 12(6): 1-41.
16 Fixed ETn positions come from: Y Zhang, IA Maksakova, L Gagnier, LN van de Lagemaat, DL
17 Mager (2008). Genome-wide assessments reveal extremely high levels of polymorphism of two
18 active families of mouse endogenous retroviral elements. PLoS Genetics. 4: e1000007.
19 Recombination hotspots data come from: H Brunschwig, L Levi, E Ben-David, RW Williams,
20 B Yakir, S Shifman (2012). Fine-scale maps of recombination rates and hotspots in the mouse
21 genome. Genetics. 191: 757-764.