(A) GC content variance around CO breakpoints (blue dots and line). Window 0 on the x-axis is the GC content at the breakpoints; negative and positive values represent the distance away from the breakpoints. Each of these windows is defined as a 2 kb sequence, and the GC content is calculated for each window. The red dots and line show one of the random GC content samples, simulated to match the number of CO breakpoints (blue dots and line). After 10,000 repeats, not one of the random samples is as extreme as the observed (blue line) (P < 0.0001). (B) Relationship between recombination and GC content. When the chromosomes are dissected into 10 kb non-overlapping regions, recombination rate (cM/Mb) and GC content can be obtained for each of them. After the bins are sorted by GC content, the windows are divided into 31 groups based on GC content (approximately 20% to 51%, 1% interval), and the average (and s.e.m.) recombination rates are reported for each group.
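The windowing and resampling described for panel A can be sketched as follows. This is a minimal illustration, not the procedure used to produce the figure: the function names are hypothetical, window 0 is assumed to be centred on the breakpoint, 'N' bases are assumed to be ignored, and the empirical P value is assumed to be one-sided.

```python
def gc_content(seq):
    """GC fraction among called bases (A/C/G/T); None if no base is called."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    called = gc + seq.count("A") + seq.count("T")
    return gc / called if called else None


def gc_profile(chrom, breakpoint, window=2000, flank=3):
    """GC content of windows at offsets -flank..+flank around a breakpoint.

    Assumption: window 0 is centred on the breakpoint, and neighbouring
    windows tile the sequence on either side without overlap.
    """
    half = window // 2
    profile = {}
    for offset in range(-flank, flank + 1):
        start = breakpoint - half + offset * window
        end = start + window
        if start < 0 or end > len(chrom):
            profile[offset] = None  # window runs off the chromosome
        else:
            profile[offset] = gc_content(chrom[start:end])
    return profile


def empirical_p(observed, simulated):
    """Fraction of simulated statistics at least as large as the observed one."""
    as_extreme = sum(1 for s in simulated if s >= observed)
    return as_extreme / len(simulated)
```

With 10,000 random samples and none as extreme as the observed value, `empirical_p` returns 0, which is reported as P < 1/10,000.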
In both we dissect the genome into 10 kb non-overlapping windows, of which there are 19,297. First, we ask about the raw correlation between GC% and cM/Mb for these windows, which as expected is positive and significant (Spearman's rho = 0.192; P < 10^-15). Second, we wish to know the average effect of a one-unit increase in either parameter on the other. Given the noise in the data (and given that current recombination rate need not imply the ancestral recombination rate) we approach this issue using a smoothing approach. We start by rank ordering all windows by GC content and then dividing them into blocks of 1% GC range, after excluding windows with more than 10% 'N'. The resulting plot is highly skewed by bins with very high GC (55% to 58%), as these have very few data points (Additional file 1: Figure S10E) (the same outliers likely affect the raw correlation too). Removing these three bins results in a more consistent trend (Additional file 1: Figure S10F). This also suggests that below circa 20% GC the recombination rate is zero (Additional file 1: Figure S10F). Removing those with GC < 20% and, more generally, any bins with fewer than 100 windows (all bins with GC < 20% have fewer than 100 windows) leaves 18,680 (96.8%) of the windows, these having a GC content between approximately 20% and 51%.
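The smoothing step can be sketched as below. This is an illustrative reconstruction under stated assumptions rather than the code used for the analysis: the function name `bin_by_gc` and the input layout (one tuple of GC percent, fraction of 'N', and cM/Mb per 10 kb window) are hypothetical, and each 1% block is assumed to run from an integer GC percent up to, but not including, the next integer.

```python
import math
from collections import defaultdict


def bin_by_gc(windows, max_n_frac=0.10, min_windows=100):
    """Group windows into 1% GC blocks and summarise recombination rate.

    windows: iterable of (gc_percent, n_fraction, rate_cM_per_Mb) tuples.
    Returns {gc_block: (mean_rate, sem, n_windows)} for blocks that keep
    at least `min_windows` windows after the 'N' filter.
    """
    blocks = defaultdict(list)
    for gc_percent, n_frac, rate in windows:
        if n_frac > max_n_frac:
            continue  # exclude windows with more than 10% 'N'
        blocks[math.floor(gc_percent)].append(rate)

    summary = {}
    for gc_block in sorted(blocks):
        rates = blocks[gc_block]
        n = len(rates)
        if n < min_windows:
            continue  # sparse blocks are too noisy to report
        mean = sum(rates) / n
        # Sample variance (n - 1 denominator), then standard error of the mean.
        var = sum((r - mean) ** 2 for r in rates) / (n - 1) if n > 1 else 0.0
        summary[gc_block] = (mean, math.sqrt(var / n), n)
    return summary
```

The `min_windows` threshold plays the role of the 100-window cut-off in the text; lowering it would bring back the sparse low-GC and high-GC blocks that skew the plot.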
Relationship between recombination and GC content
From this, we estimate that on average a 1 cM/Mb increase in recombination rate is associated with an increase in GC content of around 0.5%. Conversely, a 1% increase in GC content corresponds to an approximately 2 cM/Mb increase in recombination rate. We conclude that, given the apparent rarity of NCO gene conversion, at least in the bee genome, extrapolation from GC content to average crossing-over rate appears justifiable, at least for GC content over 20%. We note too that at high GC content the recombination rate can be over- or underestimated. This could reflect a discordance between current and past recombination rates.
These windows are used to construct Figure 4B, which presents a relatively noise-free (after smoothing) monotonic relationship between the two parameters.
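The two per-unit estimates are approximately reciprocal (1% GC per 2 cM/Mb is 0.5% GC per 1 cM/Mb), as expected when the smoothed relationship is close to linear. A minimal sketch of how such slopes could be read off binned means is given below. The least-squares fit is an assumption (the fitting method is not stated here), `ols_slope` is a hypothetical helper, and the numbers in the usage example are invented to be exactly linear; they are not the study's data.

```python
def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Invented, perfectly linear example values (NOT data from the study).
gc_percent = [20.0, 25.0, 30.0, 35.0, 40.0]
rate_cm_per_mb = [0.0, 10.0, 20.0, 30.0, 40.0]

# cM/Mb gained per 1% increase in GC content.
rate_per_gc = ols_slope(gc_percent, rate_cm_per_mb)
# % GC gained per 1 cM/Mb increase in recombination rate.
gc_per_rate = ols_slope(rate_cm_per_mb, gc_percent)
```

For noisy data the two regression slopes are not exact reciprocals; their product equals the squared correlation coefficient, so the near-reciprocal estimates are themselves consistent with a tight relationship after smoothing.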
Crossing-over rate is also associated with nucleotide diversity, gene density, and copy number variation regions (Figures S11-S13 in Additional file 1). Given our removal of hetSNPs from the analysis, the latter result is not trivially a CNV-related artifact. Our fine-scale analyses show a positive correlation between nucleotide diversity and recombination rate at all scales examined: 10, 100, 200, and 500 kb sequence windows (Figure S11 in Additional file 1). This bolsters previous analyses, one of which reported the trend but found it to be non-significant, while another reported a trend between population genetic estimates of recombination and genetic diversity. The trend accords with the notion that recombination reduces Hill-Robertson interference, thereby lowering rates of hitchhiking and background selection and so permitting greater diversity. We also see a strong negative correlation between recombination and gene density (Figure S12 in Additional file 1) and a strong positive correlation between recombination and the length of multi-copy regions at various window sizes (Figure S13 in Additional file 1). The correlation with CNVs is consistent with a role for non-allelic recombination in generating duplications and deletions via unequal crossing-over.
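A multi-scale comparison of this kind can be sketched as follows. This is an illustration only: it assumes that larger windows are formed by averaging consecutive 10 kb windows along one chromosome and that Spearman's rho is the statistic of interest (the text does not specify the statistic used for these supplementary figures). The helper names are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation; undefined (division by zero) if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(xs, ys):
    """Spearman's rho computed as the Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def coarsen(values, factor):
    """Average consecutive windows (factor=10 turns 10 kb into 100 kb windows).

    Trailing windows that do not fill a complete block are dropped.
    """
    return [sum(values[i:i + factor]) / factor
            for i in range(0, len(values) - factor + 1, factor)]
```

Applying `coarsen` with factors of 10, 20, and 50 to per-window diversity and recombination rate, then calling `spearman_rho` on each pair, would give the correlation at 100, 200, and 500 kb.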