A Medley of Potpourri

Thursday, December 10, 2020

Genetic history of Europe

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Genetic_history_of_Europe

Admixture plots of modern West Eurasian populations based on seven components:
South/West European North/East European Caucasus
Middle Eastern South Asian East Asian North African/Sub-Saharan African

The European genetic structure (based on 273,464 SNPs). Three levels of structure as revealed by PC analysis are shown: A) inter-continental; B) intra-continental; and C) inside a single country (Estonia), where median values of the PC1&2 are shown. D) European map illustrating the origin of sample and population size. CEU – Utah residents with ancestry from Northern and Western Europe, CHB – Han Chinese from Beijing, JPT – Japanese from Tokyo, and YRI – Yoruba from Ibadan, Nigeria.

The genetic history of Europe since the Upper Paleolithic is inseparable from that of wider Western Eurasia. By about 50-40,000 years ago (50-40 ka) a basal West Eurasian lineage had emerged (alongside a separate East Asian lineage) out of the undifferentiated "non-African" lineage of 70-50 ka. Both basal East and West Eurasians acquired Neanderthal admixture in Europe and Asia.

European early modern humans (EEMH) lineages between 40 and 26 ka (Aurignacian) were still part of a large Western Eurasian "meta-population", related to Central and Western Asian populations.

Divergence into genetically distinct sub-populations within Western Eurasia is a result of increased selection pressure and founder effects during the Last Glacial Maximum (LGM, Gravettian). By the end of the LGM, after 20 ka, A Western European lineage, dubbed West European Hunter-Gatherer (WHG) emerges from the Solutrean refugium during the European Mesolithic. These mesolithic hunter-gatherer cultures are substantially replaced in the Neolithic Revolution by the arrival of Early European Farmers (EEF) lineages derived from mesolithic populations of West Asia (Anatolia and the Caucasus). In the European Bronze Age, there were again substantial population replacements in parts of Europe by the intrusion of Ancient North Eurasian (ANE) lineages from the Pontic–Caspian steppes. These Bronze Age population replacements are associated with the Beaker culture archaeologically and with the Indo-European expansion linguistically.

As a result of the population movements during the Mesolithic to Bronze Age, modern European populations are distinguished by differences in WHG, EEF and ANE ancestry. Admixture rates varied geographically; in the late Neolithic, WHG ancestry in farmers in Hungary was at around 10%, in Germany around 25% and in Iberia as high as 50%. The contribution of EEF is more significant in Mediterranean Europe, and declines towards northern and northeastern Europe, where WHG ancestry is stronger; the Sardinians are considered to be the closest European group to the population of the EEF. ANE ancestry is found throughout Europe, with maxima of about 20% found in Baltic people and Finns.

Ethnogenesis of the modern ethnic groups of Europe in the historical period is associated with numerous admixture events, primarily those associated with the Roman, Germanic, Norse, Slavic, Arab and Turkish expansions.

Research into the genetic history of Europe became possible in the second half of the 20th century, but did not yield results with high resolution before the 1990s. In the 1990s, preliminary results became possible, but they remained mostly limited to studies of mitochondrial and Y-chromosomal lineages. Autosomal DNA became more easily accessible in the 2000s, and since the mid-2010s, results of previously unattainable resolution, many of them based on full-genome analysis of ancient DNA, have been published at an accelerated pace.

Prehistory

Distribution of the Neanderthals, and main sites

The prehistory of the European peoples can be traced by the examination of archaeological sites, linguistic studies and by the examination of the DNA of the people who live in Europe or from ancient DNA. The research continues and so theories rise and fall. Although it is possible to track migrations of people across Europe using founder analysis of DNA, most information on these movements comes from archaeology.

It is important to note that the settlement of Europe did not occur in discrete migrations, as might appear to be suggested. Rather, the settlement process was complex and "likely to have occurred in multiple waves from the east and to have been subsequently obscured by millennia of recurrent gene flow".

Due to natural selection, the percentage of Neanderthal DNA in ancient Europeans gradually decreased over time. From 45,000 BP to 7,000 BP, the percentage dropped from around 3–6% to 2%. The removal of Neanderthal-derived alleles occurred more frequently around genes than other parts of the genome.

Palaeolithic

Neanderthals inhabited much of Europe and western Asia from as far back as 130,000 years ago. They existed in Europe as late as 30,000 years ago. They were eventually replaced by anatomically modern humans (AMH; sometimes known as Cro-Magnons), who began to appear in Europe circa 40,000 years ago. Given that the two hominid species likely coexisted in Europe, anthropologists have long wondered whether the two interacted. The question was resolved only in 2010, when it was established that Eurasian populations exhibit Neanderthal admixture, estimated at 1.5–2.1% on average. The question now became whether this admixture had taken place in Europe, or rather in the Levant, prior to AMH migration into Europe.

There has also been speculation about the inheritance of specific genes from Neanderthals. For example, one MAPT locus 17q21.3 which is split into deep genetic lineages H1 and H2. Since the H2 lineage seems restricted to European populations, several authors had argued for inheritance from Neanderthals beginning in 2005. However the preliminary results from the sequencing of the full Neanderthal Genome at that time (2009), failed to uncover evidence of interbreeding between Neanderthals and modern humans. By 2010, findings by Svante Pääbo (Max Planck Institute for Evolutionary Anthropology at Leipzig, Germany), Richard E. Green (University of California, Santa Cruz), and David Reich (Harvard Medical School), comparing the genetic material from the bones of three Neanderthals with that from five modern humans, did show a relationship between Neanderthals and modern people outside Africa.

Upper Paleolithic

Replacement of Neanderthals by early modern humans

It is thought that modern humans began to inhabit Europe during the Upper Paleolithic about 40,000 years ago. Some evidence shows the spread of the Aurignacian culture.

From a purely patrilineal, Y-chromosome perspective, it is possible that the old Haplogroup C1a2, F and/or E may be those with the oldest presence in Europe. They have been found in some very old human remains in Europe. However, other haplogroup are far more common among living European males.

Haplogroup I (M170), which is now relatively common and widespread within Europe, may represent a Palaeolithic marker – its age has been estimated at ~ 22,000 BP. While it is now concentrated in Europe, it probably arose in a male from the Middle East or Caucasus, or their near descendants, c. 20–25,000 years BP, when it diverged from its immediate ancestor, haplogroup IJ. At about this time, an Upper Palaeolithic culture also appeared, known as the Gravettian.

Earlier research into Y-DNA had instead focused on haplogroup R1 (M173): the most populous lineage among living European males; R1 was also believed to have emerged ~ 40,000 BP in Central Asia. However, it is now estimated that R1 emerged substantially more recently: a 2008 study dated the most recent common ancestor of haplogroup IJ to 38,500 and haplogroup R1 to 18,000 BP. This suggested that haplogroup IJ colonists formed the first wave and haplogroup R1 arrived much later.

Thus the genetic data suggests that, at least from the perspective of patrilineal ancestry, separate groups of modern humans took two routes into Europe: from the Middle East via the Balkans and another from Central Asia via the Eurasian Steppe, to the north of the Black Sea.

Martin Richards et al. found that 15–40% of extant mtDNA lineages trace back to the Palaeolithic migrations (depending on whether one allows for multiple founder events). MtDNA haplogroup U5, dated to be ~ 40–50 kYa, arrived during the first early upper Palaeolithic colonisation. Individually, it accounts for 5–15% of total mtDNA lineages. Middle U.P. movements are marked by the haplogroups HV, I and U4. HV split into Pre-V (around 26,000 years old) and the larger branch H, both of which spread over Europe, possibly via Gravettian contacts.

Haplogroup H accounts for about half the gene lines in Europe, with many subgroups. The above mtDNA lineages or their precursors, are most likely to have arrived into Europe via the Middle East. This contrasts with Y DNA evidence, whereby some 50%-plus of male lineages are characterised by the R1 superfamily, which is of possible central Asian origin. Ornella Semino postulates that these differences "may be due in part to the apparent more recent molecular age of Y chromosomes relative to other loci, suggesting more rapid replacement of previous Y chromosomes. Gender-based differential migratory demographic behaviors will also influence the observed patterns of mtDNA and Y variation".

Last Glacial Maximum

European LGM refuges, 20 kya

Solutrean and Proto-Solutrean Cultures

Epi-Gravettian Culture

The Last Glacial Maximum ("LGM") started c. 30 ka BCE, at the end of MIS 3, leading to a depopulation of Northern Europe. According to the classical model, people took refuge in climatic sanctuaries (or refugia) as follows:

Northern Iberia and Southwest France, together making up the "Franco-Cantabrian" refugium
The Balkans
Ukraine and more generally the northern coast of the Black Sea
Italy.

This event decreased the overall genetic diversity in Europe, a "result of drift, consistent with an inferred population bottleneck during the Last Glacial Maximum". As the glaciers receded from about 16,000–13,000 years ago, Europe began to be slowly repopulated by people from refugia, leaving genetic signatures.

Some Y haplogroup I clades appear to have diverged from their parental haplogroups sometime during or shortly after the LGM. Haplogroup I2 is prevalent in the western Balkans, as well as the rest of southeastern and central-eastern Europe in more moderate frequencies. Its frequency drops rapidly in central Europe, suggesting that the survivors bearing I2 lineages expanded predominantly through south-eastern and central-eastern Europe.

Cinnioglu sees evidence for the existence of an Anatolian refuge, which also harboured Hg R1b1b2. Today, R1b dominates the y chromosome landscape of western Europe, including the British Isles, suggesting that there could have been large population composition changes based on migrations after the LGM.

Semino, Passarino and Pericic place the origins of haplogroup R1a within the Ukrainian ice-age refuge. Its current distribution in eastern Europe and parts of Scandinavia are in part reflective of a re-peopling of Europe from the southern Russian/Ukrainian steppes after the Late Glacial Maximum.

From an mtDNA perspective, Richards et al. found that the majority of mtDNA diversity in Europe is accounted for by post-glacial re-expansions during the late upper Palaeolithic/ Mesolithic. "The regional analyses lend some support to the suggestion that much of western and central Europe was repopulated largely from the southwest when the climate improved. The lineages involved include much of the most common haplogroup, H, as well as much of K, T, W, and X." The study could not determine whether there were new migrations of mtDNA lineages from the near east during this period; a significant input was deemed unlikely.

The alternative model of more refugees was discussed by Bilton et al.

From a study of 51 individuals, researchers were able to identify five separate genetic clusters of ancient Europeans during the LGM: the Věstonice Cluster (34,000–26,000 years ago), associated with the Gravettian culture; the Mal'ta Cluster (24,000–17,000), associated with the Mal'ta-Buret' culture, the El Mirón Cluster (19,000–14,000 years ago), associated with the Magdalenian culture; the Villabruna Cluster (14,000–7,000 years ago) and the Satsurblia Cluster (13,000 to 10,000 years ago).

From around 37,000 years ago, all ancient Europeans began to share some ancestry with modern Europeans. This founding population is represented by GoyetQ116-1, a 35,000 year old specimen from Belgium. This lineage disappears from the record and is not found again until 19,000 BP in Spain at El Mirón, which shows strong affinities to GoyetQ116-1. During this interval, the distinct Věstonice Cluster is predominant in Europe, even at Goyet. The re-expansion of the El Mirón Cluster coincided with warming temperatures following the retreat of the glaciers during the Last Glacial Maximum. From 37,000 to 14,000 years ago, the population of Europe consisted of an isolated population descended from a founding population that didn't interbreed significantly with other populations.

Mesolithic

Mesolithic (post-LGM) populations had diverged significantly due to their relative isolation over several millennia, due to the harsh selection pressures during the LGM, and due to the founder effects caused by the rapid expansion from LGM refugia in the beginning Mesolithic. By the end of the LGM, around 19 to 11 ka, the familiar varieties of Eurasian phenotypes had emerged. However, the lineage of Mesolithic hunter-gatherers of Western Europe (WHG) does not survive as a majority contribution in any modern population. They were most likely blue eyed, and retained the dark skin pigmentation of pre-LGM EEMH. The HERC2 and OCA2 variations for blue eyes are derived from the WHG lineage were also found in the Yamnaya people.

Around 14,000 years ago, the Villabruna Cluster shifted away from GoyetQ116-1 affinity and started to show more affinity with the Near East, a shift which coincided with the warming temperatures of the Bølling-Allerød interstadial. This genetic shift shows that Near East populations had probably already begun moving into Europe during the end of the Upper Paleolithic, about 6,000 years earlier than previously thought, before the introduction of farming. A few specimens from the Villabruna Cluster also show genetic affinities for East Asians that are derived from gene flow. The HERC2 variation for blue eyes first appears around 13,000 to 14,000 years ago in Italy and the Caucasus. The light skin pigmentation characteristic of modern Europeans is estimated to have spread across Europe in a "selective sweep" during the Mesolithic (19 to 11 ka). The associated TYRP1 SLC24A5 and SLC45A2 alleles emerge around 19 ka, still during the LGM, most likely in the Caucasus.

Neolithic

Simplified model for the demographic history of Europeans during the Neolithic period in the introduction of agriculture

Ancient European Neolithic farmers were genetically closest to modern Near-Eastern/ Anatolian populations. Genetic matrilineal distances between European Neolithic Linear Pottery Culture populations (5,500–4,900 calibrated BC) and modern Western Eurasian populations.

A big cline in genetic variation that has long been recognised in Europe seems to show important dispersals from the direction of the Middle East. This has often been linked to the spread of farming technology during the Neolithic, which has been argued to be one of the most important periods in determining modern European genetic diversity.

The Neolithic started with the introduction of farming, beginning in SE Europe approximately 10,000–3000 BCE, and extending into NW Europe between 4500–1700 BCE. During this era, the Neolithic revolution led to drastic economic as well as socio-cultural changes in Europe and this is also thought to have had a big effect on Europe's genetic diversity, especially concerning genetic lineages entering Europe from the Middle East into the Balkans. There were several phases of this period:

In a late European Mesolithic prelude to the Neolithic, it appears that Near Eastern peoples from areas that already had farming, and who also had sea-faring technology, had a transient presence in Greece (for example at Franchthi Cave).
There is consensus that agricultural technology and the main breeds of animals and plants which are farmed entered Europe from somewhere in the area of the Fertile Crescent and specifically the Levant region from the Sinai to Southern Anatolia. (Less certainly, this agricultural revolution is sometimes argued to have in turn been partly triggered by movements of people and technology coming across the Sinai from Africa.) For more see Fertile Crescent: Cosmopolitan diffusion.
A later stage of the Neolithic, the so-called Pottery Neolithic, saw an introduction of pottery into the Levant, Balkans and Southern Italy (it had been present in the area of modern Sudan for some time before it is found in the Eastern Mediterranean, but it is thought to have developed independently), and this may have also been a period of cultural transfer from the Levant into the Balkans.

An important issue regarding the genetic impact of neolithic technologies in Europe is the manner by which they were transferred into Europe. Farming was introduced by a significant migration of farmers from the Near East (Cavalli-Sforza's biological demic diffusion model) or a "cultural diffusion" or a combination of the two, and population geneticists have tried to clarify whether any genetic signatures of Near Eastern origin correspond to the expansion routes postulated by the archaeological evidence.

Martin Richards estimated that only 11% of European mtDNA is due to immigration in this period, suggesting that farming was spread primarily due to being adopted by indigenous Mesolithic populations, rather than due to immigration from Near East. Gene flow from SE to NW Europe seems to have continued in the Neolithic, the percentage significantly declining towards the British Isles.

Classical genetics also suggested that the largest admixture to the European Paleolithic/Mesolithic stock was due to the Neolithic revolution of the 7th to 5th millennia BCE. Three main mtDNA gene groups have been identified as contributing Neolithic entrants into Europe: J, T1 and U3 (in that order of importance). With others, they amount up to around 20% of the gene pool.

In 2000, Semino's study on Y DNA revealed the presence of haplotypes belonging to the large clade E1b1b1 (E-M35). These were predominantly found in the southern Balkans, southern Italy and parts of Iberia. Semino connected this pattern, along with J haplogroup subclades, to be the Y-DNA component of Cavalli-Sforza's Neolithic demic-diffusion of farmers from the Near East. Rosser et al. rather saw it as a (direct) 'North African component' in European genealogy, although they did not propose a timing and mechanism to account for it. Underhill and Kivisild (2007) also described E1b1b as representing a late-Pleistocene migration from Africa to Europe over the Sinai Peninsula in Egypt, evidence for which does not show up in mitochondrial DNA.

The modern distribution of Y-DNA haplogroups in each European country

Concerning timing the distribution and diversity of V13 however, Battaglia et al. (2008) proposed an earlier movement whereby the E-M78* lineage ancestral to all modern E-V13 men moved rapidly out of a Southern Egyptian homeland and arrived in Europe with only Mesolithic technologies. They then suggest that the E-V13 sub-clade of E-M78 only expanded subsequently as native Balkan 'foragers-cum-farmers' adopted Neolithic technologies from the Near East. They propose that the first major dispersal of E-V13 from the Balkans may have been in the direction of the Adriatic Sea with the Neolithic Impressed Ware culture often referred to as Impressa or Cardial. Peričic et al. (2005), rather propose that the main route of E-V13 spread was along the Vardar-Morava-Danube river 'highway' system.

In contrast to Battaglia, Cruciani et al. (2007) tentatively suggested (i) a different point where the V13 mutation happened on its way from Egypt to the Balkans via the Middle East, and (ii) a later dispersal time. The authors proposed that the V13 mutation first appeared in western Asia, where it is found in low but significant frequencies, whence it entered the Balkans sometime after 11 kYa. It later experienced a rapid dispersal which he dated to c. 5300 years ago in Europe, coinciding with the Balkan Bronze Age. Like Peričic et al. they consider that "the dispersion of the E-V13 and J-M12 haplogroups seems to have mainly followed the river waterways connecting the southern Balkans to north-central Europe".

More recently, Lacan et al. (2011) announced that a 7000-year-old skeleton in a Neolithic context in a Spanish funeral cave, was an E-V13 man. (The other specimens tested from the same site were in haplogroup G2a, which has been found in Neolithic contexts throughout Europe.) Using 7 STR markers, this specimen was identified as being similar to modern individuals tested in Albania, Bosnia, Greece, Corsica, and Provence. The authors therefore proposed that, whether or not the modern distribution of E-V13 of today is a result of more recent events, E-V13 was already in Europe within the Neolithic, carried by early farmers from the Eastern Mediterranean to the Western Mediterranean, much earlier than the Bronze Age. This supports the proposals of Battaglia et al. rather than Cruciani et al. at least concerning earliest European dispersals, but E-V13 may have dispersed more than once. Even more recent than the Bronze Age, it has also been proposed that modern E-V13's modern distribution in Europe is at least partly caused by Roman era movements of people.

The migration of Neolithic farmers into Europe brought along several new adaptations. The variation for light skin colour was introduced to Europe by the neolithic farmers. After the arrival of the neolithic farmers, a SLC22A4 mutation was selected for, a mutation which probably arose to deal with ergothioneine deficiency but increases the risk of ulcerative colitis, coeliac disease, and irritable bowel syndrome.

Bronze Age

The Bronze Age saw the development of long-distance trading networks, particularly along the Atlantic Coast and in the Danube valley. There was migration from Norway to Orkney and Shetland in this period (and to a lesser extent to mainland Scotland and Ireland). There was also migration from Germany to eastern England. Martin Richards estimated that there was about 4% mtDNA immigration to Europe in the Bronze Age.

Scheme of Indo-European migrations from ca. 4000 to 1000 BC according to the Kurgan hypothesis

Another theory about the origin of the Indo-European language centres around a hypothetical Proto-Indo-European people, who, according to the Kurgan hypothesis, can be traced to north of the Black and Caspian Seas at about 4500 BCE. They domesticated the horse and possibly invented the wooden disk wheel, and are considered to have spread their culture and genes across Europe. The Y haplogroup R1a is a proposed marker of these "Kurgan" genes, as is the Y Haplogroup R1b, although these haplogroups as a whole may be much older than the language family.

In the far north, carriers of the Y-haplogroup N arrived to Europe from Siberia, eventually expanding as far as Finland, though the specific timing of their arrival is uncertain. The most common North European subclade N1c1 is estimated to be around 8,000 years old. There is evidence of human settlement in Finland dating back to 8500 BCE, linked with the Kunda culture and its putative ancestor, the Swiderian culture, but the latter is thought to have a European origin. The geographical spread of haplogroup N in Europe is well aligned with the Pit–Comb Ware culture, whose emergence is commonly dated c. 4200 BCE, and with the distribution of Uralic languages. Mitochondrial DNA studies of Sami people, haplogroup U5 are consistent with multiple migrations to Scandinavia from Volga-Ural region, starting 6,000 to 7,000 years before present.

The relationship between roles of European and Asian colonists in the prehistory of Finland is a point of some contention, and some scholars insist that Finns are "predominantly Eastern European and made up of people who trekked north from the Ukrainian refuge during the Ice Age". Farther east, the issue is less contentious. Haplogroup N carriers account for a significant part of all non-Slavic ethnic groups in northern Russia, including 37% of Karelians, 35% of Komi people (65% according to another study), 67% of Mari people, as many as 98% of Nenets people, 94% of Nganasans, and 86% to 94% of Yakuts.

The Yamnaya component contains partial ancestry from an Ancient North Eurasian component first identified in Mal'ta. According to Iosif Lazaridis, "the Ancient North Eurasian ancestry is proportionally the smallest component everywhere in Europe, never more than 20 percent, but we find it in nearly every European group we’ve studied." This genetic component does not come directly from the Mal'ta lineage itself, but a related lineage that separated from the Mal'ta lineage.

Up to a half of the Yamnaya component may have come from a Caucasus hunter-gatherer strand. On November 16, 2015, in a study published in the journal Nature Communications, geneticists announced that they had found a new fourth ancestral "tribe" or "strand" which had contributed to the modern European gene pool. They analysed genomes from two hunter-gatherers from Georgia which were 13,300 and 9,700 years old, and found that these Caucasus hunter-gatherers were probably the source of the farmer-like DNA in the Yamna. According to co-author Dr Andrea Manica of the University of Cambridge: "The question of where the Yamnaya come from has been something of a mystery up to now....we can now answer that as we've found that their genetic make-up is a mix of Eastern European Hunter-Gatherers and a population from this pocket of Caucasus hunter-gatherers who weathered much of the last Ice Age in apparent isolation."

According to Lazaridis et al. (2016), a population related to the people of the Chalcolithic Iran contributed to roughly half of the ancestry of Yamnaya populations of the Pontic–Caspian steppe. These Iranian Chalcolithic people were a mixture of "the Neolithic people of western Iran, the Levant, and Caucasus Hunter Gatherers."

The genetic variations for lactase persistence and greater height came with the Yamnaya people. The derived allele of the KITLG gene (SNP rs12821256) that is associated with – and likely causal for – blond hair in Europeans is found in populations with Eastern but not Western Hunter-Gatherers ancestry, suggesting that its origin is in the Ancient North Eurasian (ANE) population and may have been spread in Europe by individuals with steppe ancestry. Consistent with this, the earliest known individual with the derived allele is an ANE individual from the Late Upper Paleolithic Afontova Gora archaeological complex in central Siberia.

Recent history

Overview map of recent (1st to 17th centuries AD) admixture events in Europe

During the period of the Roman Empire, historical sources show that there were many movements of people around Europe, both within and outside the Empire. Historic sources sometimes cite instances of genocide inflicted by the Romans upon rebellious provincial tribes. If this did in fact occur, it would have been limited given that modern populations show considerable genetic continuity in their respective regions. The process of 'Romanisation' appears to have been accomplished by the colonisation of provinces by a few Latin speaking administrators, military personnel, settled veterans, and private citizens (merchants, traders) who emanated from the Empire's various regions (and not merely from Roman Italy). They served as a nucleus for the acculturation of local notables.

Given their small numbers and varied origins, Romanization does not appear to have left distinct genetic signatures in Europe. Indeed, Romance-speaking populations in the Balkans, like Romanians, Aromanians, Moldovans, etc. have been found to genetically resemble neighbouring Greek and South Slavic-speaking peoples rather than modern Italians, proving that they were genetically speaking, mainly through I2a2 M-423 and E1b1b1, V-13 Haplogroups native to this area.

Steven Bird has speculated that E1b1b1a was spread during the Roman era through Thracian and Dacian populations from the Balkans into the rest of Europe.

Concerning the late Roman period of (not only) Germanic "Völkerwanderung", some suggestions have been made, at least for Britain, with Y haplogroup I1a being associated with Anglo-Saxon immigration in eastern England, and R1a being associated with Norse immigration in northern Scotland.

Genetics of modern European populations

Patrilineal studies

There are four main Y-chromosome DNA haplogroups that account for most of Europe's patrilineal descent.

Haplogroup I is found in the form of various sub-clades throughout Europe and is found at highest frequencies in the Nordic Countries as I1 (Norway, Denmark, Sweden, Finland) and in the Balkan Peninsula as I2a (Bosnia and Herzegovina 65%, Croatia and Serbia). I1 is also frequent in Germany, Great Britain and Netherlands, while I2a is frequent also in Sardinia, Romania/Moldova, Bulgaria and Ukraine. This clade is found at its highest expression by far in Europe and may have been there since before the LGM.
Haplogroup E1b1b (formerly known as E3b) represents the last major direct migration from Africa into Europe. It is believed to have first appeared in the Horn of Africa approximately 26,000 years ago and dispersed to North Africa and the Near East during the late Paleolithic and Mesolithic periods. E1b1b lineages are closely linked to the diffusion of Afroasiatic languages. Although present throughout Europe, it peaks in the western Balkan region amongst Albanians and their neighbors. It is also common in Italy and the Iberian peninsula. Haplogroup E1b1b1, mainly in the form of its E1b1b1a2 (E-V13) sub-clade, reaches frequencies above 47% around the area of Kosovo. This clade is thought to have arrived in Europe from western Asia either in the later Mesolithic, or the Neolithic. North Africa subclade E-M81 is also present in Sicily and Andalusia.
Haplogroup R1b is common all over Europe, with R1b1a1a2 especially common in Western Europe. Nearly all of this R1b in Europe is in the form of the R1b1a2 (2011 name) (R-M269) sub-clade, specifically within the R-L23 sub-sub-clade whereas R1b found in Central Asia, western Asia and Africa tends to be in other clades. It has also been pointed out that outlier types are present in Europe and are particularly notable in some areas such as Sardinia and Armenia. Haplogroup R1b frequencies vary from highs in western Europe in a steadily decreasing cline with growing distance from the Atlantic: 80–90% (Welsh, Basque, Irish, Scots, Bretons) around 70–80% in other areas of Spain, Britain and France and around 40–60% in most other parts of western Europe like eastern Germany, and northern-central Italy. It drops outside this area and is around 30% or less in areas such as southern Italy, Sweden, Poland, the Balkans and Cyprus. R1b remains the most common clade as one moves east to Germany, while farther east, in Poland, R1a is more common (see below). In southeastern Europe, R1b drops behind R1a in the area in and around Hungary and Serbia but is more common both to the south and north of this region. R1b in Western Europe is dominated by at least two sub-clades, R-U106, which is distributed from the east side of the Rhine into northern and central Europe (with a strong presence in England) and R-P312, which is most common west of the Rhine, including the British Isles. Some have posited that this haplogroup's presence in Europe dates back to the LGM, while others link it to the spread of the Centum branch of the Indo-European languages.

The oldest human remains found to carry R1b so far are an individual from the Epigravettian culture context in Italy (Villabruna) who lived c. 12,000 BCE and reportedly belonged to R1b1a (L754), and the 7,000 year-old remains of a hunter-gatherer, belonging to the Samara culture of the Volga River area who carried R1b1* (R-L278*).

Haplogroup R1a, almost entirely in the R1a1a sub-clade, is prevalent in much of Eastern and Central Europe (also in South and Central Asia). For example, there is a sharp increase in R1a1 and decrease in R1b1b2 as one goes east from Germany to Poland. It also has a substantial presence in Scandinavia (particularly Norway). In the Baltic countries R1a frequencies decrease from Lithuania (45%) to Estonia (around 30%).

Putting aside small enclaves, there are also several haplogroups apart from the above four that are less prominent or most common only in certain areas of Europe.

Haplogroup G, the original Neolithic Europeans (Caucasians), is common in most parts of Europe at a low frequency, reaching peaks above 70% around Georgia and among the Madjars (although living in Asia they border the eastern perimeter of Europe), up to 10% in Sardinia, 12% in Corsica and Uppsala (Sweden), 11% in the Balkans and Portugal, 10% in Spain and 9% in European Russia. This clade is also found in the Near East.
Haplogroup N, is common only in the northeast of Europe and in the form of its N1c1 sub-clade reaches frequencies of approximately 60% among Finns and approximately 40% among Estonians, Latvians, and Lithuanians.
Haplogroup J2, in various sub-clades (J2a, J2b), is found in levels of around 15–30% in parts of the Balkans and Italy and is common all over Europe and especially the Mediterranean basin

Matrilineal studies

There have been a number of studies about the mitochondrial DNA haplogroups (mtDNA) in Europe. In contrast to Y DNA haplogroups, mtDNA haplogroups did not show as much geographical patterning, but were more evenly ubiquitous. Apart from the outlying Saami, all Europeans are characterised by the predominance of haplogroups H, U and T. The lack of observable geographic structuring of mtDNA may be due to socio-cultural factors, namely the phenomena of polygyny and patrilocality.

Genetic studies suggest some maternal gene flow to eastern Europe from eastern Asia or southern Siberia 13,000 – 6,600 years BP. Analysis of Neolithic skeletons in the Great Hungarian Plain found a high frequency of eastern Asian mtDNA haplogroups, some of which survive in modern eastern European populations. Maternal gene flow to Europe from sub-Saharan Africa began as early as 11,000 years BP, although the majority of lineages, approximately 65%, are estimated to have arrived more recently, including during the Romanization period, the Arab conquests of southern Europe, and during the Atlantic slave trade.

European population sub-structure

Genetically, Europe is relatively homogeneous, but distinct sub-population patterns of various types of genetic markers have been found, particularly along a southeast-northwest cline. For example, Cavalli-Sforza's principal component analyses revealed five major clinal patterns throughout Europe, and similar patterns have continued to be found in more recent studies.

A cline of genes with highest frequencies in the Middle East, spreading to lowest levels northwest. Cavalli-Sforza originally described this as faithfully reflecting the spread of agriculture in Neolithic times. This has been the general tendency in interpretation of all genes with this pattern.
A cline of genes with highest frequencies among Finnish and Sami in the extreme north east, and spreading to lowest frequencies in the south west.
A cline of genes with highest frequencies in the area of the lower Don and Volga rivers in southern Russia, and spreading to lowest frequencies in Spain, Southern Italy, Greece and the areas inhabited by Saami speakers in the extreme north of Scandinavia. Cavalli-Sforza associated this with the spread of Indo-European languages, which he links in turn to a "secondary expansion" after the spread of agriculture, associated with animal grazing.
A cline of genes with highest frequencies in the Balkans and Southern Italy, spreading to lowest levels in Britain and the Basque country. Cavalli-Sforza associates this with "the Greek expansion, which reached its peak in historical times around 1000 and 500 BCE but which certainly began earlier".
A cline of genes with highest frequencies in the Basque country, and lower levels beyond the area of Iberia and Southern France. In perhaps the most well-known conclusion from Cavalli-Sforza, this weakest of the five patterns was described as isolated remnants of the pre-Neolithic population of Europe, "who at least partially withstood the expansion of the cultivators". It corresponds roughly to the geographical spread of rhesus negative blood types. In particular, the conclusion that the Basques are a genetic isolate has become widely discussed, but also a controversial conclusion.

He also created a phylogenetic tree to analyse the internal relationships among Europeans. He found four major 'outliers'- Basques, Sami, Sardinians and Icelanders; a result he attributed to their relative isolation (note: the Icelanders and the Sardinians speak Indo-European languages, while the other two groups do not). Greeks and Yugoslavs represented a second group of less extreme outliers. The remaining populations clustered into several groups : "Celtic", "Germanic", "south-western Europeans", "Scandinavians" and "eastern Europeans".

A study in May 2009 of 19 populations from Europe using 270,000 SNPs highlighted the genetic diversity of European populations corresponding to the northwest to southeast gradient and distinguished "four several distinct regions" within Europe:

Finland, showing the greatest distance to the rest of Europeans.
the Baltic region (Estonia, Latvia and Lithuania), western Russia and eastern Poland.
Central and Western Europe.
Italy, "with the southern Italians being more distant".

In this study, barrier analysis revealed "genetic barriers" between Finland, Italy and other countries and that barriers could also be demonstrated within Finland (between Helsinki and Kuusamo) and Italy (between northern and southern part, Fst=0.0050). Fst (Fixation index) was found to correlate considerably with geographic distances ranging from ≤0.0010 for neighbouring populations to 0.0200–0.0230 for Southern Italy and Finland. For comparisons, pair-wise Fst of non-European samples were as follows: Europeans – Africans (Yoruba) 0.1530; Europeans – Chinese 0.1100; Africans (Yoruba) – Chinese 0.1900.

A study by Chao Tian in August 2009 extended the analysis of European population genetic structure to include additional southern European groups and Arab populations (Palestinians, Druzes...) from the Near-East. This study determined autosomal Fst between 18 population groups and concluded that, in general, genetic distances corresponded to geographical relationships with smaller values between population groups with origins in neighbouring countries/regions (for example, Greeks/Tuscans: Fst=0.0010, Greeks/Palestinians: Fst=0.0057) compared with those from very different regions in Europe (for example Greeks/Swedish: Fst=0.0087, Greeks/Russians: Fst=0.0108).

Autosomal DNA

Seldin (2006) used over 5,000 autosomal SNPs. It showed "a consistent and reproducible distinction between ‘northern’ and ‘southern’ European population groups". Most individual participants with southern European ancestry (Italians, Greeks, Portuguese, Spaniards), and Ashkenazi Jews have >85% membership in the southern population; and most northern, western, central, and eastern Europeans (Swedes, English, Irish, Germans, and Ukrainians) have >90% in the northern population group. However, many of the participants in this study were actually American citizens who self-identified with different European ethnicities based on self-reported familial pedigree.

A similar study in 2007 using samples predominantly from Europe found that the most important genetic differentiation in Europe occurs on a line from the north to the south-east (northern Europe to the Balkans), with another east-west axis of differentiation across Europe. Its findings were consistent with earlier results based on mtDNA and Y-chromosomal DNA that support the theory that modern Iberians (Spanish and Portuguese) hold the most ancient European genetic ancestry, as well as separating Basques and Sami from other European populations.

It suggested that the English and Irish cluster with other Northern and Eastern Europeans such as Germans and Poles, while some Basque and Italian individuals also clustered with Northern Europeans. Despite these stratifications, it noted that "there is low apparent diversity in Europe with the entire continent-wide samples only marginally more dispersed than single population samples elsewhere in the world".

In 2008, two international research teams published analyses of large-scale genotyping of large samples of Europeans, using over 300,000 autosomal SNPs. With the exception of usual isolates such as Basques, Finns and Sardinians, the European population lacked sharp discontinuities (clustering) as previous studies have found (see Seldin et al. 2006 and Bauchet et al. 2007), although there was a discernible south to north gradient. Overall, they found only a low level of genetic differentiation between subpopulations, and differences which did exist were characterised by a strong continent-wide correlation between geographic and genetic distance. In addition, they found that diversity was greatest in southern Europe due a larger effective population size and/or population expansion from southern to northern Europe. The researchers take this observation to imply that genetically, Europeans are not distributed into discrete populations.

A study on north-eastern populations, published in March 2013, found that Komi peoples formed a pole of genetic diversity that is distinct from other populations.

Autosomal genetic distances (Fst) based on SNPs (2009)

The genetic distance between populations is often measured by Fixation index (Fst), based on genetic polymorphism data, such as single-nucleotide polymorphisms (SNPs) or microsatellites. Fst is a special case of F-statistics, the concept developed in the 1920s by Sewall Wright. Fst is simply the correlation of randomly chosen alleles within the same sub-population relative to that found in the entire population. It is often expressed as the proportion of genetic diversity due to allele frequency differences among populations.

The values range from 0 to 1. A zero value implies that the two populations are panmictic, that they are interbreeding freely. A value of one would imply that the two populations are completely separate. The greater the Fst value, the greater the genetic distance. Essentially, these low Fst values suggest that the majority of genetic variation is at the level of individuals within the same population group (~ 85%); whilst belonging to a different population group within same ‘race’/ continent, and even to different racial/ continental groups added a much smaller degree of variation (3–8%; 6–11%, respectively).

Y chromosome

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Y_chromosome

Human Y chromosome
Human Y chromosome (after G-banding)
Y chromosome in human male karyogram
Features
Length (bp)	57,227,415 bp (GRCh38)
No. of genes	63 (CCDS)
Type	Allosome
Centromere position	Acrocentric (10.4 Mbp)
Complete gene lists
CCDS	Gene list
HGNC	Gene list
UniProt	Gene list
NCBI	Gene list
External map viewers
Ensembl	Chromosome Y
Entrez	Chromosome Y
NCBI	Chromosome Y
UCSC	Chromosome Y
Full DNA sequences
RefSeq	NC_000024 (FASTA)
GenBank	CM000686 (FASTA)

The Y chromosome is one of two sex chromosomes (allosomes) in mammals, including humans, and many other animals. The other is the X chromosome. Y is normally the sex-determining chromosome in many species, since it is the presence or absence of Y that typically determines the male or female sex of offspring produced in sexual reproduction. In mammals, the Y chromosome contains the gene SRY, which triggers male development. The DNA in the human Y chromosome is composed of about 59 million base pairs. The Y chromosome is passed only from father to son. With a 30% difference between humans and chimpanzees, the Y chromosome is one of the fastest-evolving parts of the human genome. The human Y chromosome carries an estimated 100-200 genes, with between 45 and 73 of these protein-coding. All single-copy Y-linked genes are hemizygous (present on only one chromosome) except in cases of aneuploidy such as XYY syndrome or XXYY syndrome.

Overview

Discovery

The Y chromosome was identified as a sex-determining chromosome by Nettie Stevens at Bryn Mawr College in 1905 during a study of the mealworm Tenebrio molitor. Edmund Beecher Wilson independently discovered the same mechanisms the same year. Stevens proposed that chromosomes always existed in pairs and that the Y chromosome was the pair of the X chromosome discovered in 1890 by Hermann Henking. She realized that the previous idea of Clarence Erwin McClung, that the X chromosome determines sex, was wrong and that sex determination is, in fact, due to the presence or absence of the Y chromosome. Stevens named the chromosome "Y" simply to follow on from Henking's "X" alphabetically.

The idea that the Y chromosome was named after its similarity in appearance to the letter "Y" is mistaken. All chromosomes normally appear as an amorphous blob under the microscope and only take on a well-defined shape during mitosis. This shape is vaguely X-shaped for all chromosomes. It is entirely coincidental that the Y chromosome, during mitosis, has two very short branches which can look merged under the microscope and appear as the descender of a Y-shape.

Variations

Most therian mammals have only one pair of sex chromosomes in each cell. Males have one Y chromosome and one X chromosome, while females have two X chromosomes. In mammals, the Y chromosome contains a gene, SRY, which triggers embryonic development as a male. The Y chromosomes of humans and other mammals also contain other genes needed for normal sperm production.

There are exceptions, however. Among humans, some men have two Xs and a Y ("XXY", see Klinefelter syndrome), or one X and two Ys (see XYY syndrome), and some women have three Xs or a single X instead of a double X ("X0", see Turner syndrome). There are other exceptions in which SRY is damaged (leading to an XY female), or copied to the X (leading to an XX male).

Origins and evolution

Before Y chromosome

Many ectothermic vertebrates have no sex chromosomes. If they have different sexes, sex is determined environmentally rather than genetically. For some of them, especially reptiles, sex depends on the incubation temperature. Some vertebrates are hermaphrodites, although other than a very few ray-finned fish, they are sequential (the same organism produces male or female gametes, but never both, at different points in its life), rather than simultaneous (the same organism producing both male and female gametes at the same time).

Origin

The X and Y chromosomes are thought to have evolved from a pair of identical chromosomes, termed autosomes, when an ancestral animal developed an allelic variation, a so-called "sex locus" – simply possessing this allele caused the organism to be male. The chromosome with this allele became the Y chromosome, while the other member of the pair became the X chromosome. Over time, genes that were beneficial for males and harmful to (or had no effect on) females either developed on the Y chromosome or were acquired through the process of translocation.

Until recently, the X and Y chromosomes were thought to have diverged around 300 million years ago. However, research published in 2010, and particularly research published in 2008 documenting the sequencing of the platypus genome, has suggested that the XY sex-determination system would not have been present more than 166 million years ago, at the split of the monotremes from other mammals. This re-estimation of the age of the therian XY system is based on the finding that sequences that are on the X chromosomes of marsupials and eutherian mammals are present on the autosomes of platypus and birds. The older estimate was based on erroneous reports that the platypus X chromosomes contained these sequences.

Recombination inhibition

Recombination between the X and Y chromosomes proved harmful—it resulted in males without necessary genes formerly found on the Y chromosome, and females with unnecessary or even harmful genes previously only found on the Y chromosome. As a result, genes beneficial to males accumulated near the sex-determining genes, and recombination in this region was suppressed in order to preserve this male specific region. Over time, the Y chromosome changed in such a way as to inhibit the areas around the sex determining genes from recombining at all with the X chromosome. As a result of this process, 95% of the human Y chromosome is unable to recombine. Only the tips of the Y and X chromosomes recombine. The tips of the Y chromosome that could recombine with the X chromosome are referred to as the pseudoautosomal region. The rest of the Y chromosome is passed on to the next generation intact, allowing for its use in tracking human evolution.

Degeneration

By one estimate, the human Y chromosome has lost 1,393 of its 1,438 original genes over the course of its existence, and linear extrapolation of this 1,393-gene loss over 300 million years gives a rate of genetic loss of 4.6 genes per million years. Continued loss of genes at the rate of 4.6 genes per million years would result in a Y chromosome with no functional genes – that is the Y chromosome would lose complete function – within the next 10 million years, or half that time with the current age estimate of 160 million years. Comparative genomic analysis reveals that many mammalian species are experiencing a similar loss of function in their heterozygous sex chromosome. Degeneration may simply be the fate of all non-recombining sex chromosomes, due to three common evolutionary forces: high mutation rate, inefficient selection, and genetic drift.

However, comparisons of the human and chimpanzee Y chromosomes (first published in 2005) show that the human Y chromosome has not lost any genes since the divergence of humans and chimpanzees between 6–7 million years ago, and a scientific report in 2012 stated that only one gene had been lost since humans diverged from the rhesus macaque 25 million years ago. These facts provide direct evidence that the linear extrapolation model is flawed and suggest that the current human Y chromosome is either no longer shrinking or is shrinking at a much slower rate than the 4.6 genes per million years estimated by the linear extrapolation model.

High mutation rate

The human Y chromosome is particularly exposed to high mutation rates due to the environment in which it is housed. The Y chromosome is passed exclusively through sperm, which undergo multiple cell divisions during gametogenesis. Each cellular division provides further opportunity to accumulate base pair mutations. Additionally, sperm are stored in the highly oxidative environment of the testis, which encourages further mutation. These two conditions combined put the Y chromosome at a greater opportunity of mutation than the rest of the genome. The increased mutation opportunity for the Y chromosome is reported by Graves as a factor 4.8. However, her original reference obtains this number for the relative mutation rates in male and female germ lines for the lineage leading to humans.

The observation that the Y chromosome experiences little meiotic recombination and has an accelerated rate of mutation and degradative change compared to the rest of the genome suggests an evolutionary explanation for the adaptive function of meiosis with respect to the main body of genetic information. Brandeis proposed that the basic function of meiosis (particularly meiotic recombination) is the conservation of the integrity of the genome, a proposal consistent with the idea that meiosis is an adaptation for repairing DNA damage.

Inefficient selection

Without the ability to recombine during meiosis, the Y chromosome is unable to expose individual alleles to natural selection. Deleterious alleles are allowed to "hitchhike" with beneficial neighbors, thus propagating maladapted alleles into the next generation. Conversely, advantageous alleles may be selected against if they are surrounded by harmful alleles (background selection). Due to this inability to sort through its gene content, the Y chromosome is particularly prone to the accumulation of "junk" DNA. Massive accumulations of retrotransposable elements are scattered throughout the Y. The random insertion of DNA segments often disrupts encoded gene sequences and renders them nonfunctional. However, the Y chromosome has no way of weeding out these "jumping genes". Without the ability to isolate alleles, selection cannot effectively act upon them.

A clear, quantitative indication of this inefficiency is the entropy rate of the Y chromosome. Whereas all other chromosomes in the human genome have entropy rates of 1.5–1.9 bits per nucleotide (compared to the theoretical maximum of exactly 2 for no redundancy), the Y chromosome's entropy rate is only 0.84. This means the Y chromosome has a much lower information content relative to its overall length; it is more redundant.

Genetic drift

Even if a well adapted Y chromosome manages to maintain genetic activity by avoiding mutation accumulation, there is no guarantee it will be passed down to the next generation. The population size of the Y chromosome is inherently limited to 1/4 that of autosomes: diploid organisms contain two copies of autosomal chromosomes while only half the population contains 1 Y chromosome. Thus, genetic drift is an exceptionally strong force acting upon the Y chromosome. Through sheer random assortment, an adult male may never pass on his Y chromosome if he only has female offspring. Thus, although a male may have a well adapted Y chromosome free of excessive mutation, it may never make it into the next gene pool. The repeat random loss of well-adapted Y chromosomes, coupled with the tendency of the Y chromosome to evolve to have more deleterious mutations rather than less for reasons described above, contributes to the species-wide degeneration of Y chromosomes through Muller's ratchet.

Gene conversion

As it has been already mentioned, the Y chromosome is unable to recombine during meiosis like the other human chromosomes; however, in 2003, researchers from MIT discovered a process which may slow down the process of degradation. They found that human Y chromosome is able to "recombine" with itself, using palindrome base pair sequences. Such a "recombination" is called gene conversion.

In the case of the Y chromosomes, the palindromes are not noncoding DNA; these strings of bases contain functioning genes important for male fertility. Most of the sequence pairs are greater than 99.97% identical. The extensive use of gene conversion may play a role in the ability of the Y chromosome to edit out genetic mistakes and maintain the integrity of the relatively few genes it carries. In other words, since the Y chromosome is single, it has duplicates of its genes on itself instead of having a second, homologous, chromosome. When errors occur, it can use other parts of itself as a template to correct them.

Findings were confirmed by comparing similar regions of the Y chromosome in humans to the Y chromosomes of chimpanzees, bonobos and gorillas. The comparison demonstrated that the same phenomenon of gene conversion appeared to be at work more than 5 million years ago, when humans and the non-human primates diverged from each other.

Future evolution

In the terminal stages of the degeneration of the Y chromosome, other chromosomes increasingly take over genes and functions formerly associated with it. Finally, the Y chromosome disappears entirely, and a new sex-determining system arises. Several species of rodent in the sister families Muridae and Cricetidae have reached these stages, in the following ways:

The Transcaucasian mole vole, Ellobius lutescens, the Zaisan mole vole, Ellobius tancrei, and the Japanese spinous country rats Tokudaia osimensis and Tokudaia tokunoshimensis, have lost the Y chromosome and SRY entirely. Tokudaia spp. have relocated some other genes ancestrally present on the Y chromosome to the X chromosome. Both sexes of Tokudaia spp. and Ellobius lutescens have an XO genotype (Turner syndrome), whereas all Ellobius tancrei possess an XX genotype. The new sex-determining system(s) for these rodents remains unclear.
The wood lemming Myopus schisticolor, the Arctic lemming, Dicrostonyx torquatus, and multiple species in the grass mouse genus Akodon have evolved fertile females who possess the genotype generally coding for males, XY, in addition to the ancestral XX female, through a variety of modifications to the X and Y chromosomes.
In the creeping vole, Microtus oregoni, the females, with just one X chromosome each, produce X gametes only, and the males, XY, produce Y gametes, or gametes devoid of any sex chromosome, through nondisjunction.

Outside of the rodents, the black muntjac, Muntiacus crinifrons, evolved new X and Y chromosomes through fusions of the ancestral sex chromosomes and autosomes.

1:1 sex ratio

Fisher's principle outlines why almost all species using sexual reproduction have a sex ratio of 1:1. W. D. Hamilton gave the following basic explanation in his 1967 paper on "Extraordinary sex ratios", given the condition that males and females cost equal amounts to produce:

Suppose male births are less common than female.
A newborn male then has better mating prospects than a newborn female, and therefore can expect to have more offspring.
Therefore, parents genetically disposed to produce males tend to have more than average numbers of grandchildren born to them.
Therefore, the genes for male-producing tendencies spread, and male births become more common.
As the 1:1 sex ratio is approached, the advantage associated with producing males dies away.
The same reasoning holds if females are substituted for males throughout. Therefore, 1:1 is the equilibrium ratio.

Non-therian Y chromosome

Many groups of organisms in addition to therian mammals have Y chromosomes, but these Y chromosomes do not share common ancestry with therian Y chromosomes. Such groups include monotremes, Drosophila, some other insects, some fish, some reptiles, and some plants. In Drosophila melanogaster, the Y chromosome does not trigger male development. Instead, sex is determined by the number of X chromosomes. The D. melanogaster Y chromosome does contain genes necessary for male fertility. So XXY D. melanogaster are female, and D. melanogaster with a single X (X0), are male but sterile. There are some species of Drosophila in which X0 males are both viable and fertile.

ZW chromosomes

Other organisms have mirror image sex chromosomes: where the homogeneous sex is the male, said to have two Z chromosomes, and the female is the heterogeneous sex, and said to have a Z chromosome and a W chromosome. For example, female birds, snakes, and butterflies have ZW sex chromosomes, and males have ZZ sex chromosomes.

Non-inverted Y chromosome

There are some species, such as the Japanese rice fish, in which the XY system is still developing and cross over between the X and Y is still possible. Because the male specific region is very small and contains no essential genes, it is even possible to artificially induce XX males and YY females to no ill effect.

Multiple XY pairs

Monotremes possess four or five (platypus) pairs of XY sex chromosomes, each pair consisting of sex chromosomes with homologous regions. The chromosomes of neighboring pairs are partially homologous, such that a chain is formed during mitosis. The first X chromosome in the chain is also partially homologous with the last Y chromosome, indicating that profound rearrangements, some adding new pieces from autosomes, have occurred in history.

Platypus sex chromosomes have strong sequence similarity with the avian Z chromosome, (indicating close homology), and the SRY gene so central to sex-determination in most other mammals is apparently not involved in platypus sex-determination.

Human Y chromosome

In humans, the Y chromosome spans about 58 million base pairs (the building blocks of DNA) and represents almost 2% of the total DNA in a male cell. The human Y chromosome contains over 200 genes, at least 72 of which code for proteins. Traits that are inherited via the Y chromosome are called Y-linked traits, or holandric traits (from Ancient Greek ὅλος hólos, "whole" + ἀνδρός andrós, "male").

Men can lose the Y chromosome in a subset of cells, which is called the mosaic loss of chromosome Y (LOY). This post-zygotic mutation is strongly associated with age, affecting about 15% of men 70 years of age. Smoking is another important risk factor for LOY. It has been found that men with a higher percentage of hematopoietic stem cells in blood lacking the Y chromosome (and perhaps a higher percentage of other cells lacking it) have a higher risk of certain cancers and have a shorter life expectancy. Men with LOY (which was defined as no Y in at least 18% of their hematopoietic cells) have been found to die 5.5 years earlier on average than others. This has been interpreted as a sign that the Y chromosome plays a role going beyond sex determination and reproduction (although the loss of Y may be an effect rather than a cause). Male smokers have between 1.5 and 2 times the risk of non-respiratory cancers as female smokers.

Non-combining region of Y (NRY)

The human Y chromosome is normally unable to recombine with the X chromosome, except for small pieces of pseudoautosomal regions at the telomeres (which comprise about 5% of the chromosome's length). These regions are relics of ancient homology between the X and Y chromosomes. The bulk of the Y chromosome, which does not recombine, is called the "NRY", or non-recombining region of the Y chromosome. The single-nucleotide polymorphisms (SNPs) in this region are used to trace direct paternal ancestral lines.

Genes

Gene list

In general, the human Y chromosome is extremely gene poor—it is one of the largest gene deserts in the human genome. Disregarding pseudoautosomal genes, genes encoded on the human Y chromosome include:

NRY, with corresponding gene on X chromosome
- AMELY (amelogenin) similar to AMELX on X
- RPS4Y1/RPS4Y2/RPS4X (Ribosomal protein S4)
- DDX3Y (helicase) similar to DDX3X on X
- X-transposed region (XTR), once dubbed "PAR3" but later refuted
  - PCDH11Y (PCDH11X)
  - TGIF2LY (TGIF2LX)
NRY, other
- AZF1 (azoospermia factor 1)
- BPY2 (basic protein on the Y chromosome)
- DAZ1 (deleted in azoospermia)
- DAZ2
- DFNY1 encoding protein Deafness, Y-linked 1
- PRKY (protein kinase, Y-linked)
- RBMY1A1
- SRY (sex-determining region)
- TSPY (testis-specific protein)
- USP9Y
- UTY (ubiquitously transcribed TPR gene on Y chromosome)
- ZFY (zinc finger protein)

Y-chromosome-linked diseases

Diseases linked to the Y chromosome typically involve an aneuploidy, an atypical number of chromosomes.

Y chromosome microdeletion

Y chromosome microdeletion (YCM) is a family of genetic disorders caused by missing genes in the Y chromosome. Many affected men exhibit no symptoms and lead normal lives. However, YCM is also known to be present in a significant number of men with reduced fertility or reduced sperm count.

Defective Y chromosome

This results in the person presenting a female phenotype (i.e., is born with female-like genitalia) even though that person possesses an XY karyotype. The lack of the second X results in infertility. In other words, viewed from the opposite direction, the person goes through defeminization but fails to complete masculinization.

The cause can be seen as an incomplete Y chromosome: the usual karyotype in these cases is 45X, plus a fragment of Y. This usually results in defective testicular development, such that the infant may or may not have fully formed male genitalia internally or externally. The full range of ambiguity of structure may occur, especially if mosaicism is present. When the Y fragment is minimal and nonfunctional, the child is usually a girl with the features of Turner syndrome or mixed gonadal dysgenesis.

XXY

Klinefelter syndrome (47, XXY) is not an aneuploidy of the Y chromosome, but a condition of having an extra X chromosome, which usually results in defective postnatal testicular function. The mechanism is not fully understood; it does not seem to be due to direct interference by the extra X with expression of Y genes.

XYY

47, XYY syndrome (simply known as XYY syndrome) is caused by the presence of a single extra copy of the Y chromosome in each of a male's cells. 47, XYY males have one X chromosome and two Y chromosomes, for a total of 47 chromosomes per cell. Researchers have found that an extra copy of the Y chromosome is associated with increased stature and an increased incidence of learning problems in some boys and men, but the effects are variable, often minimal, and the vast majority do not know their karyotype.

In 1965 and 1966 Patricia Jacobs and colleagues published a chromosome survey of 315 male patients at Scotland's only special security hospital for the developmentally disabled, finding a higher than expected number of patients to have an extra Y chromosome. The authors of this study wondered "whether an extra Y chromosome predisposes its carriers to unusually aggressive behaviour", and this conjecture "framed the next fifteen years of research on the human Y chromosome".

Through studies over the next decade, this conjecture was shown to be incorrect: the elevated crime rate of XYY males is due to lower median intelligence and not increased aggression, and increased height was the only characteristic that could be reliably associated with XYY males. The "criminal karyotype" concept is therefore inaccurate.

Rare

The following Y-chromosome-linked diseases are rare, but notable because of their elucidating of the nature of the Y chromosome.

More than two Y chromosomes

Greater degrees of Y chromosome polysomy (having more than one extra copy of the Y chromosome in every cell, e.g., XYYY) are rare. The extra genetic material in these cases can lead to skeletal abnormalities, decreased IQ, and delayed development, but the severity features of these conditions are variable.

XX male syndrome

XX male syndrome occurs when there has been a recombination in the formation of the male gametes, causing the SRY portion of the Y chromosome to move to the X chromosome. When such an X chromosome contributes to the child, the development will lead to a male, because of the SRY gene.

Genetic genealogy

In human genetic genealogy (the application of genetics to traditional genealogy), use of the information contained in the Y chromosome is of particular interest because, unlike other chromosomes, the Y chromosome is passed exclusively from father to son, on the patrilineal line. Mitochondrial DNA, maternally inherited to both sons and daughters, is used in an analogous way to trace the matrilineal line.

Brain function

Research is currently investigating whether male-pattern neural development is a direct consequence of Y-chromosome-related gene expression or an indirect result of Y-chromosome-related androgenic hormone production.

Microchimerism

The presence of male chromosomes in fetal cells in the blood circulation of women was discovered in 1974.

In 1996, it was found that male fetal progenitor cells could persist postpartum in the maternal blood stream for as long as 27 years.

A 2004 study at the Fred Hutchinson Cancer Research Center, Seattle, investigated the origin of male chromosomes found in the peripheral blood of women who had not had male progeny. A total of 120 subjects (women who had never had sons) were investigated, and it was found that 21% of them had male DNA. The subjects were categorised into four groups based on their case histories:

Group A (8%) had had only female progeny.
Patients in Group B (22%) had a history of one or more miscarriages.
Patients Group C (57%) had their pregnancies medically terminated.
Group D (10%) had never been pregnant before.

The study noted that 10% of the women had never been pregnant before, raising the question of where the Y chromosomes in their blood could have come from. The study suggests that possible reasons for occurrence of male chromosome microchimerism could be one of the following:

miscarriages,
pregnancies,
vanished male twin,
possibly from sexual intercourse.

A 2012 study at the same institute has detected cells with the Y chromosome in multiple areas of the brains of deceased women.

XY sex-determination system

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/XY_sex-determination_system

Drosophila sex-chromosomes

Pollen cones of a male Ginkgo biloba tree, a dioecious species

Ovules of a female Ginkgo biloba

The XY sex-determination system is a sex-determination system used to classify many mammals, including humans, some insects (Drosophila), some snakes, some fish (guppies), and some plants (Ginkgo tree). In this system, the sex of an individual is determined by a pair of sex chromosomes. Females typically have two of the same kind of sex chromosome (XX), and are called the homogametic sex. Males typically have two different kinds of sex chromosomes (XY), and are called the heterogametic sex.

In humans, the presence of the Y chromosome is typically responsible for triggering male development; in the absence of the Y chromosome, the fetus will undergo female development. More specifically, it is the SRY gene located on the Y chromosome that is of importance to male differentiation. Variations to the sex gene karyotype could include rare disorders such as XX males (often due to translocation of the SRY gene to the X chromosome) or XY gonadal dysgenesis in people who are externally female (due to mutations in the SRY gene). In addition, other rare genetic variations such as Turners (XO) and Klinefelters (XXY) are seen as well. In most species in XY sex determination, an organism must have at least one X chromosome in order to survive.

The XY system contrasts in several ways with the ZW sex-determination system found in birds, some insects, many reptiles, and various other animals, in which the heterogametic sex is female. It had been thought for several decades that in all snakes sex was determined by the ZW system, but there had been observations of unexpected effects in the genetics of species in the families Boidae and Pythonidae; for example, parthenogenic reproduction produced only females rather than males, which is the opposite of what is to be expected in the ZW system. In the early years of the 21st century such observations prompted research that demonstrated that all pythons and boas so far investigated definitely have the XY system of sex determination.

A temperature-dependent sex determination system is found in some reptiles and fish.

Mechanisms

All animals have a set of DNA coding for genes present on chromosomes. In humans, most mammals, and some other species, two of the chromosomes, called the X chromosome and Y chromosome, code for sex. In these species, one or more genes are present on their Y chromosome that determine maleness. In this process, an X chromosome and a Y chromosome act to determine the sex of offspring, often due to genes located on the Y chromosome that code for maleness. Offspring have two sex chromosomes: an offspring with two X chromosomes will develop female characteristics, and an offspring with an X and a Y chromosome will develop male characteristics.

Humans

Human male XY chromosomes after G-banding

In humans, half of spermatozoa carry X chromosome and the other half Y chromosome. A single gene (SRY) present on the Y chromosome acts as a signal to set the developmental pathway towards maleness. Presence of this gene starts off the process of virilization. This and other factors result in the sex differences in humans. The cells in females, with two X chromosomes, undergo X-inactivation, in which one of the two X chromosomes is inactivated. The inactivated X chromosome remains within a cell as a Barr body.

Humans, as well as some other organisms, can have a rare chromosomal arrangement that is contrary to their phenotypic sex; for example, XX males or XY females (see androgen insensitivity syndrome). Additionally, an abnormal number of sex chromosomes (aneuploidy) may be present, such as Turner's syndrome, in which a single X chromosome is present, and Klinefelter's syndrome, in which two X chromosomes and a Y chromosome are present, XYY syndrome and XXYY syndrome. Other less common chromosomal arrangements include: triple X syndrome, 48, XXXX, and 49, XXXXX.

Other animals

In most mammals, sex is determined by presence of the Y chromosome. "Female" is the default sex, due to the absence of the Y chromosome. In the 1930s, Alfred Jost determined that the presence of testosterone was required for Wolffian duct development in the male rabbit.

SRY is a sex-determining gene on the Y chromosome in the therians (placental mammals and marsupials). Non-human mammals use several genes on the Y chromosome. Not all male-specific genes are located on the Y chromosome. Platypus, a monotreme, use five pairs of different XY chromosomes with six groups of male-linked genes, AMH being the master switch. Other species (including most Drosophila species) use the presence of two X chromosomes to determine femaleness: one X chromosome gives putative maleness, but the presence of Y chromosome genes is required for normal male development.

Other systems

Birds and many insects have a similar system of sex determination (ZW sex-determination system), in which it is the females that are heterogametic (ZW), while males are homogametic (ZZ).

Many insects of the order Hymenoptera instead have a haplo-diploid system, where the males are haploid (having just one chromosome of each type) while the females are diploid (with chromosomes appearing in pairs). Some other insects have the X0 sex-determination system, where just one chromosome type appears in pairs for the female but alone in the males, while all other chromosomes appear in pairs in both sexes.

Influences

Genetic

PBB Protein SRY image

In an interview for the Rediscovering Biology website, researcher Eric Vilain described how the paradigm changed since the discovery of the SRY gene:

For a long time we thought that SRY would activate a cascade of male genes. It turns out that the sex determination pathway is probably more complicated and SRY may in fact inhibit some anti-male genes.
The idea is instead of having a simplistic mechanism by which you have pro-male genes going all the way to make a male, in fact there is a solid balance between pro-male genes and anti-male genes and if there is a little too much of anti-male genes, there may be a female born and if there is a little too much of pro-male genes then there will be a male born.
We [are] entering this new era in molecular biology of sex determination where it's a more subtle dosage of genes, some pro-males, some pro-females, some anti-males, some anti-females that all interplay with each other rather than a simple linear pathway of genes going one after the other, which makes it very fascinating but very complicated to study.

In mammals, including humans, the SRY gene is responsible with triggering the development of non-differentiated gonads into testes, rather than ovaries. However, there are cases in which testes can develop in the absence of an SRY gene (see sex reversal). In these cases, the SOX9 gene, involved in the development of testes, can induce their development without the aid of SRY. In the absence of SRY and SOX9, no testes can develop and the path is clear for the development of ovaries. Even so, the absence of the SRY gene or the silencing of the SOX9 gene are not enough to trigger sexual differentiation of a fetus in the female direction. A recent finding suggests that ovary development and maintenance is an active process, regulated by the expression of a "pro-female" gene, FOXL2. In an interview for the TimesOnline edition, study co-author Robin Lovell-Badge explained the significance of the discovery:

We take it for granted that we maintain the sex we are born with, including whether we have testes or ovaries. But this work shows that the activity of a single gene, FOXL2, is all that prevents adult ovary cells turning into cells found in testes.

Implications

Looking into the genetic determinants of human sex can have wide-ranging consequences. Scientists have been studying different sex determination systems in fruit flies and animal models to attempt an understanding of how the genetics of sexual differentiation can influence biological processes like reproduction, ageing and disease.

Maternal

In humans and many other species of animals, the father determines the sex of the child. In the XY sex-determination system, the female-provided ovum contributes an X chromosome and the male-provided sperm contributes either an X chromosome or a Y chromosome, resulting in female (XX) or male (XY) offspring, respectively.

Hormone levels in the male parent affect the sex ratio of sperm in humans. Maternal influences also impact which sperm are more likely to achieve conception.

Human ova, like those of other mammals, are covered with a thick translucent layer called the zona pellucida, which the sperm must penetrate to fertilize the egg. Once viewed simply as an impediment to fertilization, recent research indicates the zona pellucida may instead function as a sophisticated biological security system that chemically controls the entry of the sperm into the egg and protects the fertilized egg from additional sperm.

Recent research indicates that human ova may produce a chemical which appears to attract sperm and influence their swimming motion. However, not all sperm are positively impacted; some appear to remain uninfluenced and some actually move away from the egg.

Maternal influences may also be possible that affect sex determination in such a way as to produce fraternal twins equally weighted between one male and one female.

The time at which insemination occurs during the estrus cycle has been found to affect the sex ratio of the offspring of humans, cattle, hamsters, and other mammals. Hormonal and pH conditions within the female reproductive tract vary with time, and this affects the sex ratio of the sperm that reach the egg.

Sex-specific mortality of embryos also occurs.

History

Ancient ideas on sex determination

Aristotle believed incorrectly that the sex of an infant is determined by how much heat a man's sperm had during insemination. He wrote:

...the semen of the male differs from the corresponding secretion of the female in that it contains a principle within itself of such a kind as to set up movements also in the embryo and to concoct thoroughly the ultimate nourishment, whereas the secretion of the female contains material alone. If, then, the male element prevails it draws the female element into itself, but if it is prevailed over it changes into the opposite or is destroyed.

Aristotle claimed in error that the male principle was the driver behind sex determination, such that if the male principle was insufficiently expressed during reproduction, the fetus would develop as a female.

20th century genetics

Nettie Stevens and Edmund Beecher Wilson are credited with independently discovering, in 1905, the chromosomal XY sex-determination system, i.e. the fact that males have XY sex chromosomes and females have XX sex chromosomes.

The first clues to the existence of a factor that determines the development of testis in mammals came from experiments carried out by Alfred Jost, who castrated embryonic rabbits in utero and noticed that they all developed as female.

In 1959, C. E. Ford and his team, in the wake of Jost's experiments, discovered that the Y chromosome was needed for a fetus to develop as male when they examined patients with Turner's syndrome, who grew up as phenotypic females, and found them to be X0 (hemizygous for X and no Y). At the same time, Jacob & Strong described a case of a patient with Klinefelter syndrome (XXY), which implicated the presence of a Y chromosome in development of maleness.

All these observations lead to a consensus that a dominant gene that determines testis development (TDF) must exist on the human Y chromosome. The search for this testis-determining factor (TDF) led a team of scientists in 1990 to discover a region of the Y chromosome that is necessary for the male sex determination, which was named SRY (sex-determining region of the Y chromosome).

Search This Blog

Thursday, December 10, 2020

Genetic history of Europe

Prehistory

Palaeolithic

Upper Paleolithic

Last Glacial Maximum

Mesolithic

Neolithic

Bronze Age

Recent history

Genetics of modern European populations

Patrilineal studies

Matrilineal studies

European population sub-structure

Autosomal DNA

Autosomal genetic distances (Fst) based on SNPs (2009)

Y chromosome

Overview

Discovery

Variations

Origins and evolution

Before Y chromosome

Origin

Recombination inhibition

Degeneration

High mutation rate

Inefficient selection

Genetic drift

Gene conversion

Future evolution

1:1 sex ratio

Non-therian Y chromosome

ZW chromosomes

Non-inverted Y chromosome

Multiple XY pairs

Human Y chromosome

Non-combining region of Y (NRY)

Genes

Gene list

Y-chromosome-linked diseases

Y chromosome microdeletion

Defective Y chromosome

XXY

XYY

Rare

More than two Y chromosomes

XX male syndrome

Genetic genealogy

Brain function

Microchimerism

XY sex-determination system

Mechanisms

Humans

Other animals

Other systems

Influences

Genetic

Implications

Maternal

History

Ancient ideas on sex determination

20th century genetics

Chronology of the universe