Haplogroup E-M2, also known as E1b1a1-M2, is a human Y-chromosome DNA haplogroup. E-M2 is primarily distributed within Africa followed by West Asia. More specifically, E-M2 is the predominant subclade in West Africa, Central Africa, Southern Africa, and the region of the African Great Lakes; it also occurs at moderate frequencies in North Africa, and the Middle East. E-M2 has several subclades, but many of these subhaplogroups are included in either E-L485 or E-U175. E-M2 is especially common among indigenous Africans who speak Niger-Congo languages, and was spread to Southern Africa and East Africa through the Bantu expansion.
Origins
The discovery of two SNPs (V38 and V100) by Trombetta et al. (2011) significantly redefined the E-V38 phylogenetic tree. This led the authors to suggest that E-V38 may have originated in East Africa. E-V38 joins the West African-affiliated E-M2 and the Northeast African-affiliated E-M329 with an earlier common ancestor who, like E-P2, may have also originated in East Africa.[5] The downstream SNP E-M180 may have originated in the humidSaharan savanna/grassland of North Africa between 14,000 BP and 10,000 BP.[6][7][8][9] According to Wood et al. (2005) and Rosa et al. (2007), such population movements changed the pre-existing population Y chromosomal diversity in Central, Southern, and Southeastern Africa, replacing the previous haplogroup frequencies in these areas with the now dominant E1b1a1 lineages. Traces of earlier inhabitants, however, can be observed today in these regions via the presence of the Y DNA haplogroups A1a, A1b, A2, A3, and B-M60 that are common in certain populations, such as the Mbuti and Khoisan.[10][11][12] Shriner et al. (2018) similarly suggests that haplogroup E1b1a-V38 traversed across the Green Sahara from east to west around 19,000 years ago, where E1b1a1-M2 may have subsequently originated in West Africa or Central Africa. Shriner et al. (2018) also traces this movement via sickle cell mutation, which likely originated during the Green Sahara period.[4]
At Taukome, in Botswana, an individual, dated to the Early Iron Age (1100 BP), carried haplogroups E1b1a1 (E-M2, E-Z1123) and L0d3b1.[13][14]
Democratic Republic of Congo
At Kindoki, in the Democratic Republic of Congo, there were three individuals, dated to the protohistoric period (230 BP, 150 BP, 230 BP); one carried haplogroups E1b1a1a1d1a2 (E-CTS99, E-CTS99) and L1c3a1b, another carried haplogroup E (E-M96, E-PF1620), and the last carried haplogroups R1b1 (R-P25 1, R-M415) and L0a1b1a1.[13][14]
Egypt
Hawass et al. (2012) determined that the ancient Egyptian mummy of an unknown man buried with Ramesses III was, because of the proven genetic relationship and a mummification process that suggested punishment, a good candidate for the pharaoh's son, Pentaweret, who was the only son to revolt against his father.[15] It was impossible to determine his cause of death.[15] Using Whit Athey's haplogroup predictor based on Y-STR values, both mummies were predicted to share the Y chromosomal haplogroup E1b1a1-M2 and 50% of their genetic material, which pointed to a father-son relationship.[15] Gad et al. (2021) indicates that Ramesses III and Unknown Man E, possibly Pentawere, carried haplogroup E1b1a.[16]
Kenya
At Deloraine Farm, in Nakuru County, Kenya, an iron metallurgist of the Iron Age carried haplogroups E1b1a1a1a1a/E-M58 and L5b1.[17][18]
At Lamu, Pate Island, Faza, in Kenya, an individual, dated between 1500 CE and 1700 CE, carried haplogroups E1b1a1a1a2a1a and L3e3a.[19]
At Taita Taveta, Makwasinyi, in Kenya, an individual, dated between 1650 CE and 1950 CE, carried haplogroups E1b1a1a1a2a1a and L4b2a.[19]
At Taita Taveta, Makwasinyi, in Kenya, an individual, dated between 1650 CE and 1950 CE, carried haplogroups E1b1a1a1a2a1a3b1d1c and L1c3b1a.[19]
At Taita Taveta, Makwasinyi, in Kenya, an individual, dated between 1650 CE and 1950 CE, carried haplogroups E1b1a1a1a2a1a and L2a1+143.[19]
At Taita Taveta, Makwasinyi, in Kenya, an individual, dated between 1667 cal CE and 1843 cal CE, carried haplogroups E1b1a1a1a2a1a3b1d1c and L2a1+143.[19]
At Taita Taveta, Makwasinyi, in Kenya, an individual, dated between 1709 cal CE and 1927 cal CE, carried haplogroups E1b1a1a1a2a1a3a1d~ and L3a2.[19]
Tanzania
At Songo Mnara, in Tanzania, an individual, dated between 1418 cal CE and 1450 cal CE, carried haplogroups E1b1a1~ and L3e2b.[19]
At Lindi, in Tanzania, an individual, dated between 1511 cal CE and 1664 cal CE, carried haplogroups E1b1a1a1a2a1a3a1d~ and L0a1a2.[19]
At a San Jose de los Naturales Royal Hospital burial site, in Mexico City, Mexico, three enslaved West Africans of West African and Southern African ancestry, dated between 1453 CE and 1626 CE, 1450 CE and 1620 CE, and 1436 CE and 1472 CE, were found; one carried haplogroups E1b1a1a1c1b/E-M263.2 and L1b2a, another carried haplogroups E1b1a1a1d1/E-P278.1/E-M425 and L3d1a1a, and the last carried haplogroups E1b1a1a1c1a1c/E-CTS8030 and L3e1a1a.[21]Human leukocyte antigen alleles further confirm that the individuals were of Sub-Saharan African origin.[22]
Portugal
At Cabeço da Amoreira, in Portugal, an enslaved West African man, who may have been from the Senegambian coastal region of Gambia, Mauritania, or Senegal, and carried haplogroups E1b1a and L3b1a, was buried among shell middens between the 16th century CE and the 18th century CE.[23]
Saint Helena
In Saint Helena, 20 freed Africans,[24][25] who were dated to the 19th century CE,[24] were also of western Central African[24][26][27] (e.g., Bantu peoples of Gabon and Angola) ancestry.[24] One female individual carried haplogroup L1b1a10b.[28] One female individual carried haplogroup L2a1f.[28] One female individual carried haplogroup L2a1a3c.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1d and L1c3a.[28] One male individual carried haplogroups E1b1a1a1a1c1a1a and L0a1b2a.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1a2a2 and L0a1e.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1 and L2a1f1.[28] One male individual carried haplogroups E1b1a1 and L3.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1d and L3e1e.[28] One male individual carried haplogroups E1b1a1a1a2a1a3a1d and L3e3b2.[28] One male individual carried haplogroups E1b1a1a1a1c1a1a3 and L3e1a3a.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1a2a2 and L2b1a.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1 and L3f1b1a.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1d1c1a and L3d3a1.[28] One male individual carried haplogroups B2a1a1a1 and L3e2b1.[28] One male individual carried haplogroups E1b1a1a1a2a1a3b1d1c1a and L2a1f.[28] One male individual carried haplogroups E1b1a1a1a1c1a1a3a1c1 and L3e1d1a.[28] One male individual carried haplogroups E1b1a1a1a2a1a3a1d and L1b1a10.[28] One male individual carried haplogroups E1b1a1a1a1c1a1a3a1c and L2a1f1.[28] One male individual carried haplogroups E1b1a1a1a1c1a1 and L2b1a.[28] An enslaved African American man and woman, from the 18th century CE Anson Street burial site in Charleston, South Carolina, who carried haplogroup L3e1e, shared this haplogroup with freed Africans in Saint Helena.[29] Based on those who were present among enlaved Africans, the ratio of males-to-females supports the conclusion of there being a strong selection bias for males in the latter period of the Trans-Atlantic Slave Trade.[24][30][31] Consequently, due to this study on the freed Africans of Saint Helena, among other studies, greater genetic insights have been made into the Trans-Atlantic Slave Trade and its effects on the demographics of Africa.[32]
At Avery’s Rest, in Chesapeake, Delaware, 3 out of 11 individuals were African Americans, who were dated between 1675 CE and 1725 CE; one was of West African ancestry and carried haplogroups E1b1a-CTS2447 and L3e3b, another was of western Central African Bantu-speaking ancestry and carried E1b1a-Z5974 and L0a1a2, and another was of West African and East African ancestry and carried E1b1a-Z5974 and L3d2.[36]
At Catoctin Furnace African American Cemetery, in Catoctin Furnace, Maryland, there were 27 African Americans found who were dated between 1774 CE and 1850 CE.[37][38] One male individual, who was of 98.14% Sub-Saharan African ancestry, carried haplogroups E1b1a1a1a1c2c and L2a1+143+@16309.[39] One male individual, who was of 83.73% Sub-Saharan African and 7.74% European ancestry, carried haplogroups E1b1a1a1a1c1b1 and L3e2a1b1.[39] One male individual, who was of 84.94% Sub-Saharan African and 9.45% European ancestry, carried haplogroups E1b1a1a1a2a1a and L2a1+143+16189 (16192)+@16309.[39] One male individual, who was of 87.83% Sub-Saharan African and 8.23% European ancestry, carried haplogroups E1b1a1a1a1c1a1a3a1d1 and L3d1b3.[39] One male individual, who was of 98.14% Sub-Saharan African ancestry, carried haplogroups E1b1a1a1a1a and L3e2a1b1.[39] One male individual, who was of 93.87% Sub-Saharan African and 2.58% European ancestry, carried haplogroups E1b1a1a1 and L3e1.[39] One male individual, who was of 98.70% Sub-Saharan African ancestry, carried haplogroups E1b1a1a1a1c1b2a and L2a1a1.[39] One male individual, who was of 97.01% Sub-Saharan African ancestry, carried haplogroups E1b1a1a1a1c1a1 and L3e2a1b1.[39] One male individual, who was of 82.31% Sub-Saharan African and 10.24% European ancestry, carried haplogroups E1b1a1a1a1c1b and L3e2a1b1.[39] One male individual, who was of 91.82% Sub-Saharan African and 5.31% European ancestry, carried haplogroups E1b1a1a1a1c1a1 and L3e2.[39] One male individual, who was of 81.18% Sub-Saharan African and 14.86% European ancestry, carried haplogroups E1b1a1~ and L2c.[39]
At an Anson Street burial site, in Charleston, South Carolina, there were 18 African Americans found who were dated to the 18th century CE.[40] Banza was of western Central African ancestry and carried haplogroups E1b1a-CTS668 and L3e3b1.[40] Lima was of West African ancestry and carried haplogroups E1b1a-M4671 and L3b3.[40] Kuto was of western Central African ancestry and carried haplogroups E1b1a-CTS2198 and L2a1a2.[40] Anika was of Sub-Saharan African ancestry and carried haplogroups E1b1a-CTS6126 and L2b1.[40] Nana was of West African ancestry and carried haplogroup L2b3a.[40] Zimbu was of western Central African ancestry and carried haplogroups E1b1a-CTS5497 and L3e1e.[40] Wuta was of Sub-Saharan African ancestry and carried haplogroups E1b1a-CTS7305 and L3e2b+152.[40] Daba was of West African ancestry and carried haplogroups E1b1a-M4273 and L2c.[40] Fumu was of Sub-Saharan African ancestry and carried haplogroups B2a1a-Y12201 and L3e2b+152.[40] Lisa was of West African ancestry and carried haplogroups E1b1a-Z6020 and H100.[40] Ganda was of West African ancestry and carried haplogroups E1b1a-CTS5612 and L1c1c.[40] Coosaw was of West African and Native American ancestry and carried haplogroups E2b1a-CTS2400 and A2.[40] Kidzera was of western Central African ancestry and carried haplogroup L2a1a2c.[40] Pita was of Sub-Saharan African ancestry and carried haplogroups E1b1a-M4287 and L3e2b.[40] Tima was of western Central African ancestry and carried haplogroup L3e1e.[40] Jode was of Sub-Saharan African ancestry and carried haplogroups E1b1a-CTS4975 and L2a1a2c.[40] Ajana was of western Central African ancestry and carried haplogroup L2a1I.[40] Isi was of western Central African ancestry and carried haplogroup L3e2a.[40]
Medical DNA
Sickle Cell
Amid the Green Sahara, the mutation for sickle cell originated in the Sahara[41] or in the northwest forest region of western Central Africa (e.g., Cameroon)[41][42] by at least 7,300 years ago,[41][42] though possibly as early as 22,000 years ago.[43][42] The ancestral sickle cell haplotype to modern haplotypes (e.g., Cameroon/Central African Republic and Benin/Senegal haplotypes) may have first arose in the ancestors of modern West Africans, bearing haplogroups E1b1a1-L485 and E1b1a1-U175 or their ancestral haplogroup E1b1a1-M4732.[41] West Africans (e.g., Yoruba and Esan of Nigeria), bearing the Benin sickle cell haplotype, may have migrated through the Northeast Africa into the western Arabia.[41] West Africans (e.g., Mende of Sierra Leone), bearing the Senegal sickle cell haplotype,[44][41] may have migrated into Mauritania (77% modern rate of occurrence) and Senegal (100%); they may also have migrated across the Sahara, into North Africa, and from North Africa, into Southern Europe, Turkey, and a region near northern Iraq and southern Turkey.[44] Some may have migrated into and introduced the Senegal and Benin sickle cell haplotypes into Basra, Iraq, where both occur equally.[44] West Africans bearing the Benin sickle cell haplotype, may have migrated into the northern region of Iraq (69.5%), Jordan (80%), Lebanon (73%), Oman (52.1%), and Egypt (80.8%).[44]
Distribution
E-M2's frequency and diversity are highest in West Africa. Within Africa, E-M2 displays a west-to-east as well as a south-to-north clinal distribution. In other words, the frequency of the haplogroup decreases as one moves from western and southern Africa toward the eastern and northern parts of Africa.[45]
E-M2 is found at low to moderate frequencies in North Africa, and Northeast Africa. Some of the lineages found in these areas are possibly due to the Bantu expansion or other migrations.[45][53] However, the discovery in 2011 of the E-M2 marker that predates E-M2 has led Trombetta et al. to suggest that E-M2 may have originated in East Africa.[5] In Eritrea and most of Ethiopia (excluding the Anuak), E-V38 is usually found in the form of E-M329, which is autochthonous, while E-M2 generally indicates Bantu migratory origins.[54][55][56]
Outside of Africa, E-M2 has been found at low frequencies. The clade has been found at low frequencies in West Asia. A few isolated occurrences of E-M2 have also been observed among populations in Southern Europe, such as Croatia, Malta, Spain and Portugal.[64][65][66][67]
The Trans-Atlantic slave trade brought people to North America, Central America and South America including the Caribbean. Consequently, the haplogroup is often observed in the United States populations in men who self-identify as African Americans.[75] It has also been observed in a number of populations in Mexico, the Caribbean, Central America, and South America among people of African descent.
E1b1a1 is defined by markers DYS271/M2/SY81, M291, P1/PN1, P189, P293, V43, and V95. Whilst E1b1a reaches its highest frequency of 81% in Senegal, only 1 of the 139 Senegalese that were tested showed M191/P86.[48] In other words, as one moves to West Africa from western Central Africa, the less subclade E1b1a1f is found. Cruciani et al. (2002) states: "A possible explanation might be that haplotype 24 chromosomes [E-M2*] were already present across the Sudanese belt when the M191 mutation, which defines haplotype 22, arose in central western Africa. Only then would a later demic expansion have brought haplotype 22 chromosomes from central western to western Africa, giving rise to the opposite clinal distributions of haplotypes 22 and 24."[46]
E1b1a1a1
E1b1a1a1 is commonly defined by M180/P88. The basal subclade is quite regularly observed in M2+ samples.
E1b1a1a1a
E1b1a1a1a is defined by marker M58. 5% (2/37) of the town Singa-Rimaïbé, Burkina Faso tested positive for E-M58.[46] 15% (10/69) of Hutus in Rwanda tested positive for M58.[45] Three South Africans tested positive for this marker.[12] One Carioca from Rio de Janeiro, Brazil tested positive for the M58 SNP.[83] The place of origin and age is unreported.
E1b1a1a1b
E1b1a1a1b is defined by M116.2, a private marker. A single carrier was found in Mali.[12][d]
E1b1a1a1c
E1b1a1a1c is defined by private marker M149. This marker was found in a single South African.[12]
E1b1a1a1d
E1b1a1a1d is defined by a private marker M155. It is known from a single carrier in Mali.[12]
E1b1a1a1e
E1b1a1a1e is defined by markers M10, M66, M156 and M195. Wairak people in Tanzania tested 4.6% (2/43) positive for E-M10.[45] E-M10 was found in a single person of the Lissongo group in the Central African Republic and two members in a "Mixed" population from the Adamawa region.[12]
E1b1a1a1f
E1b1a1a1f is defined by L485. The basal node E-L485* appears to be somewhat uncommon but has not been sufficiently tested in large populations. The ancestral L485 SNP (along with several of its subclades) was very recently discovered. Some of these SNPs have little or no published population data and/or have yet to receive nomenclature recognition by the YCC.
E1b1a1a1f1 is defined by marker L514. This SNP is currently without population study data outside of the 1000 Genomes Project.
E1b1a1a1f1a (YCC E1b1a7) is defined by marker M191/P86. Filippo et al. (2011) studied a number of African populations that were E-M2 positive and found the basal E-M191/P86 (without E-P252/U174) in a population of Gur speakers in Burkina Faso.[84] Montano et al. (2011) found similar sparse distribution of E-M191* in Nigeria, Gabon, Cameroon and Congo.[9] M191/P86 positive samples occurred in tested populations of Annang (38.3%), Ibibio (45.6%), Efik (45%), and Igbo (54.3%) living in Nigeria, West Africa.[85] E-M191/P86 appears in varying frequencies in Central and Southern Africa but almost all are also positive for P252/U174. Bantu-speaking South Africans (89/343) tested 25.9% positive and Khoe-San speaking South Africans tested 7.7% (14/183) positive for this SNP.[86] It also appears commonly in Africans living in the Americas. A population in Rio de Janeiro, Brazil tested 9.2% (12/130) positive.[83] 34.9% (29/83) of African American men tested positive for M191.[75]
Veeramah et al. (2010) studies of the recombining portions of M191 positive Y chromosomes suggest that this lineage has "diffusely spread with multiple high frequency haplotypes implying a longer evolutionary period since this haplogroup arose".[85] The subclade E1b1a1a1f1a appears to express opposite clinal distributions to E1b1a1* in the West African Savanna region. Haplogroup E1b1a1a1f1a (E-M191) has a frequency of 23% in Cameroon (where it represents 42% of haplotypes carrying the DYS271 mutation or E-M2), 13% in Burkina Faso (16% of haplotypes carrying the M2/DYS271 mutation) and only 1% in Senegal.[48] Similarly, while E1b1a reaches its highest frequency of 81% in Senegal, only 1 of the 139 Senegalese that were tested showed M191/P86.[48] In other words, as one moves to West Africa from western Central Africa, the less subclade E1b1a1f is found. "A possible explanation might be that haplotype 24 chromosomes [E-M2*] were already present across the Sudanese belt when the M191 mutation, which defines haplotype 22, arose in central western Africa. Only then would a later demic expansion have brought haplotype 22 chromosomes from central western to western Africa, giving rise to the opposite clinal distributions of haplotypes 22 and 24."[46]
E1b1a1a1f1a1 (YCC E1b1a7a) is defined by P252/U174. It appears to be the most common subclade of E-L485. It is believed to have originated near western Central Africa. It is rarely found in the most western portions of West Africa. Montano et al. (2011) found this subclade very prevalent in Nigeria and Gabon.[9] Filippo et al. (2011) estimated a tMRCA of ~4.2 kya from sample of Yoruba population positive for the SNP.[84]
E1b1a1a1f1a1b (YCC E1b1a7a2) is defined by P115. This subclade has only been observed amongst Fang people of Central Africa.[9]
E1b1a1a1f1a1c (YCC E1b1a7a3) is defined by P116. Montano et al. (2011) observed this SNP only in Gabon and a Bassa population from Cameroon.[9]
E1b1a1a1f1a1d is defined by Z1704. This subclade has been observed across Africa. The 1000 Genomes Project Consortium found this SNP in Yoruba Nigerian, three Kenyan Luhyas and one African descent Puerto Rican.[87]
E1b1a1a1f1b is defined by markers L515, L516, L517, and M263.2. This subclade was found by the researchers of Y-Chromosome Genome Comparison Project using data from the commercial bioinformatics company 23andMe.[88]
E1b1a1a1g
E1b1a1a1g (YCC E1b1a8) is defined by marker U175. The basal E-U175* is extremely rare. Montano et al. (2011) only found one out of 505 tested African subjects who was U175 positive but negative for U209.[9] Brucato et al. found similarly low frequencies of basal E-U175* in subjects in the Ivory Coast and Benin. Veeramah et al. (2010) found U175 in tested Annang (45.3%), Ibibio (37%), Efik (33.3%), and Igbo (25.3%) but did not test for U209.[85]
The supposed "Bantu haplotype" found in E-U175 carriers is "present at appreciable frequencies in other Niger–Congo languages speaking peoples as far west as Guinea-Bissau".[85] This is the modal haplotype of STR markers that is common in carriers of E-U175.[e]
E-U175 haplotype
DYS19
DYS388
DYS390
DYS391
DYS392
DYS393
15
12
21
10
11
13
E1b1a1a1g has several subclades.
E1b1a1a1g1 (YCC E1b1a8a) is defined by U209. It is the most prominent subclade of U175. This subclade has very high frequencies of over fifty percentages in Cameroonian populations of Bassa and Bakaka, possibly indicating place of origin. However, E-U209 is widely found at lower frequencies in West and Central African countries surrounding Cameroon and Gabon.[9] Brucato et al. (2010) found the SNP in a populations of Ahizi (in Ivory Coast) 38.8% (19/49), Yacouba (Ivory Coast) 27.5% (11/40), and Beninese 6.5% (5/77) respectively.[89]
E1b1a1a1g1a (YCC E1b1a8a1) is defined by U290. The Montano et al. (2011) study of U290 showed a lower frequency in Nigeria (11.7%) and western Central Africa than basal node U209. The highest population frequency rate in that study was 57.7% (15/26) in Ewondo in Cameroon.[9] 32.5% (27/83) of African American men tested by Sims et al. (2007) were positive for this SNP.[75]
E1b1a1a1g1a2 is defined by Z1725. This marker has been observed by The 1000 Genomes Project Consortium in Yoruba Nigerians and Luhya Kenyans.[87]
E1b1a1a1g1c (YCC E1b1a4) is defined by M154. A Bamilike population tested 31.3% (15/48) for the marker. Bakaka speakers from Cameroon tested 8%.[46] An Ovimbundu test population found this SNP at 14% (14/100).[90] Members of this subclade have also been found in South Africa.[91][86]
E1b1a1a1g1d is defined by V39. Trombetta et al. first published this SNP in 2011 but gave little population data about it.[5] It is only known to have been found in an African population.
E1b1a1a1h
E1b1a1a1h is defined by markers P268 and P269. It was first reported in a person from the Gambia.[92]
Prior to 2002, there were in academic literature at least seven naming systems for the Y-Chromosome Phylogenetic tree. This led to considerable confusion. In 2002, the major research groups came together and formed the Y-Chromosome Consortium (YCC). They published a joint paper that created a single new tree that all agreed to use. Later, a group of citizen scientists with an interest in population genetics and genetic genealogy formed a working group to create an amateur tree aiming at being above all timely. The table below brings together all of these works at the point of the landmark 2002 YCC Tree. This allows a researcher reviewing older published literature to quickly move between nomenclatures.
This phylogenetic tree of haplogroup subclades is based on the Y-Chromosome Consortium (YCC) 2008 Tree,[92] the ISOGG Y-DNA Haplogroup E Tree,[7] and subsequent published research.
^Van Oven M, Van Geystelen A, Kayser M, Decorte R, Larmuseau HD (2014). "Seeing the wood for the trees: a minimal reference phylogeny for the human Y chromosome". Human Mutation. 35 (2): 187–91. doi:10.1002/humu.22468. PMID24166809. S2CID23291764.
^K-M2313*, which as yet has no phylogenetic name, has been documented in two living individuals, who have ethnic ties to India and South East Asia. In addition, K-Y28299, which appears to be a primary branch of K-M2313, has been found in three living individuals from India. See: Poznik op. cit.; YFull YTree v5.08, 2017, "K-M2335", and; PhyloTree, 2017, "Details of the Y-SNP markers included in the minimal Y tree" (Access date of these pages: 9 December 2017)
^ Haplogroup S, as of 2017, is also known as K2b1a. (Previously the name Haplogroup S was assigned to K2b1a4.)
^ Haplogroup M, as of 2017, is also known as K2b1b. (Previously the name Haplogroup M was assigned to K2b1d.)
^E-M2 is approximately 7.7–7.9% of total US male population.
^The publication transposes M116.2 with M116.1 in Table 1.
^The YCAII STR marker value of 19–19 is also usually indicative of U175.
^DYS271/M2/SY81, P1/PN1, P189, P293, and M291 appear to form E1b1a1*. L576 forms a subclade immediately after the previously mentioned SNPs. L576 gave rise to a deeper subclade of M180/P88, P182, L88.3, L86, and PAGES0006. From this subclade, all the major subclades (i.e. E-U175 and E-L485) of E1b1a evolved. The exact position of V43 and V95 within these three subclades and E1b1a1a1b (M116.2), E1b1a1a1c (M149), and E1b1a1a1d (M155)
remains uncertain.
^ abcdefghMontano V, Ferri G, Marcari V, Batini C, Anyaele O, Destro-Bisol G, Comas D (July 2011). "The Bantu expansion revisited: a new analysis of Y chromosome variation in Central Western Africa". Molecular Ecology. 20 (13): 2693–708. doi:10.1111/j.1365-294X.2011.05130.x. PMID21627702. S2CID9951365.
^Gonçalves R, Rosa A, Freitas A, Fernandes A, Kivisild T, Villems R, Brehm A (November 2003). "Y-chromosome lineages in Cabo Verde Islands witness the diverse geographic origin of its first male settlers". Human Genetics. 113 (6): 467–72. doi:10.1007/s00439-003-1007-4. hdl:10400.13/3047. PMID12942365. S2CID63381583.
^Iacovacci G, D'Atanasio E, Marini O, Coppa A, Sellitto D, Trombetta B, Berti A, Cruciani F (March 2017). "Forensic data and microvariant sequence characterization of 27 Y-STR loci analyzed in four Eastern African countries". Forensic Science International. Genetics. 27: 123–131. doi:10.1016/j.fsigen.2016.12.015. PMID28068531.
^Ottoni C, Larmuseau MH, Vanderheyden N, Martínez-Labarga C, Primativo G, Biondi G, Decorte R, Rickards O (May 2011). "Deep into the roots of the Libyan Tuareg: a genetic survey of their paternal heritage". American Journal of Physical Anthropology. 145 (1): 118–24. doi:10.1002/ajpa.21473. PMID21312181.
^Robino C, Crobu F, Di Gaetano C, Bekada A, Benhamamouch S, Cerutti N, Piazza A, Inturri S, Torre C (May 2008). "Analysis of Y-chromosomal SNP haplogroups and STR haplotypes in an Algerian population sample". International Journal of Legal Medicine. 122 (3): 251–5. doi:10.1007/s00414-007-0203-5. PMID17909833. S2CID11556974.
^Hassan HY, Underhill PA, Cavalli-Sforza LL, Ibrahim ME (November 2008). "Y-chromosome variation among Sudanese: restricted gene flow, concordance with language, geography, and history". American Journal of Physical Anthropology. 137 (3): 316–23. doi:10.1002/ajpa.20876. PMID18618658.
^Mršić G, Gršković B, Vrdoljak A, Popović M, Valpotić I, Anđelinović Š, Stenzl V, Ehler E, Urban L, Lacković G, Underhill P, Primorac D (July 2012). "Croatian national reference Y-STR haplotype database". Molecular Biology Reports. 39 (7). Branka Grskovic, Andro Vrdoljak, Maja Popovic, Ivica Valpotic, Simun Andelinovic, Vlastimil Stenzl, Edvard Ehler, Ludvik Urban, Gordana Lackovic, Peter Underhill, Dragan Primorac: 7727–41. doi:10.1007/s11033-012-1610-3. PMID22391654. S2CID18011987.
^Regueiro M, Cadenas AM, Gayden T, Underhill PA, Herrera RJ (2006). "Iran: tricontinental nexus for Y-chromosome driven migration". Human Heredity. 61 (3): 132–43. doi:10.1159/000093774. PMID16770078. S2CID7017701.
^Al-Zahery N, Semino O, Benuzzi G, Magri C, Passarino G, Torroni A, Santachiara-Benerecetti AS (September 2003). "Y-chromosome and mtDNA polymorphisms in Iraq, a crossroad of the early human dispersal and of post-Neolithic migrations". Molecular Phylogenetics and Evolution. 28 (3): 458–72. doi:10.1016/S1055-7903(03)00039-3. PMID12927131.
^Nuñez C, Baeta M, Sosa C, Casalod Y, Ge J, Budowle B, Martínez-Jarreta B (December 2010). "Reconstructing the population history of Nicaragua by means of mtDNA, Y-chromosome STRs, and autosomal STR markers". American Journal of Physical Anthropology. 143 (4): 591–600. doi:10.1002/ajpa.21355. PMID20721944.
^Rojas W, Parra MV, Campo O, Caro MA, Lopera JG, Arias W, Duque C, Naranjo A, García J, Vergara C, Lopera J, Hernandez E, Valencia A, Caicedo Y, Cuartas M, Gutiérrez J, López S, Ruiz-Linares A, Bedoya G (September 2010). "Genetic make up and structure of Colombian populations by means of uniparental and biparental DNA markers". American Journal of Physical Anthropology. 143 (1): 13–20. doi:10.1002/ajpa.21270. PMID20734436.
^de Azevedo DA, da Silva LA, Gusmão L, de Carvalho EF (December 2009). "Analysis of Y chromosome SNPs in Alagoas, Northeastern Brazil". Forensic Science International: Genetics Supplement Series. 2 (1): 421–422. doi:10.1016/j.fsigss.2009.08.166.
^Nascimento E, Cerqueira E, Azevedo E, Freitas V, Azevedo D (December 2009). "The Africa male lineages of Bahia's people—Northeast Brazil: A preliminary SNPs study". Forensic Science International: Genetics Supplement Series. 2 (1): 349–350. doi:10.1016/j.fsigss.2009.07.010.
^Tanya M Simms 2011, The Peopling of the Bahamas: A Phylogeographical
Perspective pg. 194
^ abHünemeier T, Carvalho C, Marrero AR, Salzano FM, Pena SD, Bortolini MC (June 2007). "Niger-Congo speaking populations and the formation of the Brazilian gene pool: mtDNA and Y-chromosome data". American Journal of Physical Anthropology. 133 (2): 854–67. doi:10.1002/ajpa.20604. PMID17427922.
^Brito P, Carvalho M, Gomes V, Melo MM, Bogas V, Balsa F, et al. (December 2011). "Y-SNP analysis in an Angola population". Forensic Science International: Genetics Supplement Series. 3 (1): e369–e370. doi:10.1016/j.fsigss.2011.09.046.
^Underhill PA, Shen P, Lin AA, Jin L, Passarino G, Yang WH, Kauffman E, Bonné-Tamir B, Bertranpetit J, Francalacci P, Ibrahim M, Jenkins T, Kidd JR, Mehdi SQ, Seielstad MT, Wells RS, Piazza A, Davis RW, Feldman MW, Cavalli-Sforza LL, Oefner PJ (November 2000). "Y chromosome sequence variation and the history of human populations". Nature Genetics. 26 (3): 358–61. doi:10.1038/81685. PMID11062480. S2CID12893406.
Underhill, Peter A.; Shen, Peidong; Lin, Alice A.; Jin, Li; et al. (November 2000). "Y chromosome sequence variation and the history of human populations". Nature Genetics. 26 (3): 358–361. doi:10.1038/81685. PMID11062480. S2CID12893406.