Girish Mahajan (Editor)

Haplogroup M (mtDNA)

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Ancestor
  
L3

Haplogroup M (mtDNA)

Possible time of origin
  
60,000 years before present

Possible place of origin
  
Descendants
  
M1, M2, M3, M4'45, M5, M6, M7, M8, M9, M10'42, M12'G, M13, M14, M15, M21, M27, M28, M29'Q, M31'32, M33, M34, M35, M36, M39, M40, M41, M44, M46, M47'50, M48, M49, M51, D

Defining mutations
  
263, 489, 10400, 14783, 15043

In human mitochondrial genetics, Haplogroup M is a human mitochondrial DNA (mtDNA) haplogroup. An enormous haplogroup spanning all the continents, the macro-haplogroup M, like its sibling N, is a descendant of haplogroup L3.

Contents

All mtDNA haplogroups considered native outside of Africa are descendants of either haplogroup M or its sibling haplogroup N. The geographical distributions of M and N are associated with discussions concerning out of Africa migrations and the subsequent colonization of the rest of the world. In particular, it is often taken to indicate that it is very likely that there was one particularly major prehistoric migration of humans out of Africa, and that both M and N were part of this colonization process.

Origins

There is a debate concerning geographical origins of Haplogroup M and its sibling haplogroup N. Both lineages are thought to have been the main surviving lineages involved in the out of Africa migration (or migrations) because all indigenous lineages found outside Africa belong to haplogroup M or haplogroup N. Yet to be conclusively determined is whether the mutations that define haplogroups M and N occurred in Africa before the exit from Africa or in Asia after the exit from Africa. Determining the origins of haplogroup M is further complicated by the fact that it is found in Africa and outside of Africa.

It is generally accepted that haplogroup M evolved shortly after the emergence of its parent clade haplogroup L3. Apart from haplogroup M and its sibling haplogroup N, the numerous other subclades of L3 are largely restricted to Africa, which suggests that L3 arose in Africa.

Haplogroup M1

Much of discussion concerning the origins of haplogroup M has been related to its subclade haplogroup M1, which is the only variant of macrohaplogroup M found in Africa. Two possibilities were being considered as potential explanations for the presence of M1 in Africa:

  1. M was present in the ancient population which later gave rise to both M1 in Africa, and M more generally found in Eurasia.
  2. The presence of M1 in Africa is the result of a back-migration from Asia which occurred sometime after the Out of Africa migration 40,000 years ago.

Haplogroup M23

In 2009, two independent publications reported a rare, deep-rooted subclade of haplogroup M, referred to as M23, that is present in Madagascar. The contemporary populations of Madagascar were formed in the last 2,000 years by the admixture of African and Southeast Asian (Austronesian) populations. M23 seems to be restricted to Madagascar as it was not detected anywhere else. M23 could have been brought to Madagascar from Asia where most deep rooted subclades of Haplogroup M are found.

Asian origin hypothesis

According to this theory, anatomically modern humans carrying ancestral haplogroup L3 lineages were involved in the Out of Africa migration from East Africa into Asia. Somewhere in Asia, the ancestral L3 lineages gave rise to haplogroups M and N. The ancestral L3 lineages were then lost by genetic drift as they are infrequent outside Africa. The hypothesis of Asia as the place of origin of macrohaplogroup M is supported by the following:

  1. The highest frequencies worldwide of macrohaplogroup M are observed in Asia, specifically in Bangladesh, China, India, Japan, Nepal, and Tibet, where frequencies range from 60%-80%. The total frequency of M subclades is even higher in some populations of Siberia or the Americas, but these small populations tend to exhibit strong genetic drift effects, and often their geographical neighbors exhibit very different frequencies.
  2. Deep time depth >50,000 years of western, central, southern and eastern Indian haplogroups M2, M38, M54, M58, M33, M6, M61, M62 and the distribution of macrohaplogroup M, do not rule out the possibility of macrohaplogroup M arising in Indian population.
  3. With the exception of the African specific M1, India has several M lineages that emerged directly from the root of haplogroup M.
  4. Only two subclades of haplogroup M, M1 and M23, are found in Africa, whereas numerous subclades are found outside Africa (with some discussion possible only about sub-clade M1, concerning which see below).
  5. Specifically concerning M1

African origin hypothesis

According to this theory, haplogroups M and N arose from L3 in an East African population that had been isolated from other African populations. Members of this population were involved in the out Africa migration and only carried M and N lineages. With the possible exception of haplogroup M1, all other M and N clades in Africa were lost by genetic drift.

The African origin of Haplogroup M is supported by the following arguments and evidence.

  1. L3, the parent clade of haplogroup M, is found throughout Africa, but is rare outside Africa. According to Toomas Kivisild (2003), "the lack of L3 lineages other than M and N in India and among non-African mitochondria in general suggests that the earliest migration(s) of modern humans already carried these two mtDNA ancestors, via a departure route over the Horn of Africa."
  2. Ancestral L3 lineages that gave rise to M and N have not been discovered outside Africa.
  3. Specifically concerning at least M1:

Dispersal

A number of studies have proposed that the ancestors of modern haplogroup M dispersed from Africa through the southern route across the Horn of Africa along the coastal regions of Asia onwards to New Guinea and Australia. These studies suggested that the migrations of haplogroups M and N occurred separately with haplogroup N heading northwards from East Africa to the Levant. However, the results of numerous recent studies indicate that there was only one migration out of Africa and that haplogroups M and N were part of the same migration. This is based on the analysis of a number of relict populations along the proposed beachcombing route from Africa to Australia, all of which possessed both haplogroups N and M.

A 2008 study by Abu-Amero et al., suggests that the Arabian Peninsula may have been the main route out of Africa. However, as the region lacks of autochthonous clades of haplogroups M and N the authors suggest that the area has been a more recent receptor of human migrations than an ancient demographic expansion center along the southern coastal route as proposed under the single migration Out-of-Africa scenario of the African origin hypothesis.

Distribution

M is the single most common mtDNA haplogroup in Asia, and peaks in Japan and Tibet, where it represents on average about 70% of the maternal lineages (160/216 = 74% Tibet, 205/282 = 73% Tōkai, 231/326 = 71% Okinawa, 148/211 = 70% Japanese, 50/72 = 69% Tibet, 150/217 = 69% Hokkaidō, 24/35 = 69% Zhongdian Tibetan, 175/256 = 68% northern Kyūshū, 38/56 = 68% Qinghai Tibetan, 16/24 = 67% Diqing Tibetan, 66/100 = 66% Miyazaki, 33/51 = 65% Ainu, 214/336 = 64% Tōhoku, 75/118 = 64% Tokyo (JPT)) and is ubiquitous in India and South Korea, where it has approximately 60% frequency. Among Chinese people both inside and outside of China, haplogroup M accounts for approximately 50% of all mtDNA on average, but the frequency varies from approximately 40% in Hans from Hunan and Fujian in southern China to approximately 60% in Shenyang, Liaoning in northeastern China.

Haplogroup M accounts for approximately 42% of all mtDNA in Filipinos, among whom it is represented mainly by M7c3c and E. In Vietnam, haplogroup M has been found in 37% (52/139) to 48% (20/42) of samples of Vietnamese and in 32% (54/168) of a sample of Chams from Bình Thuận Province. Haplogroup M accounts for 43% (92/214) of all mtDNA in a sample of Laotians, with its subclade M7 (M7b, M7c, and M7e) alone accounting for a full third of all haplogroup M, or 14.5% (31/214) of the total sample.

In Oceania, Haplogroup M has been found in 35% (17/48) of a sample of Papua New Guinea highlanders from the Bundi area and in 28% (9/32) of a sample of Aboriginal Australians from Kalumburu in northwestern Australia. A study published in 2008 found Haplogroup M in 42% (60/144) of a pool of samples from nine language groups in the Admiralty Islands of Papua New Guinea, of which 50 belonged to typically Near Oceanian subclades (Q1, Q2) and 10 belonged to typically East or Southeast Asian subclades (M7b, M7c1c, E1b). In a study published in 2015, Haplogroup M was found in 21% (18/86) of a sample of Fijians and in 0% (0/21) of a sample of Rotumans.

Haplogroup M is also relatively common in Northeast Africa, occurring especially among Somalis, Libyans and Oromos at frequencies over 20%. Toward the northwest, the lineage is found at comparable frequencies among the Tuareg in Mali and Burkina Faso, particularly the M1a2 subclade (18.42%).

Due to its great age, haplogroup M is an mtDNA lineage which does not correspond well to present-day ethnic groups. It is found among Siberian, Native American, East Asian, Southeast Asian, Central Asian, South Asian, Melanesian, European, Northeast African, and various Middle Eastern populations at varying frequencies.

Among the descendant lineages of haplogroup M are C, D, E, G, Q, and Z. Z and G are found in North Eurasian populations, C and D exists among North Eurasian and Native American populations, E is observed in Southeast Asian populations, and Q is common among Melanesian populations. The lineages M2, M3, M4, M5, M6, M18 and M25 are exclusive to South Asia, with M2 reported to be the oldest lineage on the Indian sub-continent.

In 2013, four ancient specimens dated to around 2,500 BC-500 AD, which were excavated from the Tell Ashara (Terqa) and Tell Masaikh (Kar-Assurnasirpal) archaeological sites in the Euphrates valley, were found to belong to mtDNA haplotypes associated with the M4b1, M49 and/or M61 haplogroups. Since these clades are not found among the current inhabitants of the area, they are believed to have been brought at a more remote period from east of Mesopotamia; possibly by either merchants or the founders of the ancient Terqa population.

In 2016, three Late Pleistocene European hunter-gatherers were also found to carry M lineages. Two of the specimens were from the Goyet archaeological site in Belgium and were dated to 34,000 and 35,000 years ago, respectively. The other ancient individual hailed from the La Rochette site in France and was dated to 28,000 years ago.

Subgroups distribution

  • Haplogroup M1'20'51 (14110)
  • Haplogroup M1 [1] - found in the Nile Valley, Horn of Africa, Maghreb, Sahara, Mediterranean, and Middle East
  • M20 - in China.
  • M51 - in Cambodia.
  • Haplogroup M2 [2] - found in South Asia, with highest concentrations in SE India and Bangladesh; oldest haplogroup M lineage on the Indian sub-continent. Also found with low frequency in southwestern China.
  • M2a - most common in Bangladesh
  • M2b - most common in SE India
  • Haplogroup M3 [3] - found mainly in South Asia, with highest concentrations in west and NW India
  • M4'30
  • Haplogroup M4 [4] - found mainly in South Asia but some sequences in Eastern Saudi Arabia
  • Haplogroup M4a - found in Gujarat, India
  • Haplogroup M4b - found among ancient specimens in the Euphrates valley
  • Haplogroup M30 - mainly in India, found in Middle East and North Africa.
  • Haplogroup M18'38
  • Haplogroup M18 - found among Tharus in southern Nepal and tribal people in Andhra Pradesh
  • Haplogroup M38 - found with high frequency among Tharus from Morang District of southeastern Nepal and as singletons among Tharus from Chitwan District of south-central Nepal and Hindus from New Delhi
  • Haplogroup M37
  • Haplogroup M37a - found in Gujarat, India
  • Haplogroup M5 [5] - found in South Asia
  • Haplogroup M5a - found in Orissa, India
  • Haplogroup M6 [6] - found mainly in South Asia, with highest concentrations in mid-eastern India and Kashmir
  • Haplogroup M6b - found in Kerala, India
  • Haplogroup M61 - found among ancient specimens in the Euphrates valley
  • Haplogroup M7 [7] - found in East Asia, especially in Japan, southern China, Vietnam, and Laos
  • Haplogroup M8
  • Haplogroup M8a: [8] - found in East Asia and Central Asia with low frequency
  • Haplogroup M8a2 - frequently found in indigenous peoples of Kamchatka (Koryaks, Itelmens), and with lower frequency among Chukchis, Koreans, Altayans, Mongolians, Khakassians, and Tuvans
  • Haplogroup CZ
  • Haplogroup C [9] - found especially in Siberia
  • Haplogroup C1 [10] - found in Asia and America (Native Americans and Hispanics in particular)
  • Haplogroup C4
  • Haplogroup C7 [11] - found in China and Ukraine. [12]
  • Haplogroup Z [13] - found in Northeast Europe, Central Asia and East Asia, including among Finns, Hazara, Japanese, Koreans, Chinese, Russians, and Sami
  • Haplogroup M9 [14] - found in East Asia and Central Asia, especially in Tibet
  • Haplogroup E - a subclade of M9 - found especially in Taiwan (aborigines), Maritime Southeast Asia, and the Mariana Islands
  • Haplogroup M10 [15] - small clade found in East Asia, Southeast Asia, Bangladesh, Central Asia, southern Siberia, and Belarus
  • Haplogroup M11 [16] - small clade found especially among the Chinese and also in some Koreans, Tibetans, and Bangladeshis
  • Haplogroup M12'G
  • Haplogroup M12 [17] - small clade found especially among the aborigines of Hainan Island as well as in other populations of China, Japan, Korea, Pashtuns, Tibet, and Vietnam
  • Haplogroup G [18] - found especially in Japan, Mongolia, and Tibet and in indigenous peoples of Kamchatka (Koryaks, Alyutors, Itelmens), with some isolated instances in diverse places of Asia
  • Haplogroup M13 - small clade found among Tibetans in Tibet, Oirat Mongols in Xinjiang, Barghuts in Hulunbuir, and Yakuts and Dolgans in central Siberia
  • Haplogroup M14 - found in Tibet
  • Haplogroup M15 - found in Tibet
  • Haplogroup M17 - found in Luzon and in the Chams of southern Vietnam
  • Haplogroup M19 - found in the Batak people of Palawan
  • Haplogroup M21 [19] - small clade found in SE Asia and Bangladesh
  • Haplogroup M23'75
  • M23 - found in Madagascar
  • M75 - found in China
  • Haplogroup M24 - found in Palawan
  • Haplogroup M27 [20] - found in Melanesia
  • Haplogroup M28 [21] - found in Melanesia and in a single Han individual from Taiwan
  • Haplogroup M29'Q
  • Haplogroup M29 [22] - found in Melanesia
  • Haplogroup Q [23] - found in Melanesia and Australia (Aborigines)
  • Haplogroup M31 [24] - found among the Onge, in the Andaman Islands
  • Haplogroup M32 [25] - found in Andaman Islands
  • Haplogroup M33 [26] - small clade found in South Asia, Belarus, southern China, and in two Han Chinese living in Southern California
  • Haplogroup M33a - found in Gujarat, India
  • Haplogroup M34 [27] - small clade found in South Asia
  • Haplogroup M34a - found in Karnataka, India
  • Haplogroup M35 [28] - small clade found in South Asia
  • Haplogroup M35a - found in India
  • Haplogroup M35b - found in Karnataka, India and Nepal. Found in Slovakia.
  • Haplogroup M39 [29] - found in South Asia
  • Haplogroup M40 [30] - found in South Asia
  • Haplogroup M41 - found in South Asia
  • Haplogroup M41b - found in Andhra Pradesh, India
  • Haplogroup M41c - found in Andhra Pradesh, India
  • Haplogroup M42 [31] - found among Australian Aborigines
  • Haplogroup M48 [32] - rare clade found in Saudi Arabia
  • Haplogroup M49 - found among ancient specimens in the Euphrates valley
  • Haplogroup M80 - found in Palawan
  • Haplogroup D - found in Andaman Jawara tribe, Eastern Eurasia, Native Americans, Central Asia and occasionally also in West Asia and Europe.
  • Tree

    This phylogenetic tree of haplogroup M subclades is based on the paper by Mannis van Oven and Manfred Kayser Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation and subsequent published research.

  • M
  • M1
  • M1a
  • M1a1
  • M1a1a
  • M1a1b
  • M1a1b1
  • M1a1c
  • M1a1d
  • M1a1e
  • M1a1f
  • M1a2
  • M1a2a
  • M1a2b
  • M1a3
  • M1a3a
  • M1a3b
  • M1a4
  • M1a5
  • M1b
  • M1b1
  • M1b1a
  • M1b2
  • M1b2a
  • M2
  • M2a
  • M2a1
  • M2a2
  • M2a3
  • M2b
  • M2b1
  • M2b2
  • M3
  • M3a
  • M4"45
  • M4
  • M4a
  • M4b
  • M4b1
  • M18'38
  • M18
  • M38
  • M30
  • M30a
  • M30b
  • M30c
  • M30c1
  • M30c1a
  • M30c1a1
  • M30d
  • M37
  • M37a
  • M43
  • M45
  • M5
  • M5a
  • M5a1
  • M5a1a
  • M5a1b
  • M5a2
  • M5a2a
  • M6
  • M7
  • M7a
  • M7a1
  • M7a1a
  • M7a1a1
  • M7a1a1a
  • M7a1a2
  • M7a1a3
  • M7a1a4
  • M7a1a4a
  • M7a1a5
  • M7a1a6
  • M7a1a7
  • M7a1b
  • M7a2
  • M7a2a
  • M7a2b
  • M7b'c'd'e
  • M7b'd
  • M7b
  • M7b1'2
  • M7b1
  • M7b2
  • M7b2a
  • M7b2b
  • M7b2c
  • M7b3
  • M7b3a
  • M7d
  • M7c'e
  • M7c
  • M7c1
  • M7c1a
  • M7c1b
  • M7c1b1
  • M7c2
  • M7c2a
  • M7c3
  • M7c3a
  • M7c3b
  • M7c3c
  • M7e
  • M8
  • M8a
  • M8a1
  • M8a2
  • M8a2a
  • M8a2b
  • CZ
  • C
  • Z
  • M9
  • M9a'b'c'd
  • M9a'c'd
  • M9a'd
  • M9a
  • M9a1
  • M9a2
  • M9a3
  • M9d
  • M9c
  • M9b
  • E
  • M10'42
  • M10
  • M10a
  • M10a1
  • M10a2
  • M42
  • M42a
  • M11
  • M11a
  • M11b
  • M12'G
  • M12
  • M12a
  • G
  • M13
  • M13a
  • M13a1
  • M14
  • M15
  • M21
  • M21a'b
  • M21a
  • M21b
  • M21c'd
  • M21c
  • M21d
  • M22
  • M23
  • M25
  • M27
  • M27a
  • M27b
  • M27c
  • M28
  • M28a
  • M28b
  • M29'Q
  • M29
  • M29a
  • M29b
  • Q
  • M31'32
  • M31
  • M31a
  • M31a1
  • M31a1a
  • M31a1b
  • M31a2
  • M31a2a
  • M31b
  • M31c
  • M32
  • M32a
  • M33
  • M33a
  • M33b
  • M33c
  • M34
  • M34a
  • M35
  • M35a
  • M35b
  • M36
  • M36a
  • M39
  • M39a
  • M40
  • M40a
  • M41
  • M44'52
  • M44
  • M52
  • M46
  • M47'50
  • M47
  • M50
  • M48
  • M49
  • M51
  • D
  • References

    Haplogroup M (mtDNA) Wikipedia