Independent Origins of Indian Caste and Tribal Paternal Lineages
The origins of the nearly one billion people inhabiting the Indian subcontinent and following the customs of the Hindu caste system are controversial: are they largely derived from Indian local populations (i.e. tribal groups) or from recent immigrants to India? Archaeological and linguistic evidence support the latter hypothesis, whereas recent genetic data seem to favor the former hypothesis. Here, we analyze the most extensive dataset of Indian caste and tribal Y chromosomes to date. We find that caste and tribal groups differ significantly in their haplogroup frequency distributions; caste groups are homogeneous for Y chromosome variation and more closely related to each other and to central Asian groups than to Indian tribal or any other Eurasian groups. We conclude that paternal lineages of Indian caste groups are primarily descended from Indo-European speakers who migrated from central Asia ~3,500 years ago. Conversely, paternal lineages of tribal groups are predominantly derived from the original Indian gene pool. We also provide evidence for bidirectional male gene flow between caste and tribal groups. In comparison, caste and tribal groups are homogeneous with respect to mitochondrial DNA variation, which may reflect the sociocultural characteristics of the Indian caste society.
PDF file
The Genetics of Language and Farming Spread in India
PDF file
The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations
Two tribal groups from southern India—the Chenchus and Koyas—were analyzed for variation in mitochondrial DNA (mtDNA), the Y chromosome, and one autosomal locus and were compared with six caste groups from different parts of India, as well as with western and central Asians. In mtDNA phylogenetic analyses, the Chenchus and Koyas coalesce at Indian-specific branches of haplogroups M and N that cover populations of different social rank from all over the subcontinent. Coalescence times suggest early late Pleistocene settlement of southern Asia and suggest that there has not been total replacement of these settlers by later migrations. H, L, and R2 are the major Indian Y-chromosomal haplogroups that occur both in castes and in tribal populations and are rarely found outside the subcontinent. Haplogroup R1a, previously associated with the putative Indo-Aryan invasion, was found at its highest frequency in Punjab but also at a relatively high frequency (26%) in the Chenchu tribe. This finding, together with the higher R1a-associated short tandem repeat diversity in India and Iran compared with Europe and central Asia, suggests that southern and western Asia might be the source of this haplogroup. Haplotype frequencies of the MX1 locus of chromosome 21 distinguish Koyas and Chenchus, along with Indian caste groups, from European and eastern Asian populations. Taken together, these results show that Indian tribal and caste populations derive largely from the same genetic heritage of Pleistocene southern and western Asians and have received limited gene flow from external regions since the Holocene. The phylogeography of the primal mtDNA and Y-chromosome founders suggests that these southern Asian Pleistocene coastal settlers from Africa would have provided the inocula for the subsequent differentiation of the distinctive eastern and western Eurasian gene pools.
PDF file
Negligible Male Gene Flow Across Ethnic Boundaries in India, Revealed by Analysis of Y-Chromosomal DNA Polymorphisms
From the historically prevalent social structure of Indian populations it may be predicted that there has been very little male gene flow across ethnic boundaries. To test this finding, we have analyzed DNA samples of individuals belonging to 10 ethnic groups, speaking Indo-European or Austroasiatic languages and inhabiting the eastern and northern regions of India. Eight Y-chromosomal markers, two biallelic and six microsatellite, were studied. All populations were monomorphic for the deletion allele at the YAP (DYS287) locus and for the 119-bp allele at the DYS288 locus. Y-chromosomal haplotypes were constructed on the basis of one RFLP locus and five microsatellite loci. The haplotype distribution among the groups showed that different ethnic groups harbor nearly disjoint sets of haplotypes. This indicates that there has been virtually no male gene flow among ethnic groups. Analysis of molecular variance revealed that there was significant haplotypic variation between castes and tribes, but nonsignificant variation among ranked caste clusters. Haplotypic variation attributable to differences in geographical regions of habitat was also nonsignificant.
PDF file
High-resolution analysis of Y-chromosomal polymorphisms reveals signatures of population movements from Central Asia and West Asia into India
Linguistic evidence suggests that West Asia and Central Asia have been the two major geographical sources of genes in the contemporary Indian gene pool. To test the nature and extent of similarities in the gene pools of these regions we have collected DNA samples from four ethnic populations of northern India, and have screened these samples for a set of 18 Y-chromosome polymorphic markers (12 unique event polymorphisms and six short tandem repeats). These data from Indian populations have been analysed in conjunction with published data from several West Asian and Central Asian populations. Our analyses have revealed traces of population movement from Central Asia and West Asia into India. Two haplogroups, HG-3 and HG-9, which are known to have arisen in the Central Asian region, are found in reasonably high frequencies (41.7% and 14.3% respectively) in the study populations. The ages estimated for these two haplogroups are less in the Indian populations than those estimated from data on Middle Eastern populations. A neighbour-joining tree based on Y-haplogroup frequencies shows that the North Indians are genetically placed between the West Asian and Central Asian populations. This is consistent with gene flow from West Asia and Central Asia into India.
PDF file
The origins of the nearly one billion people inhabiting the Indian subcontinent and following the customs of the Hindu caste system are controversial: are they largely derived from Indian local populations (i.e. tribal groups) or from recent immigrants to India? Archaeological and linguistic evidence support the latter hypothesis, whereas recent genetic data seem to favor the former hypothesis. Here, we analyze the most extensive dataset of Indian caste and tribal Y chromosomes to date. We find that caste and tribal groups differ significantly in their haplogroup frequency distributions; caste groups are homogeneous for Y chromosome variation and more closely related to each other and to central Asian groups than to Indian tribal or any other Eurasian groups. We conclude that paternal lineages of Indian caste groups are primarily descended from Indo-European speakers who migrated from central Asia ~3,500 years ago. Conversely, paternal lineages of tribal groups are predominantly derived from the original Indian gene pool. We also provide evidence for bidirectional male gene flow between caste and tribal groups. In comparison, caste and tribal groups are homogeneous with respect to mitochondrial DNA variation, which may reflect the sociocultural characteristics of the Indian caste society.
PDF file
The Genetics of Language and Farming Spread in India
PDF file
The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations
Two tribal groups from southern India—the Chenchus and Koyas—were analyzed for variation in mitochondrial DNA (mtDNA), the Y chromosome, and one autosomal locus and were compared with six caste groups from different parts of India, as well as with western and central Asians. In mtDNA phylogenetic analyses, the Chenchus and Koyas coalesce at Indian-specific branches of haplogroups M and N that cover populations of different social rank from all over the subcontinent. Coalescence times suggest early late Pleistocene settlement of southern Asia and suggest that there has not been total replacement of these settlers by later migrations. H, L, and R2 are the major Indian Y-chromosomal haplogroups that occur both in castes and in tribal populations and are rarely found outside the subcontinent. Haplogroup R1a, previously associated with the putative Indo-Aryan invasion, was found at its highest frequency in Punjab but also at a relatively high frequency (26%) in the Chenchu tribe. This finding, together with the higher R1a-associated short tandem repeat diversity in India and Iran compared with Europe and central Asia, suggests that southern and western Asia might be the source of this haplogroup. Haplotype frequencies of the MX1 locus of chromosome 21 distinguish Koyas and Chenchus, along with Indian caste groups, from European and eastern Asian populations. Taken together, these results show that Indian tribal and caste populations derive largely from the same genetic heritage of Pleistocene southern and western Asians and have received limited gene flow from external regions since the Holocene. The phylogeography of the primal mtDNA and Y-chromosome founders suggests that these southern Asian Pleistocene coastal settlers from Africa would have provided the inocula for the subsequent differentiation of the distinctive eastern and western Eurasian gene pools.
PDF file
Negligible Male Gene Flow Across Ethnic Boundaries in India, Revealed by Analysis of Y-Chromosomal DNA Polymorphisms
From the historically prevalent social structure of Indian populations it may be predicted that there has been very little male gene flow across ethnic boundaries. To test this finding, we have analyzed DNA samples of individuals belonging to 10 ethnic groups, speaking Indo-European or Austroasiatic languages and inhabiting the eastern and northern regions of India. Eight Y-chromosomal markers, two biallelic and six microsatellite, were studied. All populations were monomorphic for the deletion allele at the YAP (DYS287) locus and for the 119-bp allele at the DYS288 locus. Y-chromosomal haplotypes were constructed on the basis of one RFLP locus and five microsatellite loci. The haplotype distribution among the groups showed that different ethnic groups harbor nearly disjoint sets of haplotypes. This indicates that there has been virtually no male gene flow among ethnic groups. Analysis of molecular variance revealed that there was significant haplotypic variation between castes and tribes, but nonsignificant variation among ranked caste clusters. Haplotypic variation attributable to differences in geographical regions of habitat was also nonsignificant.
PDF file
High-resolution analysis of Y-chromosomal polymorphisms reveals signatures of population movements from Central Asia and West Asia into India
Linguistic evidence suggests that West Asia and Central Asia have been the two major geographical sources of genes in the contemporary Indian gene pool. To test the nature and extent of similarities in the gene pools of these regions we have collected DNA samples from four ethnic populations of northern India, and have screened these samples for a set of 18 Y-chromosome polymorphic markers (12 unique event polymorphisms and six short tandem repeats). These data from Indian populations have been analysed in conjunction with published data from several West Asian and Central Asian populations. Our analyses have revealed traces of population movement from Central Asia and West Asia into India. Two haplogroups, HG-3 and HG-9, which are known to have arisen in the Central Asian region, are found in reasonably high frequencies (41.7% and 14.3% respectively) in the study populations. The ages estimated for these two haplogroups are less in the Indian populations than those estimated from data on Middle Eastern populations. A neighbour-joining tree based on Y-haplogroup frequencies shows that the North Indians are genetically placed between the West Asian and Central Asian populations. This is consistent with gene flow from West Asia and Central Asia into India.
PDF file