The Clearly Evident Out of India Migration from Ancient DNA

While the two recent ancient DNA papers have set the Indian media abuzz with talk of whether Aryans were indigenous or whether they came from outside, almost all the English media has ignored the fact the genetic data also shows the migration of ancient Indians or Harappans into neighbouring regions of Eastern Iran and Central Asia.

A couple of articles took note of it but tried to minimise its relevance. Tony Joseph at The Hindu says,

Another spin around the new studies suggests an ‘Out of India’ migration. This is also misleading. If by ‘Out of India’ migration we are referring to the fact that some Harappans visited neighbouring civilisations or cultures such as the Bactria-Margiana Archaeological Complex (BMAC) or Shahr-i-Sokhta, with whom they had trade and cultural links, these are well-known and unsurprising facts.

Girish Shahane at the, opines about the Indus Periphery samples,

The obvious explanation is that the 11 people had travelled to those locations from an Indus Valley Civilisation region. Their presence in such far-flung places testifies to the extensiveness of Indus Valley Civilisation commercial and cultural contacts. It does not suggest a migration out of India extensive enough to change the genetic profile of foreign lands, because if that were the case the 11 individuals would not have been obvious outliers.

Both these writers are hardly excited by the fact that there is indeed evidence of Indian migration into Eastern Iran & Central Asia and they try to explain it away non-nonchalantly as the evidence of Harappans’ trade & cultural contacts.  I mean, why is the overwhelming obsession just about proving the supposed Aryan migration ? Isn’t it exciting that we now have clear and indisputable evidence of ancient Indians already, as early as 3300 BC (the earliest dated Indus Periphery sample), migrating and living in advanced Bronze Age civilizations outside of the traditional geographic boundary of South Asia. What was the nature of these interactions should naturally arouse considerable curiosity.

The writers also grossly underestimate the genetic influence of Harappans on both its neighbouring regions.

What the research paper says

Let us quote directly from the paper which published these 11 ancient South Asian migrant samples and see what it has to say on the matter –

We document 11 outliers—three with radiocarbon dates between 2500 and 2000 BCE from the BMAC site of Gonur and eight with radiocarbon dates or archaeological-context dates between 3300 and 2000 BCE from the eastern Iranian site of Shahr-i-Sokhta—that harbored elevated proportions of AHG-related ancestry (range: ~11 to 50%) and the remainder from a distinctive mixture of Iranian farmer– and WSHG-related ancestry (~50 to 89%).

While this part of the research is reported by our writers they fail to note another equally important piece of research from the same paper which states,

Unlike preceding Copper Age individuals from Turan, people of the BMAC cluster also harbored an additional ~2 to 5% ancestry related (deeply in time) to Andamanese hunter-gatherers (AHG). This evidence of south-to-north gene flow from South Asia is consistent with the archaeological evidence of cultural contacts between the IVC and the BMAC and the existence of an IVC trading colony in northern Afghanistan.

When we take both these statements together we get the clear picture. Not only were migrants from the Harappan civilization present in Eastern Iran and Central Asia, infact the genetic admixture from these Harappan migrants was present in all the native people of these regions and was not just consigned to the Harappan migrants only. It is puzzling as to why this crucial information has completely been ignored by almost everyone.

This Harappan admixture was apparently not present in the earlier period in Central Asia before 3000 BC which would mean that the Harappan admixture happened in the transition phase between the Copper Age in Central Asia and the formation of the Bronze Age urban civilization of BMAC when the population of this region increased greatly.

Quite clearly, the migration and mixing of Harappans with the Central Asians and Eastern Iranians co-incides or slightly pre-dates the transformation of these socieites into large urban civilizations and likely played a crucial role in the transition of these culture into urban Bronze Age civilizations.

How is this not an important finding ?

It is quite strange that this clear evidence of Out of India migration is being sidelined and neglected. Let us dig a little deeper and show what the quite comprehensive and large Supplementary section of the paper has to say on this Out of India migration.

What the Supplement Says

The primary ancestry source of all these ancient samples from Central Asia & Eastern Iran was the Iranian farmer related ancestry, the same ancestry type which was also the main ancestry of IVC people.

This Iranian farmer related ancestry was the main ancestral source in Central Asia even in Chalcolithic period (4000 – 3000 BC) which means that already by the Chalcolithic period the populations of Central Asia and NW South Asia had a lot of shared ancestry before the IVC migration into Central Asia during the Bronze Age. How old is this shared ancestry should be a matter of future research.

The analysed data from Central Asia and Eastern Iran in the Bronze Age consists of about 60 samples from the sites of Gonur, Sappali, Jarkutan (Dzharkutan) & Bustan associated with the BMAC or Oxus Civilization of Central Asia, 2 samples each from Parkhai & from Aigyrzhal in Kyrgyzstan, also from Central Asia and finally 17 samples from the large Eastern Iranian settlement of Bronze Age Helmand civilization, Shahr-i-Sokhta (fig. S17, pg 202).

Quoting from the Supplement, they observe from the admixture plot (fig S18, bottom panel, pg 203) that,

“… some individuals from each site also harbor trace amounts of a component that is maximized in Andamanese Hunter-Gatherers (AHGs) and Dravidian speaking groups in southern India. Particularly revealing is our observation of outlier individuals from several of these sites that are exceptions to these patterns. We hypothesize that these individuals were migrants from South Asia (or descendants of recent migrants)…” (pg 202)

Based on the results of the admixture-f3 stats (fig S21-S23) on the BMAC and Shahr-i-Sokhta main group of samples they further observe,

For the BMAC main cluster, we also observe significant (Z<-3) admixture signals with a source from pre-Copper Age Iran and Turan and a source related to present day groups within the Indian subcontinent, a signal that we do not detect in individuals from the earlier period in Turan (Fig S 21 – Fig S 23). This is consistent with the hypothesis that the main BMAC cluster harbors a proportion of ancestry from gene flow from the south, plausibly from South Asia… We observe that the individuals from Shahr-i-Sokhta, also show significant admixture-f3 statistics with one source as AHG. Taken together with the fact that there are individuals with significantly high proportions of AHG-related ancestry at both sites, this suggests that there was gene flow from South Asia out into Turan during the BA (pg 206).

Finally moving onto the qpAdm admixture analysis, our researchers note,

To better understand the population changes associated with these BA settlements we studied proximal sources. We first observe that there are many models that fit under our acceptance criteria, but they are of a similar nature, in the sense that all involve a population from the previously described Copper Age period with additional ancestry related to present-day South Asians. Most working models involve populations from Turan with additional South Asian related mixture, for example present-day individuals from South Asia with minimal Steppe pastoralist-related ancestry, or outlier individuals from Shahr-i-Sokhta and BMAC sites with high proportions of AHG-related ancestry (pg 210).

When we look at analysed aDNA samples Shahr-i-Sokhta, we see that out of a total of 17 samples, as much as 8 of them are classified as migrants from the Harappan civilization by our authors. So essentially about 50 % of the population at Shahr-i-Sokhta was made up of migrants from the IVC. Infact, barring one sample which they classify as an outlier, the other 8 samples, which the authors label as Shahr_I_Sokhta_BA1, also harbour IVC ancestry and the best fitting qpAdm model (p-value – 0.658) suggests that this S_I_S_BA1 group harboured as much as 20 % ancestry from these IVC migrants (table S30, pg 212).

It may also be noted that the IVC migrants at Shahr-i-Sokhta date from a period between 3300-2000 BC while the other group of 8, labelled the main S_I_S_BA1 group only date between 3300-2600 BC. All in all, this cumulative genetic evidence alone is enough to show that Shahr-i-Sokhta was enormously influenced by the Harappans.

This is further underlined when one looks at the archaeological evidence which shows substantial similarities between Shahr-i-Sokhta and sites to its east in the Indian subcontinent. Shahr-i-Sokhta itself only came into existence as a habitation site around 3300 BC but the other major site of the Helmand civilization, Mundigak, found about 425 kms east along the course of the Helmand river in Afghanistan, has a beginning going back to the 5th millenium BC, and it already shows significant cultural parallels with older sites such as Mehrgarh to its southeast. The main cattle type at Shahr-i-Sokhta was also of the Indian Zebu variety further cementing the strong east to west cultural and genetic flow at the site.

To try to brush this off vaguely as evidence of ”commercial and cultural links” and trying to de-emphasize the importance is quite telling.

In contrast, the Italian archaeologist Massimo Vidale, one of the principal researchers at Shahr-i-Sokhta in recent years and one of the contributors of this Narasimhan et al paper, states the following on page 110 of Supplementary section,

The archaeological and genetic evidence thus suggest that a flow of migrants from the northwestern borderlands of South Asia was active at the beginning of the local settlement, and that the same flow, different from the earlier one from northwest, intensified in the following centuries. We unfortunately do not have ancient DNA from Period III and the later centuries of the Shahr-i-Sokhta sequence when cultural influence from the Indus Valley Civilization appears to become stronger.

To be sure, what percentage would a trader community consitute in the population of a foreign city ? And out of these traders, how many are going to end up dying and getting buried in that foreign land ? Perhaps 1 or 2 %. It surely cannot be as high as 50 % besides also contributing your ancestry to all the local people in the foreign land.

But that is what we see at Shahr-i-Sokhta and therefore this is evidence of very deep cultural and perhaps religious and linguistic links.

The body of archaeological literature documenting the links between the IVC, Eastern Iran & BMAC is quite significant and I intend to write on it, God willing, in the next post to complete and complement the picture given by genetic data.

Moving to Bronze Age Central Asia, the authors document 3 samples from the major site of Gonur in the Margiana region of the BMAC which they classify as IVC migrants along with the 8 already noted at Shahr-i-Sokhta. Alongside these 3 samples, there is a slightly later sample dating to around 1500 BC from the site of Bustan, labelled as Bustan_BA_o2, which is not similar to the 3 Gonur IVC migrants but can be modelled as 70 % Swat_IA + 30 % IVC migrants. So this is also for all intents and purposes a likely migrant from South Asia but with a very different genetic profile.

But besides this, as can be read from the quotes from the Supplement and main paper given above, all the ancient samples of the main group of BMAC people also harboured ancestry from South Asia which was absent in the earlier period.

In the proximal models for the BMAC main cluster (table S28, pg 210), we can observe that the BMAC Bronze Age population can be modelled as deriving between 70-75 % ancestry from Shahr-i-Sokhta_BA1 which itself harbours about 20 % IVC ancestry. Therefore this suggests that the BMAC main cluster, its core population, harbours about 15 % ancestry from IVC.

Therefore, we see all pervasive South Asian admixture in Central Asia just before the region becomes urbanised and perhaps organised into a state. This suggests that the South Asian or IVC migration and admixture in Central Asia could have played a leading role in the transition of the region into an urbanised state with major cities like Gonur.

Our authors argue that the 2 samples from Aygyrzhal in Kyrgyzstan in the eastern part of Central Asia, do not have any South Asian ancestry but this is belied by the fact that in all proximal models (table S31, pg 212), these 2 samples show about 17 % admixture from IVC migrants.

Nor is the IVC genetic influence this far to the east in Central Asia an anamoly. A study of mtDNA  from some ancient Tarim Basin samples (4000 YBP), showed the presence of the indisputably Indian lineage M5, besides other lineages such as U7, U2e & R* which are also widely present in South Asia, Central Asia and Iran.

As pointed out by Silva et al 2017, mtDNA M5a was clearly part of outward South Asian migration towards Iran in the Bronze Age. Therefore its presence also in Tarim Basin in the same timeframe explicitly confirms it to have been there as part of South Asian migration and admixture in the Tarim Basin in that early period.

Winding up the genetic evidence, we observe that in the Bronze Age, the IVC genetic influence extended in an enormous arc from Shahr-i-Sokhta in Eastern Iran to the Tarim Basin mummies in modern Xinjiang, China with the genetic influence on the Helmand civilization (Shahr-i-Sokhta) being quite overwhelming and that on the BMAC also being all-pervasive.

How is this major Bronze Age genetic phenomenon not a significant event of Out of India migration ? Whether this was related to Indo-European migration, only time will tell.

It is a matter of archaeological record that the greatest influence on BMAC was from IVC and the Eastern Iranian civilizations of Halil Rud (Jiroft) and Helmand which were themselves heavily influenced by IVC. Infact, when there is evidence of such overwhelming cultural and genetic links between these regions, political links between these regions would have been a definite reality. Infact, as Mesopotamian records of the 2nd quarter of the 3rd millenium BC, the Eastern Iranian state of Marhasi (perhaps Halil Rud) was closely in political alliance with Meluhha (the IVC).

It is quite instructive that the historically known core region of Indo-Iranians was already greatly in confluence from the Early Bronze Age period.


The Origins of the Neolithic in the Indian subcontinent

Last week was a good one for all those who had been waiting endlessly for the Rakhigarhi aDNA paper to see the light of day and also for the pre-print ‘The Genomic Formation of South & Central Asia’ to come out in a peer-reviewed journal.

Unfortunately, the talk around both these papers has quickly degenerated to only figuring out whether these papers support an Aryan Migration or whether they do not.

However, the combined data that has come out from these two papers is a treasure trove of information and this data has enormous implications for a lot of other theories of South Asian prehistory.

I wish to focus one post for each of these important discoveries starting with the important discovery about the antiquity of IVC ancestral population in South Asia and its implication for origins of farming in the region.

The graphical extract given above is quite self-explanatory. But let us also quote from the paper itself to re-inforce what the graphical extract implies :-

The Iranian-related ancestry in the IVC derives from a lineage leading to early Iranian farmers, herders, and hunter gatherers before their ancestors separated, contradicting the hypothesis that the shared ancestry between early Iranians and South Asians reflects a large-scale spread of western Iranian farmers east. Instead, sampled ancient genomes from the Iranian plateau and IVC descend from different groups of hunter-gatherers who began farming without being connected by substantial movement of people.

They elaborate on what the implications of this finding is –

Our evidence that the Iranian-related ancestry in the IVC Cline diverged from lineages leading to ancient Iranian hunter-gatherers, herders, and farmers prior to their ancestors’ separation places constraints on the spread of Iranian-related ancestry across the combined region of the Iranian plateau and South Asia, where it is represented in all ancient and modern genomic data sampled to date. The Belt Cave individual dates to 10,000 BCE, definitively before the advent of farming anywhere in Iran, which implies that the split leading to the Iranian-related component in the IVC Cline predates the advent of farming there as well…Thus, the Iranian-related ancestry in the IVC Cline descends from a different group of hunter-gatherers from the ancestors of the earliest known farmers or herders in the western Iranian plateau.

So the paper on the Rakhigarhi aDNA sample makes it abundantly clear that the geneticists find the Iran farmer/herder related ancestry in the IVC people to have split up from the actual Iranian farmers/herders atleast before 12,000 years ago. This is well before the origin of farming on the Iranian plateau. Therefore, the clear implication of this is that the Iranian Neolithic Farmers did not contribute ancestry to the ancestors of the Harappans but that the ancestors of Harappans and the early Iranian farmers/herders descend from a common ancestral source that existed more than 12 kya.

What was the place of origin of that ancestral source ? We do not know that as yet so to assume that it is coming from Iran, as some are doing, is not borne out by evidence. It may well have originated in South Asia before moving into Iran.

I have written at length on this topic at my blog.

Infact, the deep pre-Neolithic origins of Iran N like ancestry in South Asians was already becoming apparent that as early as 2011 when this major paper on South Asian populations made the following observation –

… regardless of where this component was from (the Caucasus, Near East, Indus Valley, or Central Asia), its spread to other regions must have occurred well before our detection limits at 12,500 years.

Even before this, in 2004, when a paper, by this very same Estonian team, on mtDNA lineages shared between Iran & South Asia had come out, it was already noted that the spread of shard mtDNA lineages between the two regions was quite old.

This separation between the South Asian & Iranian farmer/herder ancestry not just dates to before 12 kya but it likely dates to the very end of the Last Glacial Maximum, somewhere around 18-17 kya.

Figure 1

The above graphic shows the early split of mtDNA U7 across Eurasia. One can quite clearly see that the oldest splits seem to be between Iranian plateau & South Asia. This pattern is also observable for other shared mtDNA & y-dna lineages shared between these 2 regions. For more on this please refer to my blog.

Therefore, the Rakhigarhi provides one more line of evidence to re-inforce what was already evident from many previous studies on modern DNA, that the major portion of the shared ancestry between the Iranian plateau and South Asia was not brought into South Asia from Iran by the Iranian farmers but had separated from the ancestors of Iranian farmers/herders well before 12,000 years ago. With this Rakhigarhi paper getting such widespread coverage, one can only hope that from now on this fact will become known to much more people than it had been previously.

South Asian Neolithic

Since it has now become clear that there was no migration of early Iranian farmers into South Asia, so it has begun to be argued that farming must have been adopted by local NW hunter gatherers of South Asia through spread of ldeas from the fertile crescent.

While this is certainly plausible, it rests on the premise that farming began independently only once in the Fertile Crescent and it spread from there to everywhere else.

This is however not so straightforward. There are similarities at Mehrgarh with the Neolithic sites of the Eastern Fertile Crescent or the Iranian Neolithic but not with the Levant or Anatolian Neolithic. The archaeologist who discovered the Neolithic site of Mehrgarh and carried out decades of excavation over there, J F Jarrige, had this to say

The similarities noticed between Neolithic sites from the eastern border of Mesopotamia to the western margins of the Indus valley are highly significant. A sort of cultural continuum between sites sharing a rather similar geographical context marked with an also rather similar pattern of evolution and transformation becomes more and more evident. But the Neolithic of Mehrgarh displays enough original features to imply an earlier local background which has so far not been documented. Nevertheless the cultural dynamism shown by the inhabitants of Mehrgarh as early as level I of Period I indicate that the Neolithic of Balochistan cannot be interpreted as the “”backwater” of the Neolithic culture of the Near East. (link)

So indeed there are cultural similarities between the South Asian Neolithic and the West Iranian Neolithic and we know that the early farmers of both these regions were also separated from a common ancestor but the genetic evidence makes it clear that the common ancestor likely lived at the end of LGM aroynd 18 kya rather than the beginning of Neolithic around 9-8 kya.

It is not quite clear how the early South Asian & Iranian farmers interact or relate to each other during this Neolithic phase but it is quite clear that the early Iranian farmer was significantly different than the nearby farmers of the Levant & Anatolia both genetically and even culturally while he has much more in common with his much distant eastern cousins. Might not this imply that the origins of these Iranian farmers lay to the east close to its South Asian cousins ?

Jarrige provides several lines of evidence that suggest that the Neolithic in South Asia developed independently of West Asian input.

Lorenzo Costantini has shown that the plant assemblage of Period I is dominated by naked six row barley which accounts for more than 90% of the so far recorded seeds and imprints. He has also pointed out the sphaerococcoid form of the naked-barley grains with a short compact spike with shortened internodes and small rounded seeds. According to him, such characteristics in the aceramic Neolithic levels can be ascribed to probably cultivated but perhaps not fully domesticated plants. Domestic hulled six-row barley (H. vulgare, subsp. vulgare) and wild and domestic hulled two-row barley (H. vulgare subsp. spontaneum and H. vulgare subsp. distichum) have also been recorded, but in much smaller quantities. According to Zohary quoted by R.H. Meadow, the distribution of wild barley extends today to the head of the Bolan Pass. It is therefore likely that local wild barleys could have been brought under cultivation in the Mehrgarh area.

The main crop at Mehrgarh was barley by far and we see that there is enough evidence to suggest its local domestication from nearby wild varieties.

There is also genetic evidence to suggest two independent centers of barley domestication.

There has long been speculation that barley was domesticated more than once. We use differences in haplotype frequency among geographic regions at multiple loci to infer at least two domestications of barley; one within the Fertile Crescent and a second 1,500–3,000 km farther east. The Fertile Crescent domestication contributed the majority of diversity in European and American cultivars, whereas the second domestication contributed most of the diversity in barley from Central Asia to the Far East.

It is abundantly clear that the barley used by South Asian Neolithic farmers fits in very well with this second domestication scenario.

Jarrige also notes,

The presence of bones of relatively small subadult and adult animals in the trash deposits of the early levels confirms, according to R.H. Meadow, the domestic status of at least some of the goats. Meadow has also clearly shown that “” though in the course of Period I at Mehrgarh, the remains of sheep and cattle became to increasingly dominate the faunal assemblages of the successive strata, at the same time, the animal represented grew smaller in body size”. By the end of Period I, cattle bones amount for over 50% of the faunal remains. Osteological studies as well as clay figurines indicate that zebu cattle (Bos indicus) is well attested in Period I and became most probably the dominant form (Fig. 12). Mehrgarh provides us therefore with a clear evidence of an indigenous domestication of the South Asian zebu. We know today that Bos indicus and Bos Taurus, the non-humped bull from the Middle-East, have a different genetic origin. Therefore the assumption that farming economy was introduced full-fledged from Near-East to South Asia needs to be questioned.

The archaeological & genetic evidence therefore clearly implies a local domestication of indigenous cattle just as is the case with the domestication of barley. Even the goats and sheep were likely domesticated from local wild populations of the same. We may bear in mind that the Iranian Neolithic farmers were essentially goat herders and did not have cattle for a long period before it eventually came in from the Anatolian-Levant region. So they couldn’t possibly bring in the know-how or cattle domestication to South Asia.

With such an abundance of evidence at our disposal it becomes clear that the origins of Neolithic economy in South Asia can be attributed to local South Asian Hunter Gatherers, deeply related to Iranian farmers/herders, who started it by indigenous domestication of barley and cattle from local wild ancestors. So the only question that remains is whether the Neolithic technological knowledge came in from the Fertile Crescent.

That is difficult to answer, but we may end here by noting that in the case of the Near Eastern Neolithic, the transition from hunting gathering phase to the farming phase brought about an admixture of Iranian Farmer like admixture in the local Anatolian HGs. 

Fig. 2

In the above figure a, AHG is Anatolian HG while AAF is Anatolian Aceramic Farmer. We see from this graphic that, with a transition from AHG to AAF, we see that there is an influx of admixture into AAF which brings it slightly closer to Iranian Farmer & CHG ancestry. In contrast no such admixture is seen in South Asian farmers.

So might this indicate that the Neolithic technology spread from East to West ? Future research will surely shed more light on it but it should be clear to everyone that South Asia most likely was an independent and major center of Neolithic transition and deriving its origin from the Fertile Crescent looks rather unconvincing.

We may also look afresh at the question of origins of Central Asian Neolithic and whether it derived from Iranian Neolithic or South Asian Neolithic.



Who brought the Zebu in the Near East ?

If folks over here are not already aware, a major ancient DNA paper on cattle came out last week in the Science magazine. It had an unprecedented 67 ancient cattle samples.

Here’s the abstract,

Genome-wide analysis of 67 ancient Near Eastern cattle, Bos taurus, remains reveals regional variation that has since been obscured by admixture in modern populations. Comparisons of genomes of early domestic cattle to their aurochs progenitors identify diverse origins with separate introgressions of wild stock. A later region-wide Bronze Age shift indicates rapid and widespread introgression of zebu, Bos indicus, from the Indus Valley. This process was likely stimulated at the onset of the current geological age, ~4.2 thousand years ago, by a widespread multicentury drought. In contrast to genome-wide admixture, mitochondrial DNA stasis supports that this introgression was male-driven, suggesting that selection of arid-adapted zebu bulls enhanced herd survival. This human-mediated migration of zebu-derived genetics has continued through millennia, altering tropical herding on each continent.

Well the widespread Zebu presence in the LBA Near East is not exactly a new discovery. It has been known for quite sometime now among archaeologists. This article gives a good overview on the subject.

(Syrio-Hittite Terracota Zebu, source)

(Hittite Bronze Zebu, source)


Nevertheless, with this new study, we now have conclusive evidence that the widespread Zebu admixture in the Near Eastern cattle did indeed start around 2000 BCE.


Illustration of the proportion of Zebu ancestry over time in domesticated cattle of Eurasia. From Verdugo et al. 2019

(figure courtesy : twitter/johnhawks)

As can be seen from the above figure, after 4000 YBP, there is a sudden spurt in indicine or Zebu admixture across nearly all the ancient samples of Near Eastern Cattle including cattle samples from Central Asia, Iran, the Caucasus, Levant, Mesopotamia & the Balkans.

Again quoting from the paper,

…despite archaeological evidence for contact between civilizations of the Fertile Crescent region and the Indus Valley (9), the influence of the zebu genome is detectable in ancient Southwest Asian cattle only 4000 years later (Fig. 2). However, after ~4000 yr B.P., hybrid animals (median 35% indicine ancestry) are found across the Near East, from Central Asia and Iran to the Caucasus and Mediterranean shores of the southern Levant (table S2 and fig. S1). During this period, depictions and osteological evidence for B. indicus also appear in the region (9, 13). In contrast to autosomal data, but similar to earlier work (14), we find persistence of B. taurus mitochondria, suggesting introgression may have been mediated by bulls (Fig. 2).

As the bolded portions note, the indicine admixture in the ancient cattle samples from the Near East post 4000 YBP, show on an average 35 % indicine admixture but that this mostly through the Indicine bulls as there is indicine mtDNA in these admixed samples. This widespread and major admixture in the Near Eastern cattle post 4000 YBP, is clearly a major turning point in the history of the Near Eastern cattle.

What lead to this major turning point ?

It is now widely accepted that there was a major 4200 YBP climatic event that brought about a prolonged phase of drought perhaps extending to several centuries which affected all major civilizations of the Near East extending all the way to the Indus civilization. This event is also considered a major reason for the apparent ‘collapse’ of the Indus civilization.

The Large Harappan Footprint across Middle Asia


Most if not all movements of cattle across large distances in prehistory are usually associated with movement of humans. Did the large influx of Zebu autosomal DNA into the Near East cattle also result due to South Asian population expansion into the Near East post-4000 YBP ?

What we do know from archaeology is that the Harappans were operating across a vast landscape and their presence and influence can be detected in Central Asia, Eastern Iran( Jiroft & Helmand), the SE Arabian coast (Oman & Bahrain) and in Mesopotamia already in the 3rd millenium BC. The Harappans even had their own colonies in these different regions.

It appears that a minority but nevertheless a significant and arguably influential community of Harappans were present in most of these regions away from the Greater Indus region as far as atleast Mesopotamia. Infact, in a lot of these places, the Harappans were apparently living in those places for many generations. It is conceivable that they would have brought and maintained the Zebu cattle in their new homelands.

Therefore, already in the 3rd millenium BC, it is quite probable that the Zebu cattle was already a minor but regular feature of the Middle Asian landscape.

However, after the onset of the 4.2 kya event, with the onset of aridity, it is believed that the Harappan civilization finally collapsed around 3900 kya (1900 BCE). One of the consequences of this collapse is apparently the end of Harappan trade and interaction across Middle Asia.

So the questions is – how did the Zebu admixture in the Near East shoot up all of a sudden in a period when we are led to believe that the Harappan or Indus civilization ceased to interact with this region ?

And it is not just Indian cattle admixture that becomes widespread into the Near East during this post-4000 kya period. As the authors of this paper state,

Westward human migration has been documented around  this time(19,20) along with archaeological evidence for the appearance of other South Asian taxa such as water buffalo and Asian elephants in the Near East(21), suggesting the movement of large animals by people.

Infact, the presence of Asian Elephant in the Near East dates to from the end of 3rd millenium BC to the 8th century BC and is centred around Syria and hence this ancient elephant population is also known as the Syrian Elephant.

The authors of the study cited on Asian Elephants in our cattle aDNA paper above state something very interesting,

…ancient accounts indicate that live elephants roamed and were hunted in the Orontes Valley, the Upper Euphrates Valley and the Middle Euphrates Valley around modern Ana in Iraq, at least between the end of the 16th and 9th centuries BC, possibly into the 8th century BC (Breasted 1906–07; Gardiner 1964: 179, 201; Moorey 1994: 117; Scullard 1974: 28). The core of this region comprises the area of influence of the Mitanni Kingdom, the main local political player in LBA northern Syria…

The Mitanni were an Indo-Iranian, possibly Indo-Aryan elite ruling over the native Hurrian population in present day Syria in the mid 2nd millenium BCE. And we observe that Asian Elephants, most likely imported from the Indus civilization, only appear in the Near East during the 2nd millenium BC and in a geography that was under the political control of the Mitanni. Is this merely a co-incidence ?

A New Interaction Horizon in Middle Asia in the LBA ?

It is quite probable that with the collapse of the old Bronze Age empires all across the Near East and South Asia, this entire Middle Asian region entered into a new era of interaction which is archaeologically not so well-defined. The widespread Zebu admixture in the Near Eastern cattle along with the presence of Asian elephants as also the first attestations of the Indo-Europeans such as the Hittites & the Mitanni and the associated horse-drawn chariots are evidences of this new horizon.

But while the Zebu and the Asian Elephant came from South Asia, one may argue that the Hittites, Mitanni and the horse drawn chariot must surely come from the steppe.

Well, the strong correlation of the geography of the LBA ‘Syrian’ Elephant population with the boundaries of the Mitanni empire, does suggest some sort of interaction of the Mitanni with South Asia.

And if this is not all, we have now possibly a very strong evidence of the presence of chariots in South Asia which is as old and probably older than the Sintashta chariots in the steppe.

It maybe noted that before the discovery of the chariots in Sintashta which gave the steppe chariot the distinction of being the oldest known one, it has been argued for long that the invention of chariot is mostly likely in the Near East and not in the steppe due to several factors inimical for the use of chariots on the steppe. You can read more about it here.

The discovery of the Sanauli chariots is likely to revive that debate. Along with this, we also have the prospect of horse aDNA from the Mature Harappan Phase no less.

So could the appearance of the Zebu, the Asian elephant, the Buffalo as well as the Hittites & Mitanni and their horse drawn chariot in the 2nd millenium BC in the Near East have a common cause in their probable migration from South Asia after the Indus civilization collapse.

Certainly some food for thought !





The Pulwama attack and its Aftermath

Last week the world media was gripped with the news of Indian airstrikes inside Pakistan followed a day later by Pakistani figher jets (which allegedly included F-16s) apparently intruding or flying very close to the LOC and atleast one of the Indian jets, MiG 21, pursuing the Pakistani fighter planes getting hit with both the plane & its pilot falling into the PoK region.

Initially the Pakistani army & political establishment claimed that they had downed two Indian jets and that they had 2 Indian pilots in their custody but this was corrected a little while later to the claim the custody of only 1 Indian pilot.

Two days later the Indian pilot was handed over to India after the Pakistani PM Imran Khan made an official announcement within their parliament. The prompt release of the Indian pilot is considered by some to have gone a long way in de-escalation though heavy shelling at LOC from both sides appears to be still ongoing.

The Background

However, behind this unfolding story that dominated International media for a few days, there are other developments preceding and succeeding it which also need to be looked into.

The reason why India was forced to carry out the airstrikes deep into Pakistan in Balakot, Khyber Pakhtunkhwa was the massive suicide attack on a CRPF convoy of about 75 buses transporting 2500 soldiers back to the borders after their holidays. This attack martyred more than 40 Indian soldiers and caused a huge uproar across the nation. The Indian government was quick to point to Pakistan for the attack and as per The Hindu newspaper, the Pakistan based Jaish-e-Mohammad claimed responsibility for the attack with the suicide bomber being a local Kashmiri youth.

The NDA government in India under Narendra Modi had so far tried to project a no-nonsense image against terrorism, especially in Kashmir. As a result, the Uri attack was followed by what India claimed was a major ‘surgical strike’ across the border into PoK that apparently left many dead in the terrorist lauchpads. The recent Pulwama attack was bigger than Uri and so the Modi government was expected to do something akin to the ‘surgical strike’ or maybe bigger. Modi warned of a retaliation and the air was thick in anticipation. The Balakot airstrikes followed 12 days later.

The Pakistani narrative all along has been that there is no proof of Pakistani involvement in the Pulwama attack and that it was willing to act upon the terrorist organisations inside Pakistan if India provided them ‘actionable intelligence’. The Pakistani political establishment argues that given the precarious economic position Pakistan finds itself in, it is in no position for adventurism  and carry out the Pulwama attack and create more problems for itself. It has been also preaching peace non-stop. The release of the Indian pilot was also apparently meant to further this Pakistani objective.

Who’s behind the Pulwama attack ?

In consonance with the Pakistani establishment preaching peace and innocence has been the narrative within Pakistan which suggests that it is Narendra Modi & his Govt consisting of Hindu fundamentalists who want war and bloodshed and who are just looking for an excuse to start a war. Apparently this war hysteria & jingoism across India against Pakistan is Modi’s own making which he has created to help him win the upcoming elections which he would otherwise lose. Infact the demonisation of Modi in Pakistani media has been a constant feature ever since he became the PM candidate from the BJP for the 2014 polls. In newspapers like the Dawn, there does not pass by a week without atleast a couple of articles about the ‘misdeeds’ and ‘mal-intentions’ of Modi & RSS/BJP .  This has been happening for 5 years now. You are unlikely to find a single Pakistani who has some good to say about Modi. Thats the kind of propoganda the Pakistani media has run against Modi & BJP/RSS with a lot of help from their liberal leftist media friends in India. Many in Pakistani are quite convinced that the Pulwama attack itself was done at the behest of Modi so that he could create a war hysteria and use it to his benefit in the elections.

Nevertheless, the Pulwama attack was not the only major terrorist attack in the region in the past few weeks. Just a day before the Pulwama attack which happened on February 14, there was an identical terrorist attack in Iran where a suicide bomber in a car/van attacked a bus of the elite Revolutionary guards of Iranian armed forces in the Sistan province in which as many as 27 of these soldiers were martyred. The attack was claimed by Jaish-ul-Ahd, a group operating from within Pakistan just like the Jaish-e-Mohammad. It is likely that Jaish-ul-Ahd has links with Lashkar-e-Taiba & the Pakistani establishment.


The Iranians have vowed revenge and they also appear to put the blame on Pakistan.

The extemely identical nature of these terrorist attacks in Iran & India separated by a mere day, and targetting the security forces of these countries travelling in convoys, by jeeps/vans driven by suicide bombers, suggest that these attacks may even have been planned together. If so, it raises the question – could Pakistan have the gumption to carry out such dastardly attacks at the two ends of their neighbourhood ? And to what avail ?

If Pakistan is given the benefit of doubt, it still does not mean that groups based within the Pakistani soil did not carry out these attacks. But could it be there is some external agency or power involved which has enough reach in the South Asian region in general and also in Pakistan, that backed these groups to carry this out ?

The responses of the Iranian leadership immediately after their armed forces were attacked is worth a read.

As per their Supreme leader Khamenei, “It is certain that the perpetrators of this crime were linked to spy agencies of certain regional and trans-regional countries and Iran’s relevant organizations must focus on that and seriously pursue it.”

Their President Rouhani while blaming the US & Israel as the “root causes of terror” in the region, urged Iran’s neighboring countries to “fulfill their legal obligations” within the framework of the principle of good neighborliness and prevent the terrorist groups from using their soil to launch attacks against their neighbors. In this last bit, the neighbouring country he had in mind appears to none other than Pakistan.

Could the Iranian leadership be onto something here and could it be that other more formidable powers with interests in the region, especially in Afghanistan & Pakistan, had a role in the twin attacks ? Surely one should not let Pakistan off the hook. It appears that even Iran argues about Pakistani complicity in the attack on its forces,

“Pakistan’s government, who has housed these anti-revolutionaries and threats to Islam, knows where they are and they are supported by Pakistan’s security forces,” said Revolutionary Commander Major General Mohammad Ali Jafari, referring to terror group Jaish al-Adl (“Army of Justice”). 

“If (the Pakistan government) does not punish them, we will retaliate against this anti-revolutionary force, and whatever Pakistan sees will be the consequence of its support for them,” Ali Jafri warned. 

However, could it be that the terrorist organisations and their military/ISI backers are a law unto themselves who perhaps do not even willingly answer to the Pakistani establishment and are willing to do the work of the highest bidder ? In such an environment, an external power can certainly find much opportunity to further its own ends.

The Aftermath of the Indian air-strikes

Apparently, after the air strikes, there was also a threat of a missile attack on Pakistan, as claimed by the political leadership of Pakistan (Imran Khan, Shah Mehmood Qureshi), a perilous situation that was subsequently diffused. It also appears that the US among all the external players involved in the region, has played the most active and significant role in diffusing the situation between India & Pakistan.

The statement by Donald Trump from Hanoi, Veitnam of some “reasonably attractive” news coming from India-Pakistan followed by the release of IAF pilot as well as the Pakistan Foreign Minister profusely thanking the Americans in playing the lead role in diffusing the tensions is clear pointing to such an inference.

As per Qureshi, “I would especially like to thank US Secretary of State Mike Pompeo. By utilising private diplomacy, the US has played a very positive role. I’d also like to mention the efforts of China, Russia, Saudi Arabia, Turkey, Jordan and the UAE for trying to defuse the situation through diplomacy.

The response from China has been on the other hand quite muted. It has also refused to take sides with Pakistan, whether in defending it from the Pulwama attack accusation or after the Balakot airstrike. China has merely asked both the countries to sort out the differences between themselves.

It could very well be that sensing an opportunity in the ongoing crisis to further its own ends, the US forced itself into the mediation between the two countries. In response to relieving the pressure from India, Iran & Afghanistan, India would have likely asked for strict concrete steps being taken against LeT & JeM and their leaders. However, it is also certain that the US would also have asked for its pound of flesh in the bargain in the form of Pakistani support in the US activities in the region, such as in the US talks with the Taliban. It may also have been a way for the US to convey to Pakistan that in a crisis with India, it was only the US and not China that can help it. If Pakistan has acquiesced to the US demands, we may even see some economic aid given to them by the US pretty soon.

What can India do ?

It seems to be clear that the US and Indian interests in Afghanistan and Pakistan are not converging.  In the current flare-up, the US appears to not be completely on the Indian side. While it has condemned the terrorist attack, batted for India’s right to respond and asked Pakistan to act against the terrorists on its soil, it has taken an attitude of reconciliation rather than confrontation with Pakistan.

As far as India is concerned, one of the things that has changed from the Indian side is that it has shown a willingness to respond to a cross-border terrorist attack with significant escalation. This raises the stakes for all those countries with interests in the Af-Pak region such as the US, Russia & China as well as for the wider international community. India’s entreaties to other countries about terrorism emanating from Pakistan which harms it, will henceforth be taken with a greater urgency. The airstrikes did not just send a signal to the Pakistani establishment and they may have also served to remind the world community that India is in no mood to take this lying down and if they want things to run smoothly, they ought to exert pressure on Pakistan.

It appears now that Pakistan has for the time being at the very least started acting against the terrorist organisations if the news coming out of Pakistan is to be believed. Besides the Indian pressure, the imminent blacklisting under FATF is also hanging like a Damocles’ sword on Pakistan’s neck. But India remains unconvinced because it has often seen such actions being reversed in the past.

It remains to be seen what the larger Indian objective is and how will it be achieved. One likely strategy is to bring Iran & Afghanistan on the same page as India and portray them as victims of Pakistan based rogue elements. How much support does India get from the US in this is anybody’s guess.

While the Imran Khan government has unequivocally said that it wants nothing but peace with India, it has still made all efforts to highlight the Kashmir issue and sell its narrative of Indian atrocities in Kashmir. So, it is clear that though Pakistan seeks peace it still wants it on its own terms. As long as Pakistan maintains this obstinacy peace in the region will remain elusive.  If Pakistan is sincere about peace, it should stop its focus on Kashmir. If India-Pakistan relations improve it is quite natural that the terrorism in Kashmir and alleged Indian atrocities are going to die down. So why does Pakistan not see this ? Perhaps because it still harbors a desire of wresting Kashmir out of Indian control, if not as a Pakistani province than as a independent country under its absolute influence. This will quite obviously not be acceptable to India at any cost. The sooner the Pakistani establishment lets go off this, the better it would be for peace in the region.

In the long term, the South Asian region needs to come together as one block if it wants World powers to stop meddling in the region. The biggest stumbling block is the Indo-Pakistani conflict on Kashmir and Pakistan’s willingness to act as a client state of world powers like the US and China who are only chasing after their own interests. China has a firm handle on Pakistan at the moment and it is unlikely that China will let go off it anytime soon but the Pulwama crisis has left the Chinese in an awkward position. With Pakistan giving the lion’s share of the credit for de-escalation to the US, the Chinese have had to themselves come forward and claim that it too played a ‘constructive’ role in the de-escalation. What this means for the future is anybody’s guess.


The Balto-Slavic & Indo-Iranian Connection

         A Confirmation of the Vedic Tradition

I had made a partial review of the recent paper on Indus Valley populations last time around where I tried to argue that the genetic evidence brought out by the paper confirms the Vedic tradition. As per the Vedic tradition the region of Haryana and Western UP was the Vedic homeland from where the Vedic culture, religion and language spread across the entire subcontinent. It is conceivable that this was accompanied by migration of people from the Vedic heartland into regions further inland spreading their genetic signature in the process. It is also conceivable that this genetic signature was present in higher proportion among the Upper Castes like the Brahmins & Rajputs than the lower castes in those regions. Such a signature,  found in higher proportions among the Upper Caste Brahmins and Rajputs, has been claimed to be identified by the geneticists but its source is said to come from the Pontic Caspian Steppe. Its entry into South Asia supposedly formed a group termed as ANI that then became the source population for the Indo-Aryan spread & expansion across South Asia and that the genetic signature on this Indo-Aryan expansion in the recipient groups further inland was in terms of their relative share of this ANI ancestry. In short, the Indo-Aryan and Vedic civilization spread across South Asia was accompanied by admixture with this ANI group by the recipient populations.

It has also been argued that the greater presence of ‘steppe’ ancestry among the Upper Castes is an implicit confirmation of this ancestry having brought Indo-Aryans and the Vedic culture into South Asia.

The present study under review shows quite clearly that a group presently living in the region of the ancient Vedic heartland, Rors (but also the much more numerous Jats), have the highest ‘steppe’ ancestry among South Asians and than they can be considered as that hypothetical  ANI source population. Since this puts the ANI source population squarely in Haryana & Western UP (places inhabited by the Haryanvi Jats) it suggests that Haryana is the genetic ground zero from where the genetic signature of ancient Vedic people spread across the subcontinent.

This inference is therefore clearly in support and confirmation of the Vedic tradition which revers the land of Haryana & Western UP as the ancient Vedic heartland from where the Vedic culture disseminated across the wider South Asian region.

There is also some linguistic support for Haryana & Western UP being the Vedic homeland/heartland. Continue reading “The Balto-Slavic & Indo-Iranian Connection”


Are Haryana Jats the closest living descendents of our Vedic forefathers ?

Recently, there was a paper on some communities of Northwestern India such as Rors, Jats, Kambojs, Gujjars & Khatris. The primary focus of the paper was the community of cattle herders from Haryana known as Rors.

This is part 1 of my review of the paper. In part 2 I shall focus on whether the evidence furnished in the paper proves a steppe migration into South Asia.

Let me first quote the abstract in full :-

The Indus Valley has been the backdrop for several historic and prehistoric population movements between South Asia and West Eurasia. However, the genetic structure of present-day populations from Northwest India is poorly characterized. Here we report new genomewide genotype data for 45 modern individuals from four Northwest Indian populations, including the Ror, whose long-term occupation of the region can be traced back to the early Vedic scriptures. Our results suggest that although the genetic architecture of most Northwest Indian populations fits well on the broader North-South Indian genetic cline, culturally distinct groups such as the Ror stand out by being genetically more akin to populations living west of India; such populations include prehistorical and early historical ancient individuals from the Swat Valley near the Indus Valley. We argue that this affinity is more likely a result of genetic continuity since the Bronze Age migrations from the Steppe Belt than a result of recent admixture. The observed patterns of genetic relationships both with modern and ancient West Eurasians suggest that the Ror can be used as a proxy for a population descended from the Ancestral North Indian (ANI) population. Collectively, our results show that the Indus Valley populations are characterized by considerable genetic heterogeneity that has persisted over thousands of years.

Pay attention to the bolded part. As per the pre-print by Narasimhan et al, the ANI is the likely population that spread Steppe ancestry and hence Indo-Aryan ancestry among South Asians by mixing with  the ASI group. Now this paper on Rors says that Rors (by corollary the Jats) are the population most identical to this hypothetical ANI population. Please note – It is not Brahmins but a herder group from Haryana, which is the vert heartland of Vedic India. This is very significant because it clearly establishes the veracity of our Vedic tradition.

Let us look at this in more detail.

The ancestors of Rors and Jats from Haryana spread the Vedic civilization

As many of you here might be aware, the Vedic homeland was situated on the banks of the river Saraswati in a region which encompassed today’s Haryana and Western UP from where it eventually spread further into Northern India, principally in the Gangetic plains and beyond.

In terms of genetics therefore, one may argue that if there is a genetic signature of the Vedic people, it should be found most strongly in the original Vedic homeland and gradually reduce as one moves away from this homeland. Ofcourse, the caveat would be, that unless the modern people residing in the Vedic homeland had come to completely replace the original inhabitants of Haryana  who spread the Vedic culture.

The ancient DNA research has now shown that in terms of autosomal ancestry, there is link between the modern presence of Indo-European speakers across Eurasia and the ‘steppe’ ancestry component.

In South Asia it is argued, that the ‘steppe’ component is highest among the Brahmins and decreases as one moves down the caste heirarchy and this is said to be one of the principal evidences of movement of steppe people into South Asia having spread the Indo-European language and culture. Infact, the recent Narasimhan et al paper, even went so far as to suggest,

Although the enrichment for Steppe ancestry is not found in the southern Indian groups, the Steppe enrichment in the northern groups is striking as Brahmins and Bhumihars are among the traditional custodians of texts written in early Sanskrit. A possible explanation is that the influx of Steppe_MLBA ancestry into South Asia in the mid-2nd millennium BCE created a meta-population of groups with different proportions of Steppe ancestry, with ones having relatively more Steppe ancestry having a central role in spreading early Vedic culture.

However, it has already been known since many years that the population having the highest ‘steppe’ ancestry in South Asia are not the Brahmins but the Jats, more specifically, the Haryanvi Jats. This was also noted by Razib in one of his earlier blogs.

The present study focuses on this elevated steppe related component in Jats and more specifically in a related group from Haryana known as the Rors. It is titled, ” The Genetic Ancestry of Modern Indus Valley Populations from Northwest India “.  This study has the advantage that it incorporates the aDNA data from the Narasimhan et al and other recent papers.

The following is the admixture graph from the study,

As can be seen in the selected enlarged portion of the graph, the ‘steppe’ like light blue component, which is highest in some of the Northern European groups closest to the steppe, like the Latvians, Lithuanians, Russians etc., is far higher in Rors than it is in the Brahmins or any other South Asian group.

As per the authors themselves,

Outgroup f3 analysis in the form of (PNWI, X; Yoruba) showed that the Ror (and Jat) have distinct, high genetic similarity to modern Europeans (Figures 1C, 1D, and S5), far higher than the similarity observed in other NWI populations, such as the Gujjar (Figures 1D and S5). Among an extended set of South Asians, this pattern was repeated only in the Pathan population from Pakistan (Figure S5).


Refined IBD analysis highlights the general trend whereby the sharing of IBD segments declines as one moves along the cline from PNWI and NI_IE toward Dravidian and Indian Austroasiatic (IN_AA) groups (Figure 2A). Strikingly, among all PNWI groups studied, the Ror demonstrate the highest number of IBD segments shared with Europeans and Central Asians, whereas the Gujjar share a higher number of IBD segments with local Indian Indo-Europeans and Dravidians than do other PNWI groups (Figure 2A).

In CHROMOPAINTER analysis, as expected, the Ror (and Jat) exhibited a significantly higher number of chunks received from Europeans than do other NWI populations studied (t test, p value < 0.01).

They also state further,

A higher level of European ancestry in the Ror and Jat compared to other South Asians (Figures 1, 2, S2, S5, and S13 and Tables S5–S8) makes these two populations outliers within the broader Northwest South Asian landscape. This could be indicative of either a possible recent gene flow from a population related to Europe or to ancient West-Eurasian-related influx, which would agree with previous studies on adaptation, wherein the Ror and Jat have stood out for their high frequency of the lactase persistence allele (LCT-13910T) and the light-skin-color gene variant (SLC24A5).

The Rors and Jats also have higher frequencies of Lactase persistence and light skin color gene variant which makes the case of their more recent ancestry sharing, compared to other South Asians, with Northern Europeans or steppe groups stronger.


We also report that, relative to other South Asians, the Ror group has high shared drift with the EHG and Steppe_EMBA groups, higher allele sharing with the Steppe_MLBA group, and higher affinity with the Iron Age (prehistorical) and early historical first South Asian ancient sources (Figures S6A, S6B, S7, S8A, S8D, and S9 and Tables S9 and S16).

Finally the authors argue that the Rors are the best proxy for the ANI ancestry in South Asians,

In summary, we demonstrate a higher proportion of genomic sharing between PNWI populations and ancient EHG and Steppe-related populations than we observe in other South Asians.We report that the Ror are the modern population that is closest to the first prehistorical and early historical South Asian ancient samples near the Indus Valley, and they also harbor the highest Steppe-related, EHG, and Neolithic Anatolian ancestry. However, compared to other adjoining groups, the Ror show less affinity with the Neolithic Iranians. The Ror population can plausibly be used as an alternative proxy for ANI in future demographic modeling of South Asian populations.

The bar graph below explains it very well, where it can be seen that the proportion of the steppe orange component is higher among Rors and Jats than either the Pathans, the Brahmins or any other South Asian group.

The admixture proportions as per the qpAdm given in the Supplementary Table 11 and it is instructive to observe that the steppe_emba proportion for Rors is estimated at 57 % of total ancestry while for Jats it is 61 %. The same proportions for Brahmins from UP, Gujarat & Bengal are 46 %, 45 % & 44 % respectively. Even for Pashtuns from Afghanistan it is 52 % and for Kalash it is 58 %. Only the Yaghnobis and Pamiris from Central Asia are estimated to have a higher proportion of steppe_EMBA at 62 % & 67 % respectively.

Before moving forward it is necessary to point out that the light blue component observed in the admixture graph which is highest among the Northern Europeans is not the same as the steppe_EMBA or steppe_MLBA ancestry. Steppe_EMBA & Steppe_MLBA are an amalgation of the light blue, the dark blue (Anatolian-Farmer related) and the light green (Iran_N/CHG) components you see in the admixture graphs. So while the light blue component which peaks in Northern Europe is significantly less among South Asians, the light green component which correlates well with Iran Neolithic type ancestry, peaks in South Asia but it present at a lot less proportion among the northern Europeans.

Infact, the authors even stress that,

The Ror and Jat peoples stand out for having the highest proportion of Steppe_ MLBA ancestry (- 63%). The proportion of Steppe ancestry in the Ror is similar to that observed in present day Northern Europeans

Therefore, the predominance of the light blue component in Northern Europeans is not alone an indication that their ‘steppe’ ancestry is far higher than among South Asians.

Now, if steppe-related ancestry correlates with presence and spread of Indo-European languages, the above data clearly implies that the highest steppe-related and therefore IE ancestry among South Asians is among the Jats  & Rors, significantly higher than in other NW groups as well as Brahmins and Kshatriyas. Jats and Rors sampled for the study, live in Haryana & Western UP, which is the Vedic homeland.

It therefore supports the ancient Indian tradition according to which the region of Haryana & Western UP was the homeland of the Vedic people from where they spread out across Northern India. It can therefore be argued perfectly well, that the Brahmins and Kshatriyas in other regions have higher proportion of ‘steppe’ ancestry than the lower classes around them precisely because they have greater percentage of their ancestry derived from the ‘steppe’ rich people from the Vedic homeland. It has long been an argument that the ‘steppe’ ancestry in higher among the Brahmins and Kshatriyas than the lower castes across all regions of India  and that this was evidence of IE culture spreading in South Asia with the ‘steppe’ ancestry. But the example of Jats and Rors in Haryana puts to doubt all such claims. Instead, we can argue that the higher ‘steppe’ related ancestry in Upper Castes across India is a function of them having a greater portion of their ancestry from their Vedic forefathers who lived in Haryana & Western UP, just as is suggested by the Vedic tradition.

I may finally add that there is a closely related group based on close fst distances and similar admixture proportions that likely descends from the core group that was responsible for the spread of this ancestry into the Caucasus and the steppe. This group consists of Rors, Jats, Kalash, Pashtun, Pathan, Tajik & Pamiri. They have broadly similar levels of Iran_N (15 to 30 %), Steppe_EMBA (49 to 67 %) & Onge (15 to 25 %) as per the qpAdm modelling in table S11. Fst distances also indicate that they are quite closely related. For example, the Fst distance between Rors and Pamiris (0.0069), Pashtuns (0.0057) & Tajiks (0.0058) is similar to Fst distances of Rors with neighbouring groups like Kamboj (0.0088), Gujjar (0.0064), Khatri (0.0056), Brahmins (0.0052) & Kshatriyas (0.0062). Considering the fact that Rors (& perhaps Jats) haven’t probably admixed with Pamiris, Tajiks or Pashtuns since millenia, their Fst distances would have been even less initially. The other Indus Valley modern populations are also not very far off in terms of Fst distances with each other but the above groups seem to form a subset among them.

It is conceivable that an ancestral group related to these populations with similar levels of ancestry proportions as exhibited by them (but perhaps with lowel levels of AASI – since BMAC has only 5 % in comparison to Pamiris who have 15 %), spread out from North India to Central Asia and those from Central Asia venturing further towards Caucasus and from there onto the steppe.



The Evidence for Out Of India – Part I

I have earlier on in two of my posts here & here , cited several evidences which support migrations out of India or South Asia.

However, I haven’t put out all of these evidences in one place. So I shall, in this post, cite and quote as much as of the evidence as I am aware of. But I realise that if all of it was put in one post, it would become gargantuan and tedious to read. So I will break it up in 3 or 4 parts. This is the 1st part.  I would request Razib, Zack, Slapstik, Anan, Santosh, Vijay, Numinous, Fraxinicus, Snake Charmer & others to read and comment.


The Yamnaya cultural complex in the late 4th – early 3rd millenium BC on the Pontic Caspian steppe is currently the most favoured archaeological complex for the PIE or atleast the late PIE (i.e. IE languages after the early separation of Anatolian).

It has also been known since long that the Maykop culture in North Caucasus significantly influenced the genesis of the Yamnaya phenomenon and therefore some argue that the IE languages also likely spread to Yamnaya from the Maykop culture.

Now, it may be noted that Maykop shows several artifacts which are clearly of Central or South Asian origin. Let me quote the archaeologist Mariya Ivanova :-

Graves and settlements of the 5th millennium BC in North Caucasus attest to a material culture that was related to contemporaneous archaeological complexes in the northern and western Black Sea region. Yet it was replaced, suddenly as it seems, around the middle of the 4th millennium BC by a “high culture” whose origin is still quite unclear. This archaeological culture named after the great Maikop kurgan showed innovations in all areas which have no local archetypes and which cannot be assigned to the tradition of the Balkan-Anatolian Copper Age. The favoured theory of Russian researchers is a migration from the south originating in the Syro-Anatolian area, which is often mentioned in connection with the so-called “Uruk expansion”. However, serious doubts have arisen about a connection between Maikop and the Syro-Anatolian region. The foreign objects in the North Caucasus reveal no connection to the upper reaches of the Euphrates and Tigris or to the floodplains of Mesopotamia, but rather seem to have ties to the Iranian plateau and to South Central Asia. Recent excavations in the Southwest Caspian Sea region are enabling a new perspective about the interactions between the “Orient” and Continental Europe. On the one hand, it is becoming gradually apparent that a gigantic area of interaction evolved already in the early 4th millennium BC which extended far beyond Me- sopotamia; on the other hand, these findings relativise the traditional importance given to Mesopotamia, because innovations originating in Iran and Central Asia obviously spread throughout the Syro-Anatolian region independently thereof.

So the culture that apparently had a transformative effect on the steppe and probably brought the Indo-European languages on the steppe is said to have been in interaction  already with South Central Asia with several features of the Maykop phenomenon probably derived from there.

Now one may argue this influence seems to come to Maykop from Iran & Central Asia and how does South Asia come into the picture ? So let us quote a few passages from Mariya Ivanova’s book . In one of the sites, Dolmen 2, of the Maykop culture we find the following:-

The textile samples from Klady 31/5 consisted of very fine, 0.15-0.30 mm thin linen fibres. The threads were spun, plied, and dyed in two different hues of brown. In contrast, the fabric from Dolmen 2 was woven from a mix of wool with a plant fiber, possibly cotton

Cotton was rare in the bronze Age and is known earliest from Mehrgarh and Indus civilization in general. Its appearance in the Maykop culture links it with the Early Harappan phase. Even the wool found at the other site could have come from the East since the earliest evidence of wool was discovered at Shahr-i-Sokhta, a site which was in close contact with the Harappans (as we will see later) –

The earliest actual remains of a wool textile have been recovered at Shahr-i Sokhta I and date to the last centuries of the fourth millennium BC.

It may be noted that the earliest Indus Periphery sample that was recovered from Shahr-i-Sokhta dates to 3100 BC (i.e. precisely to the period when wool is found at the site).

Further, Ivanova discusses an axe of copper-lead alloy discovered from the Klady site –

The axe from Klady was not work-hardened and possibly never intended for use, as suggested by its elaborate decoration. It demonstrates that the north Caucasus belonged to an area of early use of copper-lead alloys. The earliest evidence for this material has been reported from the Indus valley and dates to the fifth millennium BC (Mille et al. 2004, 267). Terekhova (1981, 316) reports that during the Namazga II period artefacts of copper with high lead content were common at sites in the Kopet Dag piedmont.

So this copper-lead alloying technology possibly could also have spread from Early Harappans to Central Asia (Namazga) and from there to North Caucasus.

But this is not the only technology apparently borrowed from the Early Harappans. We may also note,

Finally, several metal artefacts such as animal figurines, axes with relief decoration, “forks”, and daggers with complex profiles were apparently produced in the lost-wax technique (Ryndina et al. zoo8). The earliest evidence for the lost-wax technique so far comes from the chalcolithic levels at Mehrgarh in north Baluchistan and dates to the fifth millennium BC (Mille et al. 2004, 267). Lost-wax casting was widespread in central and south- west Asia during the late fourth millennium BC.

So the Maykop culture also shows the use of lost-wax casting and this technique is also known to have been practiced the earliest by South Asians as demonstrated at the site of Mehrgah. This yet again shows the possible direction of exchange of technology and the important role of South Asia in it.

Ivanova goes on to describe the various copper based objects found at Maykop sites and then adds, “Most of the described copper objects have comparisons on the Iranian plateau and in central Asia, dating
to the early fourth rnillenniurn BC.


The absence of shaft-hole axes outside the Caucasus during the fourth millennium BC is striking. The only exceptions are probably the shaft-hole axes found in Level III, 6 at Mundigak in south Afghanistan.

Ivanova then moves on to the discovery of wheeled transport among the Maykop people,

Half of all third-millennium finds of wheeled vehicles between the Danube and Ural were uncovered in the region of Kuban. Trifonov (2004, note 2) interprets this striking concentration of early graves with wagons on Kuban as an indication of the spread of the wagon from this area into the steppe.

Pay attention to the bolded part. So the wheeled vehicle technology apparently reached the steppe from Maykop. Wheeled vehicle technolgy is considered by David Anthony as probably the most transformative technology that helped the steppe people spread their influence across a wide region in the form of the IE languages.

Now, let us note what Ivanova has to say about the appearance of the wheeled vehicles in Maykop itself,

The evidence for wheeled vehides dating to the preceding Maikop period, in contrast, is very tenuous… The vehicle from which the wheels at Novokorsunskaja originate might have been a two-axle wagon like the roughly contemporary wagon from Koldyri on the Lower Don (see Chapter 5). But it is also possible that the find from Novokorsunskaja was a two-wheeled cart. Clay· models of two-wheeled carts with rotating wheels attest to the use of this type of vehicle in central Asia and the Indus valley in the late fourth millennium BC. At Altyn-depe in south Turkmenistan, such models occur in the second half of the fourth millennium (Namazga III period) and become more common in the earlv centuries of the third millennium (Kircho 2009). Cattle figurines with holes in the withers for attaching the yoke have been recovered at Kara-depe (Kircho 2009, 30). Comparable models appeared in the Indus valley around 3500-3300 BC, during the Ravi-Phase of the Indus culture at Harappa.

So again, the link extends all the way to the Early Harappans and this time it is the technology which David Anthony deems as transformative for the spread of IE languages.

There is also evidence of long distance trade between Maykop and SC Asia. Cylinder seals of carnelian discovered at a Maykop site is similar to such seals found as far away as Shahr-i-Sokhta and Sarazm, both sites with significant Harappan contacts (as discussed later in the next post).

Another exotic item is undoubtedly the carnelian cylinder seal from Krasnogvardejskoe, engraved with a depiction of a stag and a tree (Fig. 4·5).145 A cylinder seal with a stag and a tree depicted in a remarkably similar manner has been found at Tepe Sialk IV… Further cylinder seals with animal depictions dating to the late fourth and early third millenniums have been unearthed at Sarazm III and IV in the valley of Zaravshan and at Shahr-i Sokhta I in Seistan.

Ivanova then proceeds to do a survey of the various animal figurines found on the Maykop vessels and concludes,

In conclusion, the north Caucasus, a geographic region which has  never been inhabited by lions, Asiatic gazelles and wild sheep, can be ruled out as the area in which the two silver cups frorn Maikop were designed and manufactured. The alluvial plains of Mesopotamia can be also excludedThe animal depictions from Maikop are unique indeed. However, they show resemblances to some of the objects from a hoard of five gold and seven silver vessels recovered during illicit excavations in 1966 near Fullol in north Afghanistan.

Further still,

Probably the most attractive and highly valued ornamental stone of exotic origin, lapis lazuli, has been found at only three sites north of the main Caucasus range… The most famaus source of lapis in the Old World is the mines of Sar-i Sang located in the Kokcha Valley in the Afghan province of Badakshan (Weisgerber 2004). Sources of secondary importance are situated in the Chagai hills in Pakistani ßaluchistan and in the Pamir mountains.

Ivanova finally sums it up as follows,

Let us consider the elements of central Asian origin in the material culture and technology of the north Caucasian societies. The most unambiguous evidence is provided by beads of colourful ornamental stones. Not only are the deposits of lapis lazuli, turquoise and possibly carnelian situated on the Iranian plateau and in the mountainous regions of central Asia, but the indirect supply with such materials via Upper Mesopotamia can be essentially ruled out. In the early fourth millennium lapis lazuli and turquoise were nearly absent in southwest Asia. Sites in Iran and central Asia, in contrast, provide ample evidence for the continuous exchange of these materials at least since the sixth rnillennium BC…An exotic fabric from Dolmen 2 at Klady, a rare red pigment from the same tomb, a unique bone pin with flat triangular head at Ust Dzheguta, and a carnelian cylinder seal with an engraved stag and tree motive from Krasnogvardejskoe all point to the Iranian plateau and its borderlands. Furthermore, the silver vessels with animal depictions from the kurgan of Maikop portray animal species which are native in Azerbaijan and west Iran, and resemble in style another gold vessel with animal decoration found at Fullol in north Afghanistan. Along with these exotic commodities, exotic ideas and technological knowledge reached the communities of the Maikop period from the southeast. Most shapes of locally made copper tools, for example, derive clearly from Iranian and not from Syro-Anatolian prototypes. Other technological peculiarities of the north Caucasus, like lost-wax casting, beads of gold and silver sheet over faience core, copper-lead alloys, copper-silver alloys, arsenic-nickel copper, use of silver and gold, manufacturing of metal vessels, may well originate from the “Irano-Afghan” cultural sphere, and not from Greater Mesopotamia. All these innovations were part of the technological systern in central Asia and Iran during the early fourth millennium BC.

So a whole host of artefacts of the Maykop culture ,that suppossedly brought ‘IE’ to the steppe, have their most likely origin in SC Asia. Isn’t this worth pondering ?

Ivanova also briefly comments on Central Asian and South Asian links,

Terracotta female figurines in central Asian (Geoksyur) style have been found in the lowest levels of Said-Qala in the Helmand valley and at pre-Harappan sites in the Indus valley… the presence of characteristic wheel-made pottery and seashell bracelets in Sarazm points to contacts with Baluchistan.

So we see a large interaction zone that starts in South Asia and stretches upto the North Caucasus. The direction of movement however is largely from Southeast to Northwest. This North Caucasian Maykop culture eventually ends up greatly influencing the steppe culture of Yamnaya which suppossedly spread the late Indo-European (PIE) culture. So one can see that South and Central Asia were already key drivers in an interaction zone that had the designated IE cultures on the steppe.


Having done a survey of the archaeological evidence we may now also look at some genetic evidences that directly link the Maykop and related cultures in the Caucasus with South Asia.


According to an ancient mtDNA study of 6 human samples from various Maykop sites –

…we used target-enrichment together with high- throughput sequencing to characterize the complete mitochondrial sequence of three Maikop and three Novosvobodnaya individuals. We identified T2b, N1b1 and V7 haplogroups, all widely spread in Neolithic Europe. In addition,we identified the Paleolithic Eurasian U8b1a2 and M52 haplogroups, which are frequent in modern South Asia, particularly in modern India

We have an mtDNA haplogroup M52, in one of the site of Maykop, which is today mostly found in South Asia and is likely of deep South Asian ancestry. Clearly its presence at one of the Maykop sites in the 4th millenium BC in North Caucasus along with the abundance of the artefacts of SC Asian origin that we discussed earlier, reveals that South Asians were indeed migrating towards Caucasus at that early period.

We have even more evidence still. In the 2016 study by Lazaridis et al which for the 1st time published the samples from Neolithic Iran, there were also 3 samples from Chalcolithic Armenia and this is what they say about these samples in the Supplementary Section of the paper –

All three males from this population belong to Y-chromosome haplogroup L1a-M27/P329. The M27 mutation is common in South Asian haplogroup L Y-chromosomes, but was absent in a survey of Y-chromosomes from Anatolia. Haplogroup L occurs at a very low ~2% frequency in present-day Armenians.

Considering the high frequency of this haplogroup among South Asians and its very low frequency among the present Caucasian or Near Eastern populations, this y-dna is also very likely of South Asian origin.

But y-dna L was not just found among the 3 samples from Chalcolithic Armenia. In a later study published this year which had a very comprehensive ancient sample set from the Caucasus, there were another 3 samples from late Maykop period which were designated as y-dna L but which was not resolved for its sublineages. Considering the presence of South Asian L1a already in Chalcolithic Armenia and also the presence of South Asian mtDNA M52, it may not be surprising that these L lineages were also L1a like those from Armenia.

Combine this with the fact that the y-dna J2a, a common haplogroup among South Asians and Iranians was also found in the Chalcolithic Caucasus samples and in the Narasimhan et al paper among the Chaocolithic & Bronze Age Central Asians. There are also many mtDNA found among these ancient Caucasian samples (mostly from Maykop sites) which are shared with South Asians (though not exclusively) such as T2a1, T2e, H2a1, H13ala, U4c1, U5a1b, U7b, U1a1a, U2e1b, R1a1a, I4a, W1.

The Narasimhan et al paper also shows us through several d-stats that the Armenia Chalcolithic and Armenia Early Bronze Age samples have quite a bit of ancestry sharing with Chalcolithic & Bronze Age Central Asia & Eastern Iran.

Therefore, not only do we find material evidence from the 4th millenium BC Caucasus of artefacts and technologies with South Central Asian origin like cotton, wheeled vehicles, copper-lead alloys, lost wax casting etc but we also see genetic evidence linking both regions with definite evidence of some human migration from SC Asia into the Caucasus. And from this same Caucasus region flowed the cultural assemblage and perhaps genes as well, which is said to have brought the IE languages to the steppe.

We can therefore observe that the Chalcolithic SC Asians were already linked to the suppossed IE cultures of the steppe during its early phases with influences going from SC Asia to Maykop and then steppe and not vice-versa.


Now for some evidence from animal genetics which further supports migration from South or Central Asia into the Caucasus but also the steppe.

Zebu is the South Asian native cattle which was domesticated in South Asia and is found from the earliest periods at the sites of Mehrgarh in Balochistan and Bhiranna in Haryana (both sites date to atleast 7000 BC and possibly even earlier)

According to this FAO webpage, “There are grounds for believing that zebus were raised on the territory of the present-day Azerbaijan 4000-4500 years ago. During the excavations of a stone burial ground in the vicinity of the city of Lenkoran the French archaeologist Jacques De Morgan unearthed and described a unique round seal of black and grey agate depicting a humped zebu bull covered with dense hair. This he dated to 2500-2000 B.C.

We can see from our earlier discussion that there was a transfer of technology and material assemblage with some human migration from SC Asia to the Caucasus during the Chalcolithic age. So it is very likely that the Azerbaijani Zebu could have reached the Caucasus around the same time.

But Zebu genetic inheritance is not just restricted to the Caucasus and it extends to the ancient steppe cattle as well.

According to a study of ancient and primitive European DNA, “The divergence of the BAI cattle as suggested by PCA (Figure 2c) can be attributed to an indicine genomic component which is identified in the ADMIXTURE (Figure 3) and D-statistics (Table 2) analyses. By analysing the genome-wide SNP markers, McTavish et al. (2013) and Decker et al. (2014) also reported an indicine influence on Italian cattle breeds. Using whole-genome sequences of ancient human DNA, Jones et al. (2015) and Haak et al. (2015) suggested massive migration of Yamnaya steppe herders as a source of dispersion of Indo-European languages to both northern-central Europe and India. These herders might also have mediated gene flow between Indian zebu and Ukrainian steppe cattle.

So here the authors are directly linking the presence of Zebu genetic admixture in the ancient Ukrainian steppe cattle to the Bronze Age steppe herders or in other words to IE expansions. But how would Indian Zebu get admixed in the ancient steppe cattle if it was not brought there by people ? Is it really probable that steppe pastoralists went all the way to SC Asia and brought back the Zebu cattle to the steppe on their return ? What is much more likelier to be the case is that people with origins in SC Asia, as they gradually migrated through Iran into the Caucasus and beyond probably also brought their Zebu cattle along with them.

But not only is there evidence of cattle migration from South Asia to the Caucasus and the steppe, there is also evidence of possible sheep migration.

As per this study on mtDNA diversity in Indian sheep,

Previous studies on mitochondrial DNA analysis of sheep from different regions of the world have revealed the presence of two major- A and B, and three minor- C, D and E maternal lineages. Lineage A is more frequent in Asia and lineage B is more abundant in regions other than Asia… The breed differentiation in Indian sheep was essentially due to variable contribution of two major lineages to different breeds, and sub- structuring of lineage A, possibly the latter resulting from genetic drift. Nucleotide diversity of this lineage was higher in Indian sheep (0.014 ± 0.007) as compared to that of sheep from other regions of the world (0.009 ± 0.005 to 0.01 ± 0.005). Reduced median network analysis of control region and cytochrome b gene sequences of Indian sheep when analyzed along with available published sequences of sheep from other regions of the world showed that several haplotypes of lineage A were exclusive to Indian sheep. Given the high nucleotide diversity in Indian sheep and the poor sharing of lineage A haplotypes between Indian and non-Indian sheep, we propose that lineage A sheep has also been domesticated in the east of Near East, possibly in Indian sub-continent.

Haplotype A is more widespread and frequent in Asia while mtDNA haplotype B is more common in Europe.

The above is an map from another paper which shows the relative distribution of various sheep mtDNA lineages. One can observe that the mtDNA A lineage (in Blue) predominates in Asia while mtDNA B (in Red) predominates in Europe. However, it is also evident that mtDNA A has a significant presence in the Caucasus as well as on the European steppe and Northern Europe. Considering its likely origin in South Asia and its presence in the Caucasus and steppe, this may again indicate that a sheep lineage spread out from South Asia to these two regions along with the Zebu cattle and was accompanied by human migration as well.

We also have evidence from actual animal aDNA.

There is ancient dog DNA from a Corded ware site that shows possible Indian dog and Indian wolf admixture. The study involved aDNA from 3 dog samples, one of which, labelled as CTC, came from a corded ware site in Germany. Corded ware culture, as many here would know, is considered the 1st IE culture in Europe proper which was heavily influenced by Yamnaya groups from steppe.

In a peer review file, the authors respond to one of the critics in the following manner,

A key finding of our analysis is indeed that CTC appears to share some ancestry that is predominantly found in modern Indian dogs (which we refer to as India-like as it is also found in Central Asian and Middle Eastern dogs). All analyses that incorporate some model of admixture show this. We would argue that visually CTC is approximately halfway between European and
Indian dogs rather than being “clustered with Indian dogs”, but nonetheless, any Indian-like component inferred from this PCA analysis is clearly significant, as correctly pointed out by the reviewer. CTC also appears to show a clear Indian component in both the NGSadmix and ADMIXTURE clustering analysis, on the order of 25% (so closer to modern Europeans and thus
consistent with the Neighbor joining tree analysis). The MixMapper and ADMIXTUREGRAPH analysis also point to other higher values of Indian-like admixture.

The below admixture graph from the same paper explains the likely path of admixture from Indian dogs into the Corded ware CTC dog sample.

Figure 5


Whether it be domesticated cattle, sheep or dogs, their movement is usually associated with the movement or migration of humans. In this case, we can see migration into the steppe of all three domesticated species from a likely origin in South Asia. So can we also detect some signs of human migration into the steppe. While we have clearly seen the signs of human migration from South & Central Asia into the Caucasus associated with a cultural and technological package, such evidence for steppe is much harder to come by. More so because of the absolute rarity of aDNA from South Asia.

But nevertheless, there is tentative evidence. Let us go through it.

There was this recent paper which was covered by Razib on his blog.

In it the authors make the following observation :-

The Bronze Age Srubnaya-Alakulskaya individuals from Kazburun 1/Muradym 8 presented genetic similarities to the previously published Srubnaya individuals.  However, in f4 statistics, they shared more drift with representatives of the Andronovo and Afanasievo populations compared to the published Srubnaya individuals. Those apparently West Eurasian people lacked significant Siberian components (NEA and SEA) in ADMIXTURE analyses but carried traces of the SA component that could represent an earlier connection to ancient Bactria. The presence of an SA component (as well as finding of metals imported from Tien Shan Mountains in Muradym 8) could therefore reflect a connection to the complex networks of the nomadic transmigration patterns characteristic of seasonal steppe population movements.

The South Asian ‘lilac’ component can be observed in trace components across most of the steppe samples above here. This ‘lilac’ component is also the one that peaks South Indian tribal groups like the ‘Mala’. So is this sign of AASI admixture on the steppe during the late Bronze Age ?

Even the Narasimhan et al preprint, observed that, “we examine the second set of outliers from this time period, those related to Turan. We see that these samples are drawn towards the BMAC on the PCA and have substantially higher proportions of Iranian agriculturalist related ancestry on the ADMIXTURE plot as well as by f4-statistics. These samples are at the southern end of our geographic sampling,  suggesting that the individuals of the Andronovo material culture horizon came into contact with the people of the BMAC at around 1500 BCE.

These outlier samples from the steppe dating to around 1500 BC, show evidence of BMAC admixture where models with the Swat samples or even the high AASI groups like Irula, work as good proxies for the 3rd ancestral source in these outliers.


However, it is possible that the Srubnaya individuals did not receive the AASI-like admixture directly but rather inherited through a steppe Yamnaya like ancestor. As can be seen below, even earlier steppe groups like the Yamnaya shows traces of AASI like ancestry component at k=15


The South Asian ‘lilac’ component is present also in a wide range of earlier steppe samples such as the Yamnaya and it is therefore likely that later steppe groups inherited it from these early steppe precursors rather than directly through BMAC contacts (which also looks quite possible).

The possibility of early southern admixture into the steppe during the Yamnaya period is also indicated by the qpAdm admixture model for the Yamnaya Ozera sample which can be modelled with a significant degree of confidence as having Sarazm_EN admixture. The 2 samples from Sarazm Eneolithic from Tajikistan dating to 3500 BC are the earliest Central Asian samples with AASI admixture. The Sarazm samples are also closer or share more alleles with Indian tribal groups with high AASI, in comparison to the BMAC main cluster which themselves have around 5 % AASI. Such a group with close genetic linkages to SC Asians acting as a good proxy for the southern admixture on the steppe again reinforces that some sort of genetic admixture ultimately originating from SC Asia may have mixed into the steppe groups.

Such southern admixture in the main sample set of Yamnaya can also not be discounted since the Yamnaya is already modeled as part Iran_Chl which is not so distant to populations like Sarazm_EN.



We may end by discussing the historical knowledge we glean from Greek sources regarding some tribes living in the Kuban valley of the North Caucasus.

Traces of Aryan tribes and names are also found in the northern Black Sea area in the Kuban region, as well as in the vicinity of the Caspian. These Aryan words, more archaic than the later Iranian forms,include the river name Sindes and the tribes Sindoi, Sindoi (in classical sources; cf. Skt. Sindhu- ‘Indus River; India’); the Kuban River (Cuphis, Kouphis, cf. Skt. Kubha ‘Kabul River’ in the Rigveda); the Caspian tribe Das- (cf. Skt. DAsa, dasyu- ‘foreign tribe’, OPers. dahyu- ‘province, district’): see Kretschmer 1943 (1944); cf. Uslar 1881:451, Thumb and Hauschild 1958:1:1.87, Trubacev 1976; Trubacev 1977 gives a list of names from this region with possible Indo-Aryan etymologies. The isolated location of these places north of the Caucasus gives reason to assume that Aryan tribes entered the area from the Near East via the Caucasus.

How and when these South Asian groups reached the Caucasus where they were recorded in antiquity is a matter of curiosity.

I shall in the next two posts, focus on evidence of Bronze Age human migrations from South Asia into Eastern Iran, Central Asia, Mesopotamia, Magan, Dilmun, Syria and the Tarim Basin.


Why AMT is far from proven and why it makes no sense for Indian history

In the recent podcast and its follow-up post, an argument has been made that AIT/AMT or more specifically the steppe migration into South Asia in the 2nd millenium BC is already proven by genetics and that there is no hope for OIT. It has also been insinuated that OIT is being propogated more due to politics and that there is hardly any data or rationale behind it.

While in my two previous posts, I have shown data that supports OIT, I shall again breach this topic but I shall approach it in a different manner and also bring in a lot more data that I have not touched in my earlier posts.

I shall in this post, explain why AMT makes little sense. In the next post, I shall present archaeological and genetic data that supports OIT.

I see that a lot of people outside India can’t seem to get why Indians are so opposed to the AMT as apparently it is most ‘scientific’ and is supported by linguistics, archaeology and now genetics. Let me just put it in brief why Indians like me oppose AMT .


Many Hindus have grown up with stories from the Mahabharata, Ramayana and the various Puranas. They have been told that this is the history of their ancient forebears. Most Hindus believe that it is history and they grow up being proud of this great heritage. And this has been happening since millenia. It is since millenia that Indians have been growing up with the stories of the Ramayana, Mahabharata & Puranas and of other Vedic literature. They have in the ancient period even spread this knowledge  across the vast expanse of SE Asia where the indigenous people still perform plays based on these Indian epics.

It is a well-known fact that the Puranas stop narrating history around the time when the Guptas were about to become the pre-eminent powers in South Asia i.e. around 2000 years ago. All the history narrated by Puranas is therefore of a period earlier than 2000 YBP. It is also well-known that without the help of the Puranas a lot of ancient Indian history would be still remain unknown. That the Puranas have historical information is unquestionable.

However, the Puranas date the ancient events it dates events like the ascension of Mahapadma Nanda, the Nanda emperor – a historical person, from the reference point of events mentioned in the Mahabharata like the birth of Parikshit who was born during the Great War of the epic.

The Mahabharata event is believed to have happened in the 32nd century BC and there is a very strong consistent tradition behind it. Astronomers such as Brahmagupta, for example, have said that there is  a lapse of 3179 years between the Saka Era of 78 CE and the Kaliyuga Era. The Aihole inscription of the Chalukya Emperor Pulakeshin (who defeated Harshavardhana), dates the inscription in both the Saka & the Kaliyuga Era. Kalhana mentions it as well though he seems to have fallen into confusion between Kaliyuga Era and the period of the Pandavas.

All traditional histories, whether it be the very ancient narratives of the Puranas or it be Kalhana’s Rajataringini (of Kashmir) or be it the Gopala Rajavamshavali (of Nepal) date their histories in reference of the Era of the Pandavas and the Kaliyuga.

And if that is not enough, as per the Epics and the Puranas, there are scores of Kings and leaders who preceded the kings of the Mahabharata period. In the Suryavamsha (the Solar Lineage), there is a list of 95 successor kings from the time of the Great Father of All, Manu Vaivasvata. Manu or Manusha is cognate with the father of Germanic people, Mannus. Therefore even his name is of IE origin.

Megasthenes, around 320 BC, relates and Indian tradition, which is no longer extant, according to which 6451 years and 154 kings have elapsed between the 1st Indian king, whom Megasthenes calls Dionysus, and Sandrocottus or Chandragupta. This Dionysus was none other than Prthu Vainya, who is said to lived even earlier to Manu Vaivasvata in the 6th epoch, while Manu Vaivasvata is the father of all mankind in the 7th epoch. Manu Vaivasvata is also the same person who is known in the Semitic tradition as Noah or Nuh (Arabic).

So all of this gargantum ancient tradition which is believed to be several thousands of years old and which has been handed down to Indians for millenia is to be thrown away ? Based on what ? What exactly have the Europeans shown that disproves of discredits this ancient tradition ?

William Jones in the late 18th century, laid the groundwork to bring down the great antiquity claimed by the Indians for their own history. He did this guided by the very ‘scientific’ belief of the enlightened Europeans of his age that Earth was only 6000 years old, so how the hell can the Indians claim such absurdly ancient dates for their history !

So in William Jones’ time began the effort by the British to cram down all of the ancient history claimed by the Indians from the time of Manu, in a timeframe not older than 2000 BC. It is in following this great tradition set by Jones, that Max Muller pulled out the date of 1500 BCE for the arrival of the Aryans. Ofcourse, in all of the ancient Vedic and Puranic or Epic tradition, which talks of great ancient events and personalities of importance, there is never a single mention of Indo-Aryans coming into South Asia from Central Asia or from anywhere else. Their sacred home and their sacred places are all in South Asia.

So, you want the Indians to believe that all of their ancient historical traditions which they have been handed down from their forefathers for thousands of years are of exaggerated antiquity and that they are in actual fact of much later time period and whats more so, the progenitors of this great cultural tradition of yours only came to South Asia in 1500 BC. Before that they were not here and all of your tradition that claims otherwise is worthless. It also does not matter that there is no mention of such migration in Indian tradition anywhere.


One may accept such a extraordinary proposition if it was based on some concrete data such as archaeological evidence. But where was archaeology in the late 18th and most of the 19th century when these theories were proposed ? Even after that, where has archaeology produced evidence of Indo-Aryan migration into South Asia ?

According to JP Mallory, who along with David Anthony is the leading proponent of the steppe theory,

The archaeological evidence for an expansion from the steppelands across historical Iran and India varies from the extremely meagre to total absence: both the Anatolian and the Kurgan theory find it extraordinarily difficult to explain the expansion of the Indo- European languages over a vast area of urbanized Asian populations, approximately the same area as that of Europe.


This is indeed the problem for both the Near Eastern and the Pontic-Caspian models and, following the logic of this analysis, the Bouckaert model appears to be in the same boat. All of these models apparently require the Indo-European languages (including their attendant agri- cultural vocabulary) to be superimposed/adopted by at least several major complex societies of Central Asia and the Indus… In any event, all three models require some form of major language shift despite there being no credible archaeological evidence to demonstrate, through elite dominance or any other mechanism, the type of language shift required to explain, for example, the arrival and dominance of the Indo- Aryans in India… But all theories must still explain why relatively advanced agrarian societies in greater Iran and India abandoned their own languages for those of later Neolithic or Bronze Age Indo- Iranian intruders.

An archaeological supplement with a recent aDNA paper informs us thus –

This survey of the archaeological and biological record of southern Central Asia yields four important findings. First, contacts between the sedentary food-producing populations of the Namazga culture populations residing in Kopet Dagh piedmont and Geokyur oasis of southern Turkmenistan who likely established the outpost at Sarazm had little to no contact with populations residing in the southern steppe zone. Second, contacts between Bronze Age steppe populations and NMG V and BMAC populations appears to have been one in which the dynamic of cultural influence was stronger on the side of the well-established sedentary food- producing populations, and this resulted in the partial assimilation of these initial newcomers to the region both culturally and, to a lesser degree, biologically as well. Third, not all of those who emigrated from the north turned to farming but may have continued a semi-nomadic existence in the highlands, which were unsuitable for the kind of intensive farming practiced in the BMAC homelands or in the regions of Khorezm. Fourth, if there was any Central Asian influence on South Asian populations, that influence likely long predated any development of Iranian, let alone Indo-Aryan, languages, and most likely occurred during the late NMG IV to early NMG V period (ca. 2800–2300 BCE) and even earlier during the Eneolithic from Kelteminar culture groups (4000–3500 BCE).


So we have zero archaeological evidence of the so-called steppe migration of Indo-Iranians into Iran & South Asia and even in Central Asia, the interactions between the steppe people and BMAC was such that the former were assimilated into the latter groups (which would have led to them adopting the BMAC language(s)), yet we are to accept that the whole region from Iran, Central Asia and South Asia having some of the most densely populated and advanced civilizations of their time, simply just switched over to the languages of the steppe nomads (who are archaeologically invisible) in such a comprehensive manner that not a trace of the earlier languages of these advanced civilizations exist and what is ubiquitous today all across this vast region is only the language of these invisible steppe nomads. How Incredible !

To give you a perspective, around the same time, the IE Hittites and Mitanni ruled over vast kingdoms in Anatolia & Syria respectively, and that too for many centuries. Where are these Indo-Europeans now in the Near East ? Not a trace of them exists. Yet in South Asia (where the civilizational expanse was much vaster than in the Near East), where the steppe nomads are not even archaeologically visible, we are to believe that the IVC language became invisible while the steppe language reigned supreme.



So we have no archaeological evidence of even a trace of migration from the steppe when we would require a very substantial migration from the steppe to make such a major cultural, religious and linguistic impact on the IVC to make them totally adopt the steppe language and forsake their own languages. And we do not even have any mention of a migration in any of the ancient Indian texts which on the contrary claim a much greater antiquity of IE culture and tradition within South Asia. So why should we people of South Asian origin be compelled to accept this hopeless theory ? There is very good reason to believe that our history is being misinterpreted and our real history is being robbed from us. So why should we lie down and let them walk all over us ?

It may be noted that the very reason the Europeans started archaeological digs in the late 18th & early 19th century in the Near East which lead to the discovery of the great Bronze age civilization of Mesopotamia & Egypt was the mention of ancient Great civilizations in the Bible. So the historical memory preserved in religious texts is very important and not to be dismissed lightly. Moreover, the Bible and Torah are Semitic books written originally in a Semitic language that preserved a memory of Semitic people in Bronze Age Near East. We now know that Semitic speakers were widespread in Bronze Age Near East.

Similarly, the historical memory from ancient Indian texts can help us a great deal in reconstructing the early history of Indian civilization. According to the Vedic literature, the Saraswati region around Haryana and parts of Western UP was the Vedic homeland, and we may note that some of the earliest Early & Pre-Harappan sites such as Kunal, Bhiranna, Farmana, Rakhigarhi etc are now known from Haryana. The fact that this ancient memory of Indian civilization is preserved in an IE language (Sanskrit) should also give a clue that IE languages have a deep history in South Asia intimately connected with the IVC. This is just plain common sense. Or else, we would have to argue that the deep history of Indian civilization was noted by the IVC inhabitants in a Dravidian or some other unknown language and then with the invasion of the Indo-Aryans, they translated that knowledge into Sanskrit which is a bit of a stretch and is also unsustainable because the earliest figures of Vedic literature such as Manu are also early father figures in other IE tradition such as Germanic.


Finally, it may be argued that however dicy the arguments of AMT are and however little evidence there is to support it through archaeology or Sanskrit texts, genetic data has still managed to prove AMT.

Well, that is also not really proven.

We have aDNA from South Asia (dozens of them) that date only from 1200 – 300 BC i.e. well after the so-called AMT and they are mostly from North Pakistan.

We only have 3 Bronze Age samples designated as Indus_Periphery and that too not from South Asia but from Eastern Iran and Central Asia.

These 3 samples do have steppe-related ancestry but they have it less as compared to the Swat samples and many of the modern IE speakers from South Asia.

It has not however been proved that Steppe_mlba related ancestry was absent in Indus_Periphery and came into South Asia only after 1500 BC. It has only been proved that steppe_mlba related ancestry is found in greater proportion among Swat samples and among many of the modern IE groups.

But how does that prove that the extra steppe-related ancestry only came from the Steppe_mlba ? Especially when the y-dna marker of steppe_mlba groups R1a-Z93 is so clearly lacking in all the Swat samples.

And by no stretch of imagination can be assume that the 3 Indus_periphery samples are an accurate representation of the enormous genetic diversity that would have existed in the Indus civilization. Yet the research explicitly makes this assumption that Indus_Periphery captures the genetic diversity of the IVC. However, realistically, in all likelihood, there would have groups in the Indus civilization whose steppe-related ancestry was much higher than those of Indus_periphery. Therefore unless we make sure that such is not the case, how can one argue that the extra steppe-related ancestry in modern South Asia and in ancient Swat was because of a steppe migration,

-a migration for which there is no archaeological evidence,

-there is lack of steppe marker R1a-Z93 in Swat.

There is also little genetic evidence of steppe impact on BMAC and of BMAC genetic impact on South Asians.

All things considered, there is every reason from Indians and South Asians in general to be very skeptical of the AMT.


A Tentative OUT OF INDIA Model To Explain The Origin & Spread Of INDO-EUROPEAN LANGUAGES


The Roots of Indo-Iranian cultural genesis


The Last 2 months have produced a flurry of ancient DNA studies that have given us results with enormous implications for the spread of Indo-European languages. Incorporating the results of these studies along with linguistic and archaeological evidence, we can create a model of spread of Indo-European languages from SC Asia to other parts of Eurasia.

Image result for The Indo-European languages


Johanna Nichols had produced, more than 2 decades ago, a wonderful model for the spread of the Indo-European languages from its locus in Central Asia. Her thesis was spread over two articles in two volumes. According to her –

Several kinds of evidence for the PIE locus have been presented here. Ancient loanwords point to a locus along the desert trajectory, not particularly close to Mesopotamia and probably far out in the eastern hinterlands. The structure of the family tree, the accumulation of genetic diversity at the western periphery of the range, the location of Tocharian and its implications for early dialect geography, the early attestation of Anatolian in Asia Minor, and the geography of the centum-satem split all point in the same direction: a locus in western central Asia. Evidence presented in Volume II supports the same conclusion: the long-standing westward trajectories of languages point to an eastward locus, and the spread of IE along all three trajectories points to a locus well to the east of the Caspian Sea. The satem shift also spread from a locus to the south-east of the Caspian, with satem languages showing up as later entrants along all three trajectory terminals. (The satem shift is a post-PIE but very early IE development.) The locus of the IE spread was therefore somewhere in the vicinity of ancient Bactria-Sogdiana. This locus resembles those of the three known post-IE spreads: those of Indo-Iranian (from a locus close to that of PIE), Turkic (from a locus near north-western Mongolia), and Mongolian (from north-eastern Mongolia) as shown in Figure 8.8. Thus in regard to its locus, as in other respects, the PIE spread was no singularity but was absolutely ordinary for its geography and its time-frame.

To summarize the important points of dialect geography in the Eurasian spread zone, the hallmark of a language family that enters a spread zone as an undifferentiated single language and diversifies while spreading is a multiple branching from the root. This is the structure of the IE tree, which has the greatest number of primary branches of any known genetic grouping of comparable age. The hallmark of developments that arise in or near the locus is that they appear along more than one trajectory. This is the distribution of the centum/satem division in IE, and in the later Indo-Iranian spread it is the distribution of the Indo-Aryan/Iranian split (as argued in Nichols, Volume II). The reason that dialect divisions arising in the locus show up along more than one trajectory is that the Caspian Sea divides westward spreads into steppe versus desert trajectories quite close to the locus and hence quite early in the spread. In contrast, developments that occurred farther west, as the split of Slavic from Baltic in the middle Volga may have, continue to spread along only one trajectory.This is why the Pontic steppe is an unlikely locus for the PIE spread. (THE EPICENTRE OF INDO-EUROPEAN LINGUISTIC SPREAD – pgs 137-138)

She further states in her 2nd article,

IE homeland studies so far have had to resolve the dilemma of how to reconcile conflicting lexical evidence about the IE homeland. Were the Indo-Europeans pastoralists or agriculturalists? The lexical evidence can be used to support both viewpoints (for a summary and argument in favour of agriculture see Diebold 1992). If they were a people of the dry grasslands, how do we explain the presence in their language of words for ‘beaver’, ‘birch’, and ‘oak’, the latter with extensive mythic and cultural salience (Friedrich 1970:129ff.)? If they were steppe pastoralists, how do we explain the presence of words for ‘double door’ and ‘enclosed yard or garden’ suggestive of dwellings in the urban Near East (Gamkrelidze and Ivanov [1984:741ff.] 1994:645ff.)? If they were nomadic herders of the plains, how is the presence of a word for ‘pig’ explained? A homeland reconstructed as locus, trajectory and range removes the dilemma: a locus in the vicinity of Bactria-Sogdiana implies a spread beginning at the frontier of ancient Near Eastern civilization and a range throughout the steppe and central Asia, following the east-to-west trajectory, with occasional or periodic spreads into the Danube plain and Anatolia. The PIE ecological and cultural world, then, included the forested mountains southeast of the Kazakh steppe, the dry eastern steppes, the Central Asian deserts, the urbanized oases of southern Turkmenistan and Bactria-Sogdiana, the eastern extension of the urban Near East, the rich grasslands of the Black Sea steppe, the southern edge of the forest-steppe zone and the Siberian taiga, fresh-water lakes, and salt seas (the Aral and Caspian). The economy of the Indo-Europeans included dry-grasslands pastoralism, settled farming, mixed herding and farming, and trade, including not only trade between farmers and herders in central Asia but also, importantly, control of the antecedents to the Silk Route and the trade connections with India to the south. This economic and ecological diversity is reflected in the vocabulary of PIE. (THE EURASIAN SPREAD ZONE & INDO-EUROPEAN DISPERSAL, pg 233)

Nichols dates the breaking of IE languages between 4000 – 3300 BCE. This is contemporary to the Chalcolithic aDNA samples we now have from Central Asia, Iran, the Caucasus, Anatolia and the steppe. But before proceeding with the genetic evidence let us also have a glance at the archaeological evidence.

Image result for hittites


Continue reading “A Tentative OUT OF INDIA Model To Explain The Origin & Spread Of INDO-EUROPEAN LANGUAGES”


The Roots of Indo-Iranian cultural genesis

Here is my take on the significance of South Asian aDNA from Eastern Iran and Central Asia during the Bronze Age –

The Chalcolithic contacts between South Asia and regions immediately to its East & North i.e. Eastern Iranian cultures such as Jiroft or Halil Rud (from sites such as Jiroft & Konar Sandal) & Helmand (Shahr-i-Sokhta) as well as Central Asia (from sites such as Geoksiur or Sarazm) are not so well documented. This is an unfortunate lacunae that needs to be filled up in the near future because the Chalcolithic appears to be a critical phase where the communication channels within this vast region are likely to have become more intensified leading to a process of urbanism and continuing well upto the downfall of these urban civilizations.

Nevertheless, there are some tantalising and very important clues for this period that can have larger repurcussions as more research is done but I will come to that later.

Let me first point out the archaeological and genetic evidence we have for the 3rd millenium BC.

First let us note the evidence of interaction between the Helmand civilization (exemplified by sites such as Shahr-i-Sokhta & Mundigak)

A series of artefacts found at Shahr-i Sokhta and nearby sites (Iranian Seistan) that were presumably imported from Baluchistan and the Indus domain are discussed, together with finds from the French excavations at Mundigak (Kandahar, Afghanistan) that might have the same origin. Other artefacts and the involved technologies bear witness to the local adaptation of south-eastern manufactures and practices in the protohistoric Sistan culture. While the objects datable to the first centuries of the 3rd millennium BCE fall in the so called “domestic universe” and reflect common household activities, in the centuries that follow we see a shift to the sharing of luxury objects and activities concerning the display of a superior social status; but this might be fruit of a general transformation of the archaeological record of Shahr-i Sokhta and its formation processes.

The above is part of the abstract from this paper – Continue reading “The Roots of Indo-Iranian cultural genesis”