Disclaimer
Please note I am a dentist â not a geneticist â and I do not claim formal expertise in this field. I have a long-standing interest in history and look to archaeogenetics as one of the best tools available for addressing some of the most enduring questions about South Asian origins and identity.
Credit is due to the many researchers, bloggers, and science communicators who have made this field accessible â including Razib Khan (whom I havenât met, though he happens to be a fellow Bengali), who’s writing first inspired me to engage deeply with these questions.
I first got into Ancient Indian DNA space when the research from the Rakhigarhi womanâs DNA was published- shedding light on IVC genetics for the first time. Prior to that Indian society had no idea how to scientifically answer questions like whether the âAryan-Invasionâ was real. Everything you heard or read was just some dude’s opinion.
If the Aryan invasion was real didnât the Indus Valley civilization prove the indigenous but conquered âDravidiansâ were the ‘superior’ culture builders and the Aryans were the barbarians?
If they spoke Dravidian languages, did it mean they looked South Indian… and that after the invasion they were all somehow transplanted into the actual South of the peninsula – while the Indus region was taken over by⊠White people??
These were the conundrums that filled my mind and it was due to the work of some of these excellent pioneers in the genetics field (Shinde, Narasimhan, Reich and Razib) that I went down the rabbit hole of answers.
At some point as more research came out about the Indus people and the genetics of the caste system, there seemed to sprout the idea dimly apparent but visible if you looked – that Dravidians themselves as a language group may not be native to India.
Now I wasnât completely sold. Like many of you I wonder why it canât be the other way around.
Why did neolithic Iranians have to come into India â why canât it be that neolithic northwest Indians went to the Iranian plateau? I donât have all the answers, and itâll take a while for all the questions to be settled. Till then I try to keep my biases in check and follow the research.
A paper came out in Nature in October shedding light on the possible genetic basis to the origins of the Dravidian language family.
The researchers sampled a small tribe â the Koraga from Southern India speaking a Dravidian language.
Now we all know the main cline of Indians come from Steppe, Indus farmer and AASI in the literature.
New finding from this paper: Proposed fourth ancestral component â which they call âProto-Dravidianâ ancestry â found in the Koraga, which branched off around 2,400 B.C.
The origin of this ancestry seems to lie in the region between the Iranian plateau and the Indus Valley before the arrival of Indo-European languages. It persists in diluted form in many South Asian communities (not just in the tribal population) today, especially Dravidian speaking ones.
The researcher purport this may provide support to the Elamo-Dravidian language family hypothesis, which would provide a common source of languages for all populations derived from Neolithic Iranians back to one of the centres (Zagros) where farming was invented.
What does this research imply about the ancestry of the IVC people and what language they spoke?
What does it imply about the so-called âIndus farmerâ component of Indian ancestry?
The Indus farmer as one of the three key sources of Indian ancestry was thought to represent the core ancestry of the IVC.
This study with the Koraga finds a distinct 4,400-year-old branch of the main Neolithic Iranian and not identical to the core âIndus Farmerâ. Genetically it’s closer to current Dravidian speaking rather than Indo-Aryan speaking populations.
There might have been two (or multiple) Iranian-related influxes into the subcontinent:
- An earlier, associated with Neolithic spread of agriculture (â 7000â5000 BCE â the IVC base population),
- at least one later sub-branch (~2400 BCE) that corresponds to this âProto-Dravidianâ ancestry.
So, the IVC population may have contained multiple Iranian-related sub-lineages, one of which could have seeded the Proto-Dravidian gene pool that persisted in southern India.
The 4,400-year-old date inferred for the Proto-Dravidian ancestry roughly matches the mature-to-late IVC period (2600â1900 BCE).
While genetics canât âproveâ a language, the chronological and geographical overlap adds circumstantial weight to the idea that the language(s) of the IVC were Dravidian or Proto-Dravidian in nature.
Wasnât the mature period of the IVC already in full swing 4,400 years ago?
The Indus Civilization wasnât a single static populationâit extended from Baluchistan and Sindh up to Haryana and Gujarat, and its regional sub-populations may have had slightly different genetic mixtures.
Local differentiations could easily occur within that 700-year mature phase.
So the âKoraga-likeâ branch could represent:
- a southern or interior offshoot of the broader IVC population, or
- the southernmost fraction of IVC peoples who moved into peninsular India as the civilization urban centres declined (post-1900 BCE).
What does it suggest about Brahui or Burushaski or Nihali languages; the three âmysteryâ languages?
Â
Brahui â the case is strengthened for this language to be a relict of an IVC-era language and not a recent back migration from South to North. These speakers genetically have the same ancestral components of the Indo-Aryan speakers around them rather than the amplified “proto-Dravidian” component that you would expect from migrants from the South.
Burushashki – Fits a scenario where it could descend from a pre-Indo-Aryan, pre-Dravidian language of the upper Indus region â maybe a branch of the same broad macro-family that included IVC languages.
– may preserve a northern dialect of the IVC-era linguistic landscape â an (adopted) sister, not daughter, to Proto-Dravidian which survived in a mountain redoubt while Indo-Aryan took over the plains.
Nihali – may descend from one of these pre-Munda, pre-Dravidian forager populations, preserving fragments of a Mesolithic or Neolithic substratum language.
This fits with the new studyâs timeline:
- the IVC-Dravidian connection crystallized after 2600 BCE,
- but eastern and central India already had long-established hunter-gatherer cultures with minimal Iranian-farmer input.
Nihali probably represents a pre-agricultural linguistic substrate, not Dravidian or IVC-derived at all â more of an AASI linguistic survivor.
The roots of what became Jati
I wonât go too in depth as the field here is still a lot of theory without definitive answers â this paper taken with Narasimhan (2019) and Shinde (2019) finds that at least 4,400 years ago IVC was already a genetically distinct and relatively stable mix of Iran-related farmer ancestry + South Asian forager ancestry, with internal substructure.
Â
- Significant regional genetic variation between different Indus sites
- Long-term persistence of ancestry profiles â meaning mixing across the entire IVC zone was not completely random.
Does that mean âcasteâ?
Â
Not strictly in the Vedic sense with Varna, Jati as far as we can see but it could mean that:
- There were endogamous or semi-endogamous groups within the IVC.
- These groups likely corresponded to occupational, regional, or lineage divisions, rather than religious varnas.
- The cultural habit of limiting marriage circles could therefore be much older than the Indo-Aryan social codes that later memorialized it as varnaâjati.
- The original social boundaries may be linked to pre-agriculture tribal or kinship identities that were maintained even throughout the period of early urbanization/integration.
We know from the paper: 50,000 years of evolutionary history of India: Impact on health and disease variation â India has long been a reservoir for human DNA.
Old groups are preserved alongside but separate from the new⊠far longer than other subcontinents where homogenization cause their disappearance.
This was true for archaic hominins living in India before Out of Africa (according to the archaeological record) and continued to be true afterward even through to the coming together of these different groups as they built the first cities.
————————————————–
This paper does not settle the origins of the Dravidian languages or linguistic identity of the IVC.
It’s my first article so if people want to share feedback â please feel free and keep it respectful â thanks. If there is interest I have more ideas planned like the history of endogamy in South Asia and the intersection across culture, politics and identity from the perspective of a second generation Indian (West Bengal) Australian.
Warm Regards,
Dr Rajorshi
