A Medley of Potpourri: Apr 30, 2022

Saturday, April 30, 2022

Radar signal characteristics

From Wikipedia, the free encyclopedia

A radar system uses a radio-frequency electromagnetic signal reflected from a target to determine information about that target. In any radar system, the signal transmitted and received will exhibit many of the characteristics described below.

The radar signal in the time domain

The diagram below shows the characteristics of the transmitted signal in the time domain. Note that in this and in all the diagrams within this article, the x axis is exaggerated to make the explanation clearer.

Carrier

The carrier is an RF signal, typically of microwave frequencies, which is usually (but not always) modulated to allow the system to capture the required data. In simple ranging radars, the carrier will be pulse modulated and in continuous wave systems, such as Doppler radar, modulation may not be required. Most systems use pulse modulation, with or without other supplementary modulating signals. Note that with pulse modulation, the carrier is simply switched on and off in sync with the pulses; the modulating waveform does not actually exist in the transmitted signal and the envelope of the pulse waveform is extracted from the demodulated carrier in the receiver. Although obvious when described, this point is often missed when pulse transmissions are first studied, leading to misunderstandings about the nature of the signal.

Pulse width

The pulse width ( $\tau$ ) (or pulse duration) of the transmitted signal is the time, typically in microseconds, each pulse lasts. If the pulse is not a perfect square wave, the time is typically measured between the 50% power levels of the rising and falling edges of the pulse.

The pulse width must be long enough to ensure that the radar emits sufficient energy so that the reflected pulse is detectable by its receiver. The amount of energy that can be delivered to a distant target is the product of two things; the peak output power of the transmitter, and the duration of the transmission. Therefore, pulse width constrains the maximum detection range of a target.

Pulse width also constrains the range discrimination, that is the capacity of the radar to distinguish between two targets that are close together. At any range, with similar azimuth and elevation angles and as viewed by a radar with an unmodulated pulse, the range resolution is approximately equal in distance to half of the pulse duration times the speed of light (approximately 300 meters per microsecond).

Radar echoes, showing a representation of the carrier

Pulse width also determines the radar's dead zone at close ranges. While the radar transmitter is active, the receiver input is blanked to avoid the amplifiers being swamped (saturated) or, (more likely), damaged. A simple calculation reveals that a radar echo will take approximately 10.8 μs to return from a target 1 statute mile away (counting from the leading edge of the transmitter pulse (T₀), (sometimes known as transmitter main bang)). For convenience, these figures may also be expressed as 1 nautical mile in 12.4 μs or 1 kilometre in 6.7 μs. (For simplicity, all further discussion will use metric figures.) If the radar pulse width is 1 μs, then there can be no detection of targets closer than about 150 m, because the receiver is blanked.

All this means that the designer cannot simply increase the pulse width to get greater range without having an impact on other performance factors. As with everything else in a radar system, compromises have to be made to a radar system's design to provide the optimal performance for its role.

Pulse repetition frequency (PRF)

In order to build up a discernible echo, most radar systems emit pulses continuously and the repetition rate of these pulses is determined by the role of the system. An echo from a target will therefore be 'painted' on the display or integrated within the signal processor every time a new pulse is transmitted, reinforcing the return and making detection easier. The higher the PRF that is used, then the more the target is painted. However, with the higher PRF the range that the radar can "see" is reduced. Radar designers try to use the highest PRF possible commensurate with the other factors that constrain it, as described below.

There are two other facets related to PRF that the designer must weigh very carefully; the beamwidth characteristics of the antenna, and the required periodicity with which the radar must sweep the field of view. A radar with a 1° horizontal beamwidth that sweeps the entire 360° horizon every 2 seconds with a PRF of 1080 Hz will radiate 6 pulses over each 1-degree arc. If the receiver needs at least 12 reflected pulses of similar amplitudes to achieve an acceptable probability of detection, then there are three choices for the designer: double the PRF, halve the sweep speed, or double the beamwidth. In reality, all three choices are used, to varying extents; radar design is all about compromises between conflicting pressures.

Staggered PRF

Staggered PRF is a transmission process where the time between interrogations from radar changes slightly, in a patterned and readily-discernible repeating manner. The change of repetition frequency allows the radar, on a pulse-to-pulse basis, to differentiate between returns from its own transmissions and returns from other radar systems with the same PRF and a similar radio frequency. Consider a radar with a constant interval between pulses; target reflections appear at a relatively constant range related to the flight-time of the pulse. In today's very crowded radio spectrum, there may be many other pulses detected by the receiver, either directly from the transmitter or as reflections from elsewhere. Because their apparent "distance" is defined by measuring their time relative to the last pulse transmitted by "our" radar, these "jamming" pulses could appear at any apparent distance. When the PRF of the "jamming" radar is very similar to "our" radar, those apparent distances may be very slow-changing, just like real targets. By using stagger, a radar designer can force the "jamming" to jump around erratically in apparent range, inhibiting integration and reducing or even suppressing its impact on true target detection.

Without staggered PRF, any pulses originating from another radar on the same radio frequency might appear stable in time and could be mistaken for reflections from the radar's own transmission. With staggered PRF the radar's own targets appear stable in range in relation to the transmit pulse, whilst the 'jamming' echoes may move around in apparent range (uncorrelated), causing them to be rejected by the receiver. Staggered PRF is only one of several similar techniques used for this, including jittered PRF (where the pulse timing is varied in a less-predictable manner), pulse-frequency modulation, and several other similar techniques whose principal purpose is to reduce the probability of unintentional synchronicity. These techniques are in widespread use in marine safety and navigation radars, by far the most numerous radars on planet Earth today.

Clutter

Clutter refers to radio frequency (RF) echoes returned from targets which are uninteresting to the radar operators. Such targets include natural objects such as ground, sea, precipitation (such as rain, snow or hail), sand storms, animals (especially birds), atmospheric turbulence, and other atmospheric effects, such as ionosphere reflections, meteor trails, and three body scatter spike. Clutter may also be returned from man-made objects such as buildings and, intentionally, by radar countermeasures such as chaff.

Some clutter may also be caused by a long radar waveguide between the radar transceiver and the antenna. In a typical plan position indicator (PPI) radar with a rotating antenna, this will usually be seen as a "sun" or "sunburst" in the centre of the display as the receiver responds to echoes from dust particles and misguided RF in the waveguide. Adjusting the timing between when the transmitter sends a pulse and when the receiver stage is enabled will generally reduce the sunburst without affecting the accuracy of the range, since most sunburst is caused by a diffused transmit pulse reflected before it leaves the antenna. Clutter is considered a passive interference source, since it only appears in response to radar signals sent by the radar.

Clutter is detected and neutralized in several ways. Clutter tends to appear static between radar scans; on subsequent scan echoes, desirable targets will appear to move, and all stationary echoes can be eliminated. Sea clutter can be reduced by using horizontal polarization, while rain is reduced with circular polarization (note that meteorological radars wish for the opposite effect, and therefore use linear polarization to detect precipitation). Other methods attempt to increase the signal-to-clutter ratio.

Clutter moves with the wind or is stationary. Two common strategies to improve measure or performance in a clutter environment are:

Moving target indication, which integrates successive pulses and
Doppler processing, which uses filters to separate clutter from desirable signals.

The most effective clutter reduction technique is pulse-Doppler radar with Look-down/shoot-down capability. Doppler separates clutter from aircraft and spacecraft using a frequency spectrum, so individual signals can be separated from multiple reflectors located in the same volume using velocity differences. This requires a coherent transmitter. Another technique uses a moving target indication that subtracts the receive signal from two successive pulses using phase to reduce signals from slow moving objects. This can be adapted for systems that lack a coherent transmitter, such as time-domain pulse-amplitude radar.

Constant False Alarm Rate, a form of Automatic Gain Control (AGC), is a method that relies on clutter returns far outnumbering echoes from targets of interest. The receiver's gain is automatically adjusted to maintain a constant level of overall visible clutter. While this does not help detect targets masked by stronger surrounding clutter, it does help to distinguish strong target sources. In the past, radar AGC was electronically controlled and affected the gain of the entire radar receiver. As radars evolved, AGC became computer-software controlled and affected the gain with greater granularity in specific detection cells.

Radar multipath echoes from a target cause ghosts to appear.

Clutter may also originate from multipath echoes from valid targets caused by ground reflection, atmospheric ducting or ionospheric reflection/refraction (e.g., Anomalous propagation). This clutter type is especially bothersome since it appears to move and behave like other normal (point) targets of interest. In a typical scenario, an aircraft echo is reflected from the ground below, appearing to the receiver as an identical target below the correct one. The radar may try to unify the targets, reporting the target at an incorrect height, or eliminating it on the basis of jitter or a physical impossibility. Terrain bounce jamming exploits this response by amplifying the radar signal and directing it downward. These problems can be overcome by incorporating a ground map of the radar's surroundings and eliminating all echoes which appear to originate below ground or above a certain height. Monopulse can be improved by altering the elevation algorithm used at low elevation. In newer air traffic control radar equipment, algorithms are used to identify the false targets by comparing the current pulse returns to those adjacent, as well as calculating return improbabilities.

Sensitivity time control (STC)

STC is used to avoid saturation of the receiver from close in ground clutter by adjusting the attenuation of the receiver as a function of distance. More attenuation is applied to returns close in and is reduced as the range increases.

Unambiguous range

Single PRF

In simple systems, echoes from targets must be detected and processed before the next transmitter pulse is generated if range ambiguity is to be avoided. Range ambiguity occurs when the time taken for an echo to return from a target is greater than the pulse repetition period (T); if the interval between transmitted pulses is 1000 microseconds, and the return-time of a pulse from a distant target is 1200 microseconds, the apparent distance of the target is only 200 microseconds. In sum, these 'second echoes' appear on the display to be targets closer than they really are.

Consider the following example : if the radar antenna is located at around 15 m above sea level, then the distance to the horizon is pretty close, (perhaps 15 km). Ground targets further than this range cannot be detected, so the PRF can be quite high; a radar with a PRF of 7.5 kHz will return ambiguous echoes from targets at about 20 km, or over the horizon. If however, the PRF was doubled to 15 kHz, then the ambiguous range is reduced to 10 km and targets beyond this range would only appear on the display after the transmitter has emitted another pulse. A target at 12 km would appear to be 2 km away, although the strength of the echo might be much lower than that from a genuine target at 2 km.

The maximum non ambiguous range varies inversely with PRF and is given by:

{\text{Range}}_{\text{max unambiguous}}=\left({\frac {c}{2\,PRF}}\right)

where c is the speed of light. If a longer unambiguous range is required with this simple system, then lower PRFs are required and it was quite common for early search radars to have PRFs as low as a few hundred Hz, giving an unambiguous range out to well in excess of 150 km. However, lower PRFs introduce other problems, including poorer target painting and velocity ambiguity in Pulse-Doppler systems (see below).

Multiple PRF

Modern radars, especially air-to-air combat radars in military aircraft, may use PRFs in the tens-to-hundreds of kilohertz and stagger the interval between pulses to allow the correct range to be determined. With this form of staggered PRF, a packet of pulses is transmitted with a fixed interval between each pulse, and then another packet is transmitted with a slightly different interval. Target reflections appear at different ranges for each packet; these differences are accumulated and then simple arithmetical techniques may be applied to determine true range. Such radars may use repetitive patterns of packets, or more adaptable packets that respond to apparent target behaviors. Regardless, radars that employ the technique are universally coherent, with a very stable radio frequency, and the pulse packets may also be used to make measurements of the Doppler shift (a velocity-dependent modification of the apparent radio frequency), especially when the PRFs are in the hundreds-of-kilohertz range. Radars exploiting Doppler effects in this manner typically determine relative velocity first, from the Doppler effect, and then use other techniques to derive target distance.

Maximum Unambiguous Range

At its most simplistic, MUR (Maximum Unambiguous Range) for a Pulse Stagger sequence may be calculated using the TSP (Total Sequence Period). TSP is defined as the total time it takes for the Pulsed pattern to repeat. This can be found by the addition of all the elements in the stagger sequence. The formula is derived from the speed of light and the length of the sequence:

MUR=\left(c*0.5*TSP\right)

where c is the speed of light, usually in metres per microsecond, and TSP is the addition of all the positions of the stagger sequence, usually in microseconds. However, in a stagger sequence, some intervals may be repeated several times; when this occurs, it is more appropriate to consider TSP as the addition of all the unique intervals in the sequence.

Also, it is worth remembering that there may be vast differences between the MUR and the maximum range (the range beyond which reflections will probably be too weak to be detected), and that the maximum instrumented range may be much shorter than either of these. A civil marine radar, for instance, may have user-selectable maximum instrumented display ranges of 72, or 96 or rarely 120 nautical miles, in accordance with international law, but maximum unambiguous ranges of over 40,000 nautical miles and maximum detection ranges of perhaps 150 nautical miles. When such huge disparities are noted, it reveals that the primary purpose of staggered PRF is to reduce "jamming", rather than to increase unambiguous range capabilities.

The radar signal in the frequency domain

Pure CW radars appear as a single line on a Spectrum analyser display and when modulated with other sinusoidal signals, the spectrum differs little from that obtained with standard analogue modulation schemes used in communications systems, such as Frequency Modulation and consist of the carrier plus a relatively small number of sidebands. When the radar signal is modulated with a pulse train as shown above, the spectrum becomes much more complicated and far more difficult to visualise.

Basic radar transmission frequency spectrum

3D Doppler Radar Spectrum showing a Barker Code of 13

Basic Fourier analysis shows that any repetitive complex signal consists of a number of harmonically related sine waves. The radar pulse train is a form of square wave, the pure form of which consists of the fundamental plus all of the odd harmonics. The exact composition of the pulse train will depend on the pulse width and PRF, but mathematical analysis can be used to calculate all of the frequencies in the spectrum. When the pulse train is used to modulate a radar carrier, the typical spectrum shown on the left will be obtained.

Examination of this spectral response shows that it contains two basic structures. The coarse structure; (the peaks or 'lobes' in the diagram on the left) and the Fine Structure which contains the individual frequency components as shown below. The envelope of the lobes in the coarse structure is given by: ${\frac {1}{\pi \,f}}$ .

Note that the pulse width ( $\,\tau$ ) determines the lobe spacing. Smaller pulse widths result in wider lobes and therefore greater bandwidth.

Radar transmission frequency fine spectrum

Examination of the spectral response in finer detail, as shown on the right, shows that the Fine Structure contains individual lines or spot frequencies. The formula for the fine structure is given by ${\frac {N}{T}}$ and since the period of the PRF (T) appears at the bottom of the fine spectrum equation, there will be fewer lines if higher PRFs are used. These facts affect the decisions made by radar designers when considering the trade-offs that need to be made when trying to overcome the ambiguities that affect radar signals.

Pulse profiling

If the rise and fall times of the modulation pulses are zero, (e.g. the pulse edges are infinitely sharp), then the sidebands will be as shown in the spectral diagrams above. The bandwidth consumed by this transmission can be huge and the total power transmitted is distributed over many hundreds of spectral lines. This is a potential source of interference with any other device and frequency-dependent imperfections in the transmit chain mean that some of this power never arrives at the antenna. In reality of course, it is impossible to achieve such sharp edges, so in practical systems the sidebands contain far fewer lines than a perfect system. If the bandwidth can be limited to include relatively few sidebands, by rolling off the pulse edges intentionally, an efficient system can be realised with the minimum of potential for interference with nearby equipment. However, the trade-off of this is that slow edges make range resolution poor. Early radars limited the bandwidth through filtration in the transmit chain, e.g. the waveguide, scanner etc., but performance could be sporadic with unwanted signals breaking through at remote frequencies and the edges of the recovered pulse being indeterminate. Further examination of the basic Radar Spectrum shown above shows that the information in the various lobes of the Coarse Spectrum is identical to that contained in the main lobe, so limiting the transmit and receive bandwidth to that extent provides significant benefits in terms of efficiency and noise reduction.

Radar transmission frequency spectrum of a trapezoid pulse profile

Recent advances in signal processing techniques have made the use of pulse profiling or shaping more common. By shaping the pulse envelope before it is applied to the transmitting device, say to a cosine law or a trapezoid, the bandwidth can be limited at source, with less reliance on filtering. When this technique is combined with pulse compression, then a good compromise between efficiency, performance and range resolution can be realised. The diagram on the left shows the effect on the spectrum if a trapezoid pulse profile is adopted. It can be seen that the energy in the sidebands is significantly reduced compared to the main lobe and the amplitude of the main lobe is increased.

Radar transmission frequency spectrum of a cosine pulse profile

Similarly, the use of a cosine pulse profile has an even more marked effect, with the amplitude of the sidelobes practically becoming negligible. The main lobe is again increased in amplitude and the sidelobes correspondingly reduced, giving a significant improvement in performance.

There are many other profiles that can be adopted to optimise the performance of the system, but cosine and trapezoid profiles generally provide a good compromise between efficiency and resolution and so tend to be used most frequently.

Unambiguous velocity

Doppler spectrum. Deliberately no units given (but could be dBu and MHz for example).

This is an issue only with a particular type of system; the pulse-Doppler radar, which uses the Doppler effect to resolve velocity from the apparent change in frequency caused by targets that have net radial velocities compared to the radar device. Examination of the spectrum generated by a pulsed transmitter, shown above, reveals that each of the sidebands, (both coarse and fine), will be subject to the Doppler effect, another good reason to limit bandwidth and spectral complexity by pulse profiling.

Consider the positive shift caused by the closing target in the diagram which has been highly simplified for clarity. It can be seen that as the relative velocity increases, a point will be reached where the spectral lines that constitute the echoes are hidden or aliased by the next sideband of the modulated carrier. Transmission of multiple pulse-packets with different PRF-values, e.g. staggered PRFs, will resolve this ambiguity, since each new PRF value will result in a new sideband position, revealing the velocity to the receiver. The maximum unambiguous target velocity is given by:

\pm {\frac {c\,PRF}{4\,f}}

Typical system parameters

Taking all of the above characteristics into account means that certain constraints are placed on the radar designer. For example, a system with a 3 GHz carrier frequency and a pulse width of 1 µs will have a carrier period of approximately 333 ps. Each transmitted pulse will contain about 3000 carrier cycles and the velocity and range ambiguity values for such a system would be:

PRF	Velocity Ambiguity	Range Ambiguity
Low (2 kHz)	50 m/s	75 km
Medium (12 kHz)	300 m/s	12.5 km
High (200 kHz)	5000 m/s	750 m

Indo-Aryan migrations

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Indo-Aryan_migrations

The Indo-Aryan migrations were the migrations into the Indian subcontinent of Indo-Aryan peoples, an ethnolinguistic group that spoke Indo-Aryan languages, the predominant languages of today's North India, Pakistan, Nepal, Bangladesh, Sri Lanka and the Maldives. Indo-Aryan population movements into the region from Central Asia are considered to have started after 2000 BCE, as a slow diffusion during the Late Harappan period, which led to a language shift in the northern Indian subcontinent. Several hundreds year later, the Iranian languages were brought into the Iranian plateau by the Iranians, who were closely related to the Indo-Aryans.

The Proto-Indo-Iranian culture, which gave rise to the Indo-Aryans and Iranians, developed on the Central Asian steppes north of the Caspian Sea as the Sintashta culture (2050–1800 BCE) in present-day Russia and Kazakhstan, and developed further as the Andronovo culture (2000–1450 BCE).

The Indo-Aryans split off sometime between 2000 BCE and 1600 BCE from the Indo-Iranians, and migrated southwards to the Bactria–Margiana Culture (BMAC), from which they borrowed some of their distinctive religious beliefs and practices. From the BMAC, the Indo-Aryans migrated into northern Syria and, possibly in multiple waves, into the Punjab (northern Pakistan and India), while the Iranians could have reached western Iran before 1300 BCE, both bringing with them the Indo-Iranian languages.

Migration by an Indo-European people was first hypothesized in the late 18th century, following the discovery of the Indo-European language family, when similarities between western and Indian languages had been noted. Given these similarities, a single source or origin was proposed, which was diffused by migrations from some original homeland.

This linguistic argument of this theory is supported by archaeological, anthropological, genetic, literary and ecological research. Genetic research reveals that those migrations form part of a complex genetic puzzle on the origin and spread of the various components of the Indian population. Literary research reveals similarities between various, geographically distinct, Indo-Aryan historical cultures. Ecological studies reveal that in the second millennium BCE widespread aridization led to water shortages and ecological changes in both the Eurasian steppes and the Indian subcontinent, causing the collapse of sedentary urban cultures in south central Asia, Afghanistan, Iran, and India, and triggering large-scale migrations, resulting in the merger of migrating peoples with the post-urban cultures.

The Indo-Aryan migrations started sometime in the period from approximately 2000 to 1600 BCE, after the invention of the war chariot, and also brought Indo-Aryan languages into the Levant and possibly Inner Asia. It was part of the diffusion of Indo-European languages from the proto-Indo-European homeland at the Pontic–Caspian steppe, a large area of grasslands in far Eastern Europe, which started in the 5th to 4th millennia BCE, and the Indo-European migrations out of the Eurasian Steppes, which started approximately in 2000 BCE.

These Indo-Aryan speaking people were united by shared cultural norms and language, referred to as ārya, "noble". Diffusion of this culture and language took place by patron-client systems, which allowed for the absorption and acculturation of other groups into this culture, and explains the strong influence on other cultures with which it interacted.

Fundamentals

Scheme of Indo-European language dispersals from c. 4000 to 1000 BCE according to the widely held Kurgan hypothesis.
– Center: Steppe cultures
1 (black): Anatolian languages (archaic PIE)
2 (black): Afanasievo culture (early PIE)
3 (black) Yamnaya culture expansion (Pontic-Caspian steppe, Danube Valley) (late PIE)
4A (black): Western Corded Ware
4B-C (blue & dark blue): Bell Beaker; adopted by Indo-European speakers
5A-B (red): Eastern Corded ware
5C (red): Sintashta (proto-Indo-Iranian)
6 (magenta): Andronovo
7A (purple): Indo-Aryans (Mittani)
7B (purple): Indo-Aryans (India)
[NN] (dark yellow): proto-Balto-Slavic
8 (grey): Greek
9 (yellow):Iranians
– [not drawn]: Armenian, expanding from western steppe

The Indo-Aryan migration theory is part of a larger theoretical framework. This framework explains the similarities between a wide range of contemporary and ancient languages. It combines linguistic, archaeological and anthropological research. This provides an overview of the development of Indo-European languages, and the spread of these Indo-European languages by migration and acculturation.

Linguistics: relationships between languages

The linguistic part traces the connections between the various Indo-European languages, and reconstructs the proto-Indo-European language. This is possible because the processes that change languages are not random, but follow strict patterns. Sound shifts, the changing of vowels and consonants, are especially important, although grammar (especially morphology) and the lexicon (vocabulary) may also be significant. Historical-comparative linguistics thus makes it possible to see great similarities between related languages which at first sight might seem very different. Various characteristics of the Indo-European languages argue against an Indian origin of these languages, and point to a steppe origin.

Archaeology: migrations from the steppe Urheimat

The archaeological part posits an "Urheimat" on the Pontic steppes, which developed after the introduction of cattle on the steppes around 5,200 BCE. This introduction marked the change from foragist to pastoralist cultures, and the development of a hierarchical social system with chieftains, patron-client systems, and the exchange of goods and gifts. The oldest nucleus may have been the Samara culture (late 6th and early 5th millennium BCE), at a bend in the Volga.

A wider "horizon" developed, called the Kurgan culture by Marija Gimbutas in the 1950s. She included several cultures in this "Kurgan Culture", including the Samara culture and the Yamna culture, although the Yamna culture (36th–23rd centuries BCE), also called "Pit Grave Culture", may more aptly be called the "nucleus" of the proto-Indo-European language. From this area, which already included various subcultures, Indo-European languages spread west, south and east starting around 4,000 BCE. These languages may have been carried by small groups of males, with patron-client systems which allowed for the inclusion of other groups into their cultural system.

Eastward emerged the Sintashta culture (2050–1900 BCE), where common Indo-Iranian was spoken. Out of the Sintashta culture developed the Andronovo culture (2000–1450 BCE), which interacted with the Bactria-Margiana Culture (2250–1700 BCE). This interaction further shaped the Indo-Iranians, which split at sometime between 2000 and 1600 BCE into the Indo-Aryans and the Iranians. The Indo-Aryans migrated to the Levant and South Asia. The migration into northern India was not a large-scale immigration, but may have consisted of small groups which were genetically diverse. Their culture and language spread by the same mechanisms of acculturalisation, and the absorption of other groups into their patron-client system.

Anthropology: elite recruitment and language shift

Indo-European languages probably spread through language shifts. Small groups can change a larger cultural area, and elite male dominance by small groups may have led to a language shift in northern India.

According to Parpola, local elites joined "small but powerful groups" of Indo-European speaking migrants. These migrants had an attractive social system and good weapons, and luxury goods which marked their status and power. Joining these groups was attractive for local leaders, since it strengthened their position, and gave them additional advantages. These new members were further incorporated by matrimonial alliances.

According to Joseph Salmons, language shift is facilitated by "dislocation" of language communities, in which the elite is taken over. According to Salmons, this change is facilitated by "systematic changes in community structure", in which a local community becomes incorporated in a larger social structure.

Genetics: ancient ancestry and multiple gene flows

The Indo-Aryan migrations form part of a complex genetic puzzle on the origin and spread of the various components of the Indian population, including various waves of admixture and language shift. Studies indicate north and south Indians share a common maternal ancestry. A series of studies show that the Indian subcontinent harbours two major ancestral components, namely the Ancestral North Indians (ANI) which is "genetically close to Middle Easterners, Central Asians, and Europeans", and the Ancestral South Indians (ASI) which is clearly distinct from ANI. These two groups mixed in India between 4,200 and 1,900 years ago (2200 BCE – 100 CE), after which a shift to endogamy took place, possibly by the enforcement of "social values and norms" during the Gupta Empire.

Moorjani et al. (2013) describe three scenarios regarding the bringing together of the two groups: migrations before the development of agriculture before 8,000–9,000 years before present (BP); migration of western Asian people together with the spread of agriculture, maybe up to 4,600 years BP; migrations of western Eurasians from 3,000 to 4,000 years BP.

While Reich notes that the onset of admixture coincides with the arrival of Indo-European language, according to Moorjani et al. (2013) these groups were present "unmixed" in India before the Indo-Aryan migrations. Gallego Romero et al. (2011) propose that the ANI component came from Iran and the Middle East, less than 10,000 years ago, while according to Lazaridis et al. (2016) ANI is a mix of "early farmers of western Iran" and "people of the Bronze Age Eurasian steppe". Several studies also show traces of later influxes of maternal genetic material and of paternal genetic material related to ANI and possibly the Indo-Europeans.

Literary research: similarities, geography, and references to migration

The oldest inscription in Old Indic is found in northern Syria in Hittite records regarding the Hurrian-speaking Mitanni. The religious practices depicted in the Rigveda and those depicted in the Avesta, the central religious text of Zoroastrianism, show similarities. Some of the references to the Sarasvati in the Rigveda refer to the Ghaggar-Hakra River, while the Afghan river Haraxvaiti/Harauvati Helmand is sometimes quoted as the locus of the early Rigvedic river. The Rigveda does not explicitly refer to an external homeland or to a migration, but later Vedic and Puranic texts do show the movement into the Gangetic plains.

Ecological studies: widespread drought, urban collapse, and pastoral migrations

Climate change and drought may have triggered both the initial dispersal of Indo-European speakers, and the migration of Indo-Europeans from the steppes in south central Asia and India.

Around 4200–4100 BCE a climate change occurred, manifesting in colder winters in Europe. Steppe herders, archaic Proto-Indo-European speakers, spread into the lower Danube valley about 4200–4000 BCE, either causing or taking advantage of the collapse of Old Europe.

In the third millennium BCE widespread aridification led to water shortages and ecological changes in both the Eurasian steppes and the Indian subcontinent. On the steppes, humidification led to a change of vegetation, triggering "higher mobility and transition to nomadic cattle breeding". Water shortage also had a strong impact in the Indian subcontinent, "causing the collapse of sedentary urban cultures in south central Asia, Afghanistan, Iran, and India, and triggering large-scale migrations".

Development of the theory

Similarities between Sanskrit, Persian, Greek

In the 16th century, European visitors to India became aware of similarities between Indian and European languages and as early as 1653 Van Boxhorn had published a proposal for a proto-language ("Scythian") for Germanic, Romance, Greek, Baltic, Slavic, Celtic and Iranian.

In a memoir sent to the French Academy of Sciences in 1767 Gaston-Laurent Coeurdoux, a French Jesuit who spent all his life in India, had specifically demonstrated the existing analogy between Sanskrit and European languages.

In 1786 William Jones, a judge in the Supreme Court of Judicature at Fort William, Calcutta, linguist, and classics scholar, on studying Sanskrit, postulated, in his Third Anniversary Discourse to the Asiatic Society, a proto-language uniting Sanskrit, Persian, Greek, Latin, Gothic and Celtic languages, but in many ways his work was less accurate than his predecessors', as he erroneously included Egyptian, Japanese and Chinese in the Indo-European languages, while omitting Hindustani and Slavic:The Sanskrit language, whatever be its antiquity, is of a wonderful structure; more perfect than the Greek, more copious than the Latin, and more exquisitely refined than either, yet bearing to both of them a stronger affinity, both in the roots of verbs and in the forms of grammar, than could possibly have been produced by accident; so strong indeed, that no philologer could examine them all three, without believing them to have sprung from some common source, which, perhaps, no longer exists: there is a similar reason, though not quite so forcible, for supposing that both the Gothic and the Celtic, though blended with a very different idiom, had the same origin with the Sanskrit; and the old Persian might be added to the same family, if this were the place for discussing any question concerning the antiquities of Persia.

Jones concluded that all these languages originated from the same source.

Homeland

Scholars assume a homeland either in central Asia or in Western Asia, and Sanskrit must in this case have reached India by a language transfer from west to east. In 19th century Indo-European studies, the language of the Rigveda was the most archaic Indo-European language known to scholars, indeed the only records of Indo-European that could reasonably claim to date to the Bronze Age. This primacy of Sanskrit inspired scholars such as Friedrich Schlegel, to assume that the locus of the proto-Indo-European homeland had been in India, with the other dialects spread to the west by historical migration.

With the 20th-century discovery of Bronze-Age attestations of Indo-European (Anatolian, Mycenaean Greek), Vedic Sanskrit lost its special status as the most archaic Indo-European language known.

Aryan "race"

A 1910 depiction of Aryans entering India, from Hutchinson's History of the Nations

In the 1850s Max Müller introduced the notion of two Aryan races, a western and an eastern one, who migrated from the Caucasus into Europe and India respectively. Müller dichotomized the two groups, ascribing greater prominence and value to the western branch. Nevertheless, this "eastern branch of the Aryan race was more powerful than the indigenous eastern natives, who were easy to conquer".

Herbert Hope Risley expanded on Müller's two-race Indo-European speaking Aryan invasion theory, concluding that the caste system was a remnant of the Indo-Aryans domination of the native Dravidians, with observable variations in phenotypes between hereditary, race based, castes. Thomas Trautmann explains that Risley "found a direct relation between the proportion of Aryan blood and the nasal index, along a gradient from the highest castes to the lowest. This assimilation of caste to race proved very influential."

Müller's work contributed to the developing interest in Aryan culture, which often set Indo-European ('Aryan') traditions in opposition to Semitic religions. He was "deeply saddened by the fact that these classifications later came to be expressed in racist terms", as this was far from his intention. For Müller the discovery of common Indian and European ancestry was a powerful argument against racism, arguing that "an ethnologist who speaks of Aryan race, Aryan blood, Aryan eyes and hair, is as great a sinner as a linguist who speaks of a dolichocephalic dictionary or a brachycephalic grammar" and that "the blackest Hindus represent an earlier stage of Aryan speech and thought than the fairest Scandinavians". In his later work, Max Müller took great care to limit the use of the term "Aryan" to a strictly linguistic one.

"Aryan invasion"

The excavation of the Harappa, Mohenjo-daro and Lothal sites of the Indus Valley Civilisation (IVC) in the 1920, showed that northern India already had an advanced culture when the Indo-Aryans migrated into the area. The theory changed from a migration of advanced Aryans towards a primitive aboriginal population, to a migration of nomadic people into an advanced urban civilization, comparable to the Germanic migrations during the Fall of the Western Roman Empire, or the Kassite invasion of Babylonia.

This possibility was for a short time seen as a hostile invasion into northern India. The decline of the Indus Valley Civilisation at precisely the period in history in which the Indo-Aryan migrations probably took place, seemed to provide independent support of such an invasion. This argument was proposed by the mid-20th century archaeologist Mortimer Wheeler, who interpreted the presence of many unburied corpses found in the top levels of Mohenjo-daro as the victims of conquest wars, and who famously stated that the god "Indra stands accused" of the destruction of the Civilisation.

This position was discarded after finding no evidence of wars. The skeletons were found to be hasty interments, not massacred victims. Wheeler himself also nuanced this interpretation in later publications, stating "This is a possibility, but it can't be proven, and it may not be correct." Wheeler further notes that the unburied corpses may indicate an event in the final phase of human occupation of Mohenjo-Daro, and that thereafter the place was uninhabited, but that the decay of Mohenjo-Daro has to be ascribed to structural causes such as salinisation.

Nevertheless, although 'invasion' was discredited, critics of the Indo-Aryan Migration theory continue to present the theory as an "Aryan Invasion Theory", presenting it as a racist and colonialist discourse:

The theory of an immigration of IA speaking Arya ("Aryan invasion") is simply seen as a means of British policy to justify their own intrusion into India and their subsequent colonial rule: in both cases, a "white race" was seen as subduing the local darker colored population.

Aryan migration

An early 20th century depiction of Aryans settling in agricultural villages in India

In the later 20th century, ideas were refined along with data accrual, and migration and acculturation were seen as the methods whereby Indo-Aryans and their language and culture spread into northwest India around 1500 BCE. The term "invasion" is only being used nowadays by opponents^[who?] of the Indo-Aryan Migration theory. Michael Witzel:

...it has been supplanted by much more sophisticated models over the past few decades [...] philologists first, and archaeologists somewhat later, noticed certain inconsistencies in the older theory and tried to find new explanations, a new version of the immigration theories.

The changed approach was in line with newly developed thinking about language transfer in general, such as the migration of the Greeks into Greece (between 2100 and 1600 BCE) and their adoption of a syllabic script, Linear B, from the pre-existing Linear A, with the purpose of writing Mycenaean Greek, or the Indo-Europeanization of Western Europe (in stages between 2200 and 1300 BCE).

Future directions

Mallory notes that with the development and the growing sophistication of the knowledge on the Indo-European migrations and their purported homeland, new questions arise, and that "it is evident that we still have a very long way to go." One of those questions is the origin of the shared agricultural vocabulary, and the earliest dates for agriculturalism in areas settled by the Indo-Europeans. Those dates seem to be too late to account for the shared vocabulary, and raise the question what their origin is.

Linguistics: relationships between languages

Linguistic research traces the connections between the various Indo-European languages, and reconstructs proto-Indo-European. Accumulated linguistic evidence points to the Indo-Aryan languages as intrusive into the Indian subcontinent, some time in the 2nd millennium BCE. The language of the Rigveda, the earliest stratum of Vedic Sanskrit, is assigned to about 1500–1200 BCE.

Comparative method

Connections between languages can be traced because the processes that change languages are not random, but follow strict patterns. Especially sound shifts, the changing of vowels and consonants, are important, although grammar (especially morphology) and the lexicon (vocabulary) may also be significant. Historical-comparative linguistics thus makes it possible to see great similarities between languages which at first sight might seem very different.

Linguistics use the comparative method to study the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor, as opposed to the method of internal reconstruction, which analyses the internal development of a single language over time. Ordinarily both methods are used together to reconstruct prehistoric phases of languages, to fill in gaps in the historical record of a language, to discover the development of phonological, morphological, and other linguistic systems, and to confirm or refute hypothesized relationships between languages.

The comparative method aims to prove that two or more historically attested languages are descended from a single proto-language by comparing lists of cognate terms. From them, regular sound correspondences between the languages are established, and a sequence of regular sound changes can then be postulated, which allows the proto-language to be reconstructed. Relation is deemed certain only if at least a partial reconstruction of the common ancestor is feasible, and if regular sound correspondences can be established with chance similarities ruled out.

The comparative method was developed over the 19th century. Key contributions were made by the Danish scholars Rasmus Rask and Karl Verner and the German scholar Jacob Grimm. The first linguist to offer reconstructed forms from a proto-language was August Schleicher, in his Compendium der vergleichenden Grammatik der indogermanischen Sprachen, originally published in 1861.

Proto-Indo-European

Proto-Indo-European (PIE) is the linguistic reconstruction of the common ancestor of the Indo-European languages. August Schleicher's 1861 reconstruction of PIE was the first proposed proto-language to be accepted by modern linguists. More work has gone into reconstructing it than any other proto-language, and it is by far the best understood among all proto-languages of its age. During the 19th century, the vast majority of linguistic work was devoted to reconstruction of Proto-Indo-European or its daughter proto-languages such as Proto-Germanic, and most of the current techniques of linguistic reconstruction in historical linguistics (e.g., the comparative method and the method of internal reconstruction) were developed as a result.

PIE must have been spoken as a single language or a group of related dialects (before divergence began), though estimates of when this was by different authorities can vary massively, from the 7th millennium BCE to the second. A number of hypotheses have been proposed for the origin and spread of the language, the most popular among linguists being the Kurgan hypothesis, which postulates an origin in the Pontic–Caspian steppe of Eastern Europe in the 5th or 4th millennia BCE. Features of the culture of the speakers of PIE, known as Proto-Indo-Europeans, have also been reconstructed based on the shared vocabulary of the early attested Indo-European languages.

As mentioned above, the existence of PIE was first postulated in the 18th century by Sir William Jones, who observed the similarities between Sanskrit, Ancient Greek, and Latin. By the early 20th century, well-defined descriptions of PIE had been developed that are still accepted today (with some refinements). The largest developments of the 20th century were the discovery of the Anatolian and Tocharian languages and the acceptance of the laryngeal theory. The Anatolian languages have also spurred a major re-evaluation of theories concerning the development of various shared Indo-European language features and the extent to which these features were present in PIE itself. Relationships to other language families, including the Uralic languages, have been proposed but remain controversial.

PIE is thought to have had a complex system of morphology that included inflectional suffixes as well as ablaut (vowel alterations, as preserved in English sing, sang, sung). Nouns and verbs had complex systems of declension and conjugation respectively.

Arguments against an Indian origin of proto-Indo-European

Diversity

According to the linguistic center of gravity principle, the most likely point of origin of a language family is in the area of its greatest diversity. By this criterion, Northern India, home to only a single branch of the Indo-European language family (i.e., Indo-Aryan), is an exceedingly unlikely candidate for the Indo-European homeland, compared to Central-Eastern Europe, for example, which is home to the Italic, Venetic, Illyrian, Albanian, Germanic, Baltic, Slavic, Thracian and Greek branches of Indo-European.

Both mainstream Urheimat solutions locate the Proto-Indo-European homeland in the vicinity of the Black Sea.

Dialectal variation

It has been recognized since the mid-19th century, beginning with Schmidt and Schuchardt, that a binary tree model cannot capture all linguistic alignments; certain areal features cut across language groups and are better explained through a model treating linguistic change like waves rippling out through a pond. This is true of the Indo-European languages as well. Various features originated and spread while Proto-Indo-European was still a dialect continuum. These features sometimes cut across sub-families: for instance, the instrumental, dative and ablative plurals in Germanic and Balto-Slavic feature endings beginning with -m-, rather than the usual -*bh-, e.g. Gothic dative plural sunum 'to the sons' and Old Church Slavonic instrumental plural synъ-mi 'with sons', despite the fact that the Germanic languages are centum, while Balto-Slavic languages are satem.

The strong correspondence between the dialectal relationships of the Indo-European languages and their actual geographical arrangement in their earliest attested forms makes an Indian origin, as suggested by the Out of India Theory, unlikely.

Substrate influence

Already in the 1870s the Neogrammarians realised that the Greek/Latin vocalism couldn't be explained on the basis of the Sanskrit one, and therefore must be more original. The Indo-Iranian and Uralic languages influenced each other, with the Finno-Ugric languages containing Indo-European loan words. A telling example is the Finnish word vasara, "hammer", which is related to vajra, the weapon of Indra. Since the Finno-Ugric homeland was located in the northern forest zone in northern Europe, the contacts must have taken place – in line with the placement of the proto-Indo-European homeland at the Pontic-Caspian steppes – between the Black Sea and the Caspian Sea.

Dravidian and other South Asian languages share with Indo-Aryan a number of syntactical and morphological features that are alien to other Indo-European languages, including even its closest relative, Old Iranian. Phonologically, there is the introduction of retroflexes, which alternate with dentals in Indo-Aryan; morphologically there are the gerunds; and syntactically there is the use of a quotative marker (iti). These are taken as evidence of substratum influence.

It has been argued that Dravidian influenced Indic through "shift", whereby native Dravidian speakers learned and adopted Indic languages. The presence of Dravidian structural features in Old Indo-Aryan is thus plausibly explained, that the majority of early Old Indo-Aryan speakers had a Dravidian mother tongue which they gradually abandoned. Even though the innovative traits in Indic could be explained by multiple internal explanations, early Dravidian influence is the only explanation that can account for all of the innovations at once – it becomes a question of explanatory parsimony; moreover, early Dravidian influence accounts for several of the innovative traits in Indic better than any internal explanation that has been proposed.

A pre-Indo-European linguistic substratum in the Indian subcontinent would be a good reason to exclude India as a potential Indo-European homeland. However, several linguists, all of whom accept the external origin of the Aryan languages on other grounds, are still open to considering the evidence as internal developments rather than the result of substrate influences, or as adstratum effects.

Archaeology: migrations from the steppe Urheimat

The Sintashta, Andronovo, Bactria-Margiana and Yaz cultures have been associated with Indo-Iranian migrations in Central Asia. The Gandhara Grave, Cemetery H, Copper Hoard and Painted Grey Ware cultures are candidates for subsequent cultures within south India associated with Indo-Aryan movements. The decline of the Indus Valley Civilisation predates the Indo-Aryan migrations, but archeological data show a cultural continuity in the archeological record. Together with the presence of Dravidian loanwords in the Rigveda, this argues in favor of an interaction between post-Harappan and Indo-Aryan cultures.

Stages of migrations

About 6,000 years ago the Indo-Europeans started to spread out from their proto-Indo-European homeland in Central Eurasia, between the southern Ural Mountains, the North Caucasus, and the Black Sea. About 4,000 years ago Indo-European speaking peoples started to migrate out of the Eurasian steppes.

Diffusion from the "Urheimat"

Scholars regard the middle Volga, which was the location of the Samara culture (late 6th and early 5th millennium BCE), and the Yamna culture, to be the "Urheimat" of the Indo-Europeans, as described by the Kurgan hypothesis. From this "Urheimat", Indo-European languages spread throughout the Eurasian steppes between c. 4,500 and 2,500 BCE, forming the Yamna culture.

Sequence of migrations

David Anthony gives an elaborate overview of the sequence of migrations.

The oldest attested Indo-European language is Hittite, which belongs to the oldest written Indo-European languages, the Anatolian branch. Although the Hittites are placed in the 2nd millennium BCE, the Anatolian branch seems to predate Proto-Indo-European, and may have developed from an older Pre-Proto-Indo-European ancestor. If it separated from Proto-Indo-European, it is likely to have done so between 4500 and 3500 BCE.

A migration of archaic Proto-Indo-European speaking steppe herders into the lower Danube valley took place about 4200–4000 BCE, either causing or taking advantage of the collapse of Old Europe.

According to Mallory and Adams, migrations southward founded the Maykop culture (c. 3500–2500 BCE), and eastward the Afanasevo culture (c. 3500–2500 BCE), which developed into the Tocharians (c. 3700–3300 BCE).

According to Anthony, between 3100–2800/2600 BCE, a real folk migration of Proto-Indo-European speakers from the Yamna-culture took place toward the west, into the Danube Valley. These migrations probably split off Pre-Italic, Pre-Celtic and Pre-Germanic from Proto-Indo-European. According to Anthony, this was followed by a movement north, which split away Baltic-Slavic c. 2800 BCE. Pre-Armenian split off at the same time. According to Parpola, this migration is related to the appearance of Indo-European speakers from Europe in Anatolia, and the appearance of Hittite.

The Corded Ware culture in Middle Europe ( 2900–2450/2350 cal. BCE), has been associated with some of the languages in the Indo-European family. According to Haak et al. (2015) a massive migration took place from the Eurasian steppes to Central Europe.

Yamna culture

This migration is closely associated with the Corded Ware culture.

The Indo-Iranian language and culture emerged in the Sintashta culture (c. 2050–1900 BCE), where the chariot was invented. Allentoft et al. (2015) found close autosomal genetic relationship between peoples of Corded Ware culture and Sintashta culture, which "suggests similar genetic sources of the two", and may imply that "the Sintashta derives directly from an eastward migration of Corded Ware peoples".

The Indo-Iranian language and culture was further developed in the Andronovo culture (c. 2000–1450 BCE), and influenced by the Bactria–Margiana Archaeological Complex (c. 2250–1700 BCE). The Indo-Aryans split off sometime around 2000–1600 BCE from the Iranians, after which Indo-Aryan groups are thought to have moved to the Levant (Mitanni), the northern Indian subcontinent (Vedic people, c. 1500 BCE), and China (Wusun). Thereafter the Iranians migrated into Iran.

Central Asia: formation of Indo-Iranians

Indo-Iranian peoples are a grouping of ethnic groups consisting of the Indo-Aryan, Iranian and Nuristani peoples; that is, speakers of Indo-Iranian languages.

The Proto-Indo-Iranians are commonly identified with the Andronovo culture, that flourished c. 2000–1450 BCE in an area of the Eurasian Steppe that borders the Ural River on the west, the Tian Shan on the east. The older Sintashta culture (2050–1900), formerly included within the Andronovo culture, is now considered separately, but regarded as its predecessor, and accepted as part of the wider Andronovo horizon.

The Indo-Aryan migration was part of the Indo-Iranian migrations from the Andronovo culture into Anatolia, Iran and South Asia.

Sintashta-Petrovka culture

According to Allentoft (2015), the Sintashta culture probably derived from the Corded Ware Culture.

Map of the approximate maximal extent of the Andronovo culture. The formative Sintashta-Petrovka culture is shown in darker red. The location of the earliest spoke-wheeled chariot finds is indicated in purple. Adjacent and overlapping cultures (Afanasevo, Srubna and Bactria-Margiana cultures) are shown in green.

The Sintashta culture, also known as the Sintashta-Petrovka culture or Sintashta-Arkaim culture, is a Bronze Age archaeological culture of the northern Eurasian Steppe on the borders of Eastern Europe and Central Asia, dated to the period 2050–1900 BCE. The Sintashta culture is probably the archaeological manifestation of the Indo-Iranian language group.

The Sintashta culture emerged from the interaction of two antecedent cultures. Its immediate predecessor in the Ural-Tobol steppe was the Poltavka culture, an offshoot of the cattle-herding Yamnaya horizon that moved east into the region between 2800 and 2600 BCE. Several Sintashta towns were built over older Poltovka settlements or close to Poltovka cemeteries, and Poltovka motifs are common on Sintashta pottery. Sintashta material culture also shows the influence of the late Abashevo culture, a collection of Corded Ware settlements in the forest steppe zone north of the Sintashta region that were also predominantly pastoralist. Allentoft et al. (2015) also found close autosomal genetic relationship between peoples of Corded Ware culture and Sintashta culture.

The earliest known chariots have been found in Sintashta burials, and the culture is considered a strong candidate for the origin of the technology, which spread throughout the Old World and played an important role in ancient warfare. Sintashta settlements are also remarkable for the intensity of copper mining and bronze metallurgy carried out there, which is unusual for a steppe culture.

Because of the difficulty of identifying the remains of Sintashta sites beneath those of later settlements, the culture was only recently distinguished from the Andronovo culture. It is now recognised as a separate entity forming part of the 'Andronovo horizon'.

Andronovo culture

Archaeological cultures associated with Indo-Iranian migrations and Indo-Aryan migrations (after EIEC). The Andronovo, BMAC and Yaz cultures have often been associated with Indo-Iranian migrations. The GGC, Cemetery H, Copper Hoard and PGW cultures are candidates for cultures associated with Indo-Aryan migrations.

The Andronovo culture is a collection of similar local Bronze Age Indo-Iranian cultures that flourished c. 2000–1450 BC in western Siberia and the central Eurasian Steppe. It is probably better termed an archaeological complex or archaeological horizon. The name derives from the village of Andronovo (55°53′N 55°42′E), where in 1914, several graves were discovered, with skeletons in crouched positions, buried with richly decorated pottery. The older Sintashta culture (2050–1900 BCE), formerly included within the Andronovo culture, is now considered separately, but regarded as its predecessor, and accepted as part of the wider Andronovo horizon.

Currently only two sub-cultures are considered as part of Andronovo culture:

Alakul (2000–1700 BC) between Oxus (today Amu Darya), and Jaxartes, Kyzylkum desert
Fëdorovo (2000–1450 BC) in southern Siberia (earliest evidence of cremation and fire cult)

Other authors identified previously the following sub-cultures also as part of Andronovo:

Eastern Fedorovo (1750–1500 BC) in Tian Shan mountains (Northwestern Xinjiang, China), southeastern Kazakhstan, eastern Kyrgyzstan
Alekseyevka (1200–1000 BC) "final Bronze Age phase" in eastern Kazakhstan, contacts with Namazga VI in Turkmenia

The geographical extent of the culture is vast and difficult to delineate exactly. On its western fringes, it overlaps with the approximately contemporaneous, but distinct, Srubna culture in the Volga–Ural interfluvial. To the east, it reaches into the Minusinsk depression, with some sites as far west as the southern Ural Mountains, overlapping with the area of the earlier Afanasevo culture. Additional sites are scattered as far south as the Kopet Dag (Turkmenistan), the Pamir (Tajikistan) and the Tian Shan (Kyrgyzstan). The northern boundary vaguely corresponds to the beginning of the Taiga. In the Volga basin, interaction with the Srubna culture was the most intense and prolonged, and Federovo style pottery is found as far west as Volgograd.

Towards the middle of the 2nd millennium, the Andronovo cultures begin to move intensively eastwards. They mined deposits of copper ore in the Altai Mountains and lived in villages of as many as ten sunken log cabin houses measuring up to 30m by 60m in size. Burials were made in stone cists or stone enclosures with buried timber chambers.

In other respects, the economy was pastoral, based on cattle, horses, sheep, and goats. While agricultural use has been posited, no clear evidence has been presented.

Studies associate the Andronovo horizon with early Indo-Iranian languages, though it may have overlapped the early Uralic-speaking area at its northern fringe, including the Turkic-speaking area at its northeastern fringe.

Based on its use by Indo-Aryans in Mitanni and Vedic India, its prior absence in the Near East and Harappan India, and its 19–20th century BCE attestation at the Andronovo site of Sintashta, Kuz'mina (1994) argues that the chariot corroborates the identification of Andronovo as Indo-Iranian. Anthony & Vinogradov (1995) dated a chariot burial at Krivoye Lake to about 2000 BCE and a Bactria-Margiana burial that also contains a foal has recently been found, indicating further links with the steppes.

Mallory acknowledges the difficulties of making a case for expansions from Andronovo to northern India, and that attempts to link the Indo-Aryans to such sites as the Beshkent and Vakhsh cultures "only gets the Indo-Iranian to Central Asia, but not as far as the seats of the Medes, Persians or Indo-Aryans". He has developed the "kulturkugel" model that has the Indo-Iranians taking over Bactria-Margiana cultural traits but preserving their language and religion while moving into Iran and India. Fred Hiebert also agrees that an expansion of the BMAC into Iran and the margin of the Indus Valley is "the best candidate for an archaeological correlate of the introduction of Indo-Iranian speakers to Iran and South Asia." According to Narasimhan et al. (2018), the expansion of the Andronovo culture towards the BMAC took place via the Inner Asia Mountain Corridor.

Bactria-Margiana culture

The extent of the Bactria-Margiana Culture (after EIEC).

The Bactria-Margiana Culture, also called "Bactria-Margiana Archaeological Complex", was a non-Indo-European culture which influenced the Indo-Iranians. It was centered in what is nowadays northwestern Afghanistan and southern Turkmenistan. Proto-Indo-Iranian arose due to this influence.

The Indo-Iranians also borrowed their distinctive religious beliefs and practices from this culture. According to Anthony, the Old Indic religion probably emerged among Indo-European immigrants in the contact zone between the Zeravshan River (present-day Uzbekistan) and (present-day) Iran. It was "a syncretic mixture of old Central Asian and new Indo-European elements", which borrowed "distinctive religious beliefs and practices" from the Bactria–Margiana Culture. At least 383 non-Indo-European words were borrowed from this culture, including the god Indra and the ritual drink Soma.

The characteristically Bactria-Margiana (southern Turkmenistan/northern Afghanistan) artifacts found at burials in Mehrgarh and Balochistan are explained by a movement of peoples from Central Asia to the south. The Indo-Aryan tribes may have been present in the area of the BMAC from 1700 BCE at the latest (incidentally corresponding with the decline of that culture).

From the BMAC, the Indo-Aryans moved into the Indian subcontinent. According to Bryant, the Bactria-Margiana material inventory of the Mehrgarh and Baluchistan burials is "evidence of an archaeological intrusion into the subcontinent from Central Asia during the commonly accepted time frame for the arrival of the Indo-Aryans".

Two waves of Indo-Iranian migration

The Indo-Iranian migrations took place in two waves, belonging to the second and the third stage of Beckwith's description of the Indo-European migrations. The first wave consisted of the Indo-Aryan migration into the Levant, seemingly founding the Mitanni kingdom in northern Syria (c. 1600–1350 BCE), and the migration south-eastward of the Vedic people, over the Hindu Kush into northern India. Christopher I. Beckwith suggests that the Wusun, an Indo-European Europoid people of Inner Asia in antiquity, were also of Indo-Aryan origin. The second wave is interpreted as the Iranian wave.

First wave – Indo-Aryan migrations

Mittani

Map of the Near East c. 1400 BCE showing the Kingdom of Mitanni at its greatest extent

Mitanni (Hittite cuneiform KUR^URUMi-ta-an-ni), also Mittani (Mi-it-ta-ni) or Hanigalbat (Assyrian Hanigalbat, Khanigalbat cuneiform Ḫa-ni-gal-bat) or Naharin in ancient Egyptian texts was a Hurrian-speaking state in northern Syria and south-east Anatolia from c. 1600 BCE – 1350 BCE.

According to one hypothesis, founded by an Indo-Aryan ruling class governing a predominately Hurrian population, Mitanni came to be a regional power after the Hittite destruction of Amorite Babylon and a series of ineffectual Assyrian kings created a power vacuum in Mesopotamia. At the beginning of its history, Mitanni's major rival was Egypt under the Thutmosids. However, with the ascent of the Hittite empire, Mitanni and Egypt made an alliance to protect their mutual interests from the threat of Hittite domination.

At the height of its power, during the 14th century BCE, Mitanni had outposts centered on its capital, Washukanni, whose location has been determined by archaeologists to be on the headwaters of the Khabur River. Their sphere of influence is shown in Hurrian place names, personal names and the spread through Syria and the Levant of a distinct pottery type. Eventually, Mitanni succumbed to Hittite and later Assyrian attacks, and was reduced to the status of a province of the Middle Assyrian Empire.

The earliest written evidence for an Indo-Aryan language is found not in Northwestern India and Pakistan, but in northern Syria, the location of the Mitanni kingdom. The Mitanni kings took Old Indic throne names, and Old Indic technical terms were used for horse-riding and chariot-driving. The Old Indic term r'ta, meaning "cosmic order and truth", the central concept of the Rigveda, was also employed in the Mitanni kingdom. Old Indic gods, including Indra, were also known in the Mitanni kingdom.

North-India – Vedic culture

Multiple waves of migration into northern India

Geography of the Rigveda, with river names; the extent of the Swat and Cemetery H cultures are indicated.

The standard model for the entry of the Indo-European languages into India is that Indo-Aryan migrants went over the Hindu Kush, forming the Gandhara grave culture or Swat culture, in present-day Swat valley, into the headwaters of either the Indus or the Ganges (probably both). The Gandhara grave culture, which emerged c. 1600 BCE and flourished from c. 1500 BCE to 500 BCE in Gandhara, modern-day Pakistan and Afghanistan, is thus the most likely locus of the earliest bearers of Rigvedic culture.

According to Parpola, Indo-Aryan clans migrated into South Asia in subsequent waves. This explains the diversity of views found in the Rig Veda, and may also explain the existence of various Indo-Aryan cultural complexes in the later Vedic period, namely the Vedic culture centered on the Kuru Kingdom in the heartland of Aryavarta in the western Ganges plain, and the cultural complex of Greater Magadha at the eastern Ganges plain, which gave rise to Jainism and Buddhism.

Writing in 1998, Parpola postulated a first wave of immigration from as early as 1900 BCE, corresponding to the Cemetery H culture and the Copper Hoard Culture, c.q. Ochre Coloured Pottery culture, and an immigration to the Punjab . 1700–1400 BCE. In 2020, Parpola proposed an even earlier wave of proto-Indo-Iranian speaking people from the Sintashta culture into India at c. 1900 BCE, related to the Copper Hoard Culture, followed by a pre-Rig Vedic Indo-Aryan wave of migration:

It seems, then, that the earliest Aryan-speaking immigrants to South Asia, the Copper Hoard people, came with bull-drawn carts (Sanauli and Daimabad) via the BMAC and had Proto-Indo-Iranian as their language. They were, however, soon followed (and probably at least partially absorbed) by early Indo-Aryans.

This pre-Rig-Vedic wave of migration by early Indo-Aryans is associated by Parpola with "the early (Ghalegay IV–V) phase of the Gandhāra Grave culture" and the Atharva Veda tradition, and related to the Petrovka culture. The Rig-Vedic wave followed several centuries later, "perhaps in the fourteenth century BCE", and is associated by Parpola with the Fedorovo culture.

According to Kochhar there were three waves of Indo-Aryan immigration that occurred after the mature Harappan phase:

the "Murghamu" (Bactria-Margiana Culture) related people who entered Balochistan at Pirak, Mehrgarh south cemetery, and other places, and later merged with the post-urban Harappans during the late Harappans Jhukar phase (2000–1800 BCE);
the Swat IV that co-founded the Harappan Cemetery H phase in Punjab (2000–1800 BCE);
and the Rigvedic Indo-Aryans of Swat V that later absorbed the Cemetery H people and gave rise to the Painted Grey Ware culture (PGW) (to 1400 BCE).

Gandhara grave culture and Ochre Coloured Pottery culture

About 1800 BCE, there is a major cultural change in the Swat Valley with the emergence of the Gandhara grave culture. With its introduction of new ceramics, new burial rites, and the horse, the Gandhara grave culture is a major candidate for early Indo-Aryan presence. The two new burial rites—flexed inhumation in a pit and cremation burial in an urn—were, according to early Vedic literature, both practiced in early Indo-Aryan society. Horse-trappings indicate the importance of the horse to the economy of the Gandharan grave culture. Two horse burials indicate the importance of the horse in other respects. Horse burial is a custom that Gandharan grave culture has in common with Andronovo, though not within the distinctive timber-frame graves of the steppe.

Parpola (2020) states:

The dramatic new discovery of cart burials dated to c. 1900 at Sinauli have been reviewed in this paper, and they support my proposal of a pre-Ṛvedic wave (now set of waves) of Aryan speakers arriving in South Asia and their making contact with the Late Harappans.

Spread of Vedic-Brahmanic culture

During the Early Vedic Period (c. 1500–800 BCE) the Indo-Aryan culture was centered in the northern Punjab, or Sapta Sindhu. During the Later Vedic Period (c. 800–500 BCE) the Indo-Aryan culture started to extend into the western Ganges Plain, centering on the Vedic Kuru and Panchala area, and had some influence at the central Ganges Plain after 500 BCE. Sixteen Mahajanapada developed at the Ganges Plain, of which the Kuru and Panchala became the most notable developed centers of Vedic culture, at the western Ganges Plain.

The Central Ganges Plain, where Magadha gained prominence, forming the base of the Maurya Empire, was a distinct cultural area, with new states arising after 500 BCE during the so-called "Second urbanisation". It was influenced by the Vedic culture, but differed markedly from the Kuru-Panchala region. It "was the area of the earliest known cultivation of rice in the Indian subcontinent and by 1800 BCE was the location of an advanced neolithic population associated with the sites of Chirand and Chechar". In this region the Shramanic movements flourished, and Jainism and Buddhism originated.

Indus Valley Civilization

The Indo-Aryan migration into the northern Punjab started shortly after the decline of the Indus Valley Civilisation (IVC). According to the "Aryan Invasion Theory" this decline was caused by "invasions" of barbaric and violent Aryans who conquered the IVC. This "Aryan Invasion Theory" is not supported by the archeological and genetic data, and is not representative of the "Indo-Aryan migration theory".

Decline of Indus Valley Civilisation

The decline of the IVC from about 1900 BCE started before the onset of the Indo-Aryan migrations, caused by aridisation due to shifting mossoons. A regional cultural discontinuity occurred during the second millennium BCE and many Indus Valley cities were abandoned during this period, while many new settlements began to appear in Gujarat and East Punjab and other settlements such as in the western Bahawalpur region increased in size.

Jim G. Shaffer and Lichtenstein contend that in the second millennium BCE considerable "location processes" took place. In the eastern Punjab 79.9% and in Gujarat 96% of sites changed settlement status. According to Shaffer & Lichtenstein,

It is evident that a major geographic population shift accompanied this 2nd millennium BCE localisation process. This shift by Harappan and, perhaps, other Indus Valley cultural mosaic groups, is the only archaeologically documented west-to-east movement of human populations in the Indian subcontinent before the first half of the first millennium B.C.

Continuity of Indus Valley civilization

According to Erdosy, the ancient Harappans were not markedly different from modern populations in Northwestern India and present-day Pakistan. Craniometric data showed similarity with prehistoric peoples of the Iranian plateau and Western Asia, although Mohenjo-daro was distinct from the other areas of the Indus Valley.

According to Kennedy, there is no evidence of "demographic disruptions" after the decline of the Harappa culture. Kenoyer notes that no biological evidence can be found for major new populations in post-Harappan communities. Hemphill notes that "patterns of phonetic affinity" between Bactria and the Indus Valley Civilisation are best explained by "a pattern of long-standing, but low-level bidirectional mutual exchange".

According to Kennedy, the Cemetery H culture "shows clear biological affinities" with the earlier population of Harappa. The archaeologist Kenoyer noted that this culture "may only reflect a change in the focus of settlement organization from that which was the pattern of the earlier Harappan phase and not cultural discontinuity, urban decay, invading aliens, or site abandonment, all of which have been suggested in the past." Recent excavations in 2008 at Alamgirpur, Meerut District, appeared to show an overlap between the Harappan and PGW pottery indicating cultural continuity.

Relation with Indo-Aryan migrations

According to Kenoyer, the decline of the Indus Valley Civilisation is not explained by Aryan migrations, which took place after the decline of the Indus Valley Civilisation. Yet, according to Erdosy,

Evidence in material culture for systems collapse, abandonment of old beliefs and large-scale, if localised, population shifts in response to ecological catastrophe in the 2nd millennium B.C. must all now be related to the spread of Indo-Aryan languages.

Erdosy, testing hypotheses derived from linguistic evidence against hypotheses derived from archaeological data,states that there is no evidence of "invasions by a barbaric race enjoying technological and military superiority", but "some support was found in the archaeological record for small-scale migrations from Central Asia to the Indian subcontinent in the late 3rd/early 2nd millennia BCE". According to Erdosy, the postulated movements within Central Asia can be placed within a processional framework, replacing simplistic concepts of "diffusion", "migrations" and "invasions".

Scholars have argued that the historical Vedic culture is the result of an amalgamation of the immigrating Indo-Aryans with the remnants of the indigenous civilization, such as the Ochre Coloured Pottery culture. Such remnants of IVC culture are not prominent in the Rigveda, with its focus on chariot warfare and nomadic pastoralism in stark contrast with an urban civilization.

Inner Asia – Wusun and Yuezhi

The Tarim Basin, 2008

Wusun and their neighbours during the late 2nd century BCE. The Yancai did not change their name to Alans until the 1st century.

The migrations of the Yuezhi through Central Asia, from around 176 BCE to 30 CE

According to Christopher I. Beckwith the Wusun, an Indo-European Caucasian people of Inner Asia in antiquity, were also of Indo-Aryan origin. From the Chinese term Wusun, Beckwith reconstructs the Old Chinese *âswin, which he compares to the Old Indic aśvin "the horsemen", the name of the Rigvedic twin equestrian gods. Beckwith suggests that the Wusun were an eastern remnant of the Indo-Aryans, who had been suddenly pushed to the extremeties of the Eurasian Steppe by the Iranian peoples in the 2nd millennium BCE.

The Wusun are first mentioned by Chinese sources as vassals in the Tarim Basin of the Yuezhi, another Indo-European Caucasian people of possible Tocharian stock. Around 175 BCE, the Yuezhi were utterly defeated by the Xiongnu, also former vassals of the Yuezhi. The Yuezhi subsequently attacked the Wusun and killed their king (Kunmo Chinese: 昆莫 or Kunmi Chinese: 昆彌) Nandoumi (Chinese: 難兜靡), capturing the Ili Valley from the Saka (Scythians) shortly afterwards. In return the Wusun settled in the former territories of the Yuezhi as vassals of the Xiongnu.

The son of Nandoumi was adopted by the Xiongnu king and made leader of the Wusun. Around 130 BCE he attacked and utterly defeated the Yuezhi, settling the Wusun in the Ili Valley. After the Yuezhi were defeated by the Xiongnu, in the 2nd century BCE, a small group, known as the Little Yuezhi, fled to the south, while the majority migrated west to the Ili Valley, where they displaced the Sakas (Scythians). Driven from the Ili Valley shortly afterwards by the Wusun, the Yuezhi migrated to Sogdia and then Bactria, where they are often identified with the Tókharoi (Τοχάριοι) and Asii of Classical sources. They then expanded into northern Indian subcontinent, where one branch of the Yuezhi founded the Kushan Empire. The Kushan empire stretched from Turpan in the Tarim Basin to Pataliputra on the Indo-Gangetic Plain at its greatest extent, and played an important role in the development of the Silk Road and the transmission of Buddhism to China.

Soon after 130 BCE the Wusun became independent of the Xiongnu, becoming trusted vassals of the Han dynasty and powerful force in the region for centuries. With the emerging steppe federations of the Rouran, the Wusun migrated into the Pamir Mountains in the 5th century CE. They are last mentioned in 938 when a Wusun chieftain paid tribute to the Liao dynasty.

Second wave – Iranians

The first Iranians to reach the Black Sea may have been the Cimmerians in the 8th century BCE, although their linguistic affiliation is uncertain. They were followed by the Scythians, who would dominate the area, at their height, from the Carpathian Mountains in the west, to the easternmost fringes of Central Asia in the east. For most of their existence, the Scythians were based in what is modern-day Ukraine and southern European Russia. Sarmatian tribes, of whom the best known are the Roxolani (Rhoxolani), Iazyges (Jazyges) and the Alans, followed the Scythians westwards into Europe in the late centuries BCE and the 1st and 2nd centuries of the Common Era (The Migration Period). The populous Sarmatian tribe of the Massagetae, dwelling near the Caspian Sea, were known to the early rulers of Persia in the Achaemenid Period. In the east, the Scythians occupied several areas in Xinjiang, from Khotan to Tumshuq.

The Medes, Parthians and Persians begin to appear on the western Iranian Plateau from c. 800 BCE, after which they remained under Assyrian rule for several centuries, as it was with the rest of the peoples in the Near East. The Achaemenids replaced Median rule from 559 BCE. Around the first millennium of the Common Era (AD), the Kambojas, the Pashtuns and the Baloch began to settle on the eastern edge of the Iranian Plateau, on the mountainous frontier of northwestern and western Pakistan, displacing the earlier Indo-Aryans from the area.

In Central Asia, the Turkic languages have marginalized Iranian languages as a result of the Turkic migration of the early centuries CE. In Eastern Europe, Slavic and Germanic peoples assimilated and absorbed the native Iranian languages (Scythian and Sarmatian) of the region. Extant major Iranian languages are Persian, Pashto, Kurdish, and Balochi, besides numerous smaller ones.

Anthropology: elite recruitment and language shift

Elite dominance

Small groups can change a larger cultural area, and elite male dominance by small groups may have led to a language shift in northern India. Thapar notes that Indo-Aryan chiefs may have provided protection to non-Aryan agriculturalists, offering a system of patronage placing the chiefs in a superior position. This would have involved bilingualism, resulting in the adoption of Indo-Aryan languages by local populations. According to Parpola, local elites joined "small but powerful groups" of Indo-European speaking migrants. These migrants had an attractive social system and good weapons, and luxury goods which marked their status and power. Joining these groups was attractive for local leaders, since it strengthened their position, and gave them additional advantages. These new members were further incorporated by matrimonial alliances.

Renfrew: models of "linguistic replacement"

Basu et al. refer to Renfrew, who described four models for "linguistic replacement":

The demographic-subsistence model, exemplified by the process of agricultural dispersal, in which the incoming group has exploitive technologies which makes them dominant. It may lead to significant gene flow, and significant genetic changes in the population. But it may also lead to acculturalisation, in which case the technologies are taken over, but there is less change in the genetic composition of the population;
The existence of extended trading systems which led to the development of a lingua franca, in which case some gene flow is to be expected;
The elite dominance model, in which "a relatively small but well-organized group [...] take[s] over the system". Given the small size of the elite, its genetic influence may also be small, though "preferential access to marriage partners" may result in a relatively strong influence on the gene pool. Sexual asymmetry may also be of influence: incoming elites often consist mostly of males, who have no influence on the mitochondrial DNA of the gene pool, but may influence the Y chromosomes of the gene pool;
System collapse, in which territorial boundaries are changed, and elite dominance may appear for a while.

David Anthony: elite recruitment

David Anthony, in his "revised Steppe hypothesis" notes that the spread of the Indo-European languages probably did not happen through "chain-type folk migrations", but by the introduction of these languages by ritual and political elites, which are emulated by large groups of people. Anthony gives the example of the Southern Luo-speaking Acholi in northern Uganda in the 17th and 18th century, whose language spread rapidly in the 19th century. Anthony notes that "Indo-European languages probably spread in a similar way among the tribal societies of prehistoric Europe", carried forward by "Indo-European chiefs" and their "ideology of political clientage". Anthony notes that "elite recruitment" may be a suitable term for this system.

Michael Witzel: small groups and acculturation

Michael Witzel refers to Ehret's model "which stresses the osmosis, or a 'billiard ball', or Mallory's Kulturkugel, effect of cultural transmission". According to Ehret, ethnicity and language can shift with relative ease in small societies, due to the cultural, economic and military choices made by the local population in question. The group bringing new traits may initially be small, contributing features that can be fewer in number than those of the already local culture. The emerging combined group may then initiate a recurrent, expansionist process of ethnic and language shift.

Witzel notes that "arya/ārya does not mean a particular 'people' or even a particular 'racial' group but all those who had joined the tribes speaking Vedic Sanskrit and adhering to their cultural norms (such as ritual, poetry, etc.)." According to Witzel, "there must have been a long period of acculturation between the local population and the 'original' immigrants speaking Indo-Aryan." Witzel also notes that the speakers of Indo-Aryan and the local population must have been bilingual, speaking each other's languages and interacting with each other, before the Rg Veda was composed in the Punjab.

Salmons: systematic changes in community structure

Joseph Salmons notes that Anthony presents scarce concrete evidence or arguments. Salmons is critical about the notion of "prestige" as a central factor in the shift to Indo-European languages, referring to Milroy who notes that "prestige" is "a cover term for a variety of very distinct notions". Instead, Milroy offers "arguments built around network structure", though Salmons also notes that Anthony includes several of those arguments, "including political and technological advantages". According to Salmons, the best model is offered by Fishman, who

... understands shift in terms of geographical, social, and cultural "dislocation" of language communities. Social dislocation, to give the most relevant example, involves "siphoning off the talented, the enterprising, the imaginative and the creative" ([Fishman] 1991: 61), and sounds strikingly like Anthony's 'recruitment' scenario.

Salmons himself argues that

... systematic changes in community structure are what drive language shift, incorporating Milroy's network structures as well. The heart of the view is the quintessential element of modernization, namely a shift from local community-internal organization to regional (state or national or international, in modern settings), extra-community organizations. Shift correlates with this move from pre-dominantly "horizontal" community structures to more "vertical" ones.

Genetics: ancient ancestry and multiple gene flows

India has one of the most genetically diverse populations in the world, and the history of this genetic diversity is the topic of continued research and debate. The Indo-Aryan migrations form part of a complex genetical puzzle on the origin and spread of the various components of the Indian population, including various waves of admixture and language shift. The genetic impact of the Indo-Aryans may have been marginal, but this is not at odds with the cultural and linguistic influence, since language shift is possible without a change in genetics.

Ancestral groups

Common maternal ancestry

Sahoo et al. (2006) states that "there is general agreement that Indian caste and tribal populations share a common late Pleistocene maternal ancestry in India."

Kivisild et al. (1999) concluded that there is "an extensive deep late Pleistocene genetic link between contemporary Europeans and Indians" via the mitochondrial DNA, that is, DNA which is inherited from the mother. According to them, the two groups split at the time of the peopling of Asia and Eurasia and before modern humans entered Europe. Kivisild et al. (2000) note that "the sum of any recent (the last 15,000 years) western mtDNA gene flow to India comprises, in average, less than 10 percent of the contemporary Indian mtDNA lineages."

Kivisild et al. (2003) and Sharma (2005) note that north and south Indians share a common maternal ancestry: Kivisild et al. (2003) further note that "these results show that Indian tribal and caste populations derive largely from the same genetic heritage of Pleistocene southern and western Asians and have received limited gene flow from external regions since the Holocene.

"Ancestral North Indians" and "Ancestral South Indians"

Reich et al. (2009), in a collaborative effort between the Harvard Medical School and the Centre for Cellular and Molecular Biology (CCMB), examined the entire genomes worth 560,000 single nucleotide polymorphisms (SNPs), as compared to 420 SNPs in prior work. They also cross-compared them with the genomes of other regions available in the global genome database. Through this study, they were able to discern two genetic groups in the majority of populations in India, which they called "Ancestral North Indians" (ANI) and "Ancestral South Indians" (ASI). They found that the ANI genes are close to those of Middle Easterners, Central Asians and Europeans whereas the ASI genes are dissimilar to all other known populations outside India, though the indigenous Andamanese were determined to be the most closely related to the ASI population of any living group (albeit distinct from the ASI). These two distinct groups, which had split ca. 50,000 years ago, formed the basis for the present population of India.

The two groups mixed between 1,900 and 4,200 years ago (2200 BCE – 100 CE), where-after a shift to endogamy took place and admixture became rare. Speaking to Fountain Ink, David Reich stated, "Prior to 4,200 years ago, there were unmixed groups in India. Sometime between 1,900 to 4,200 years ago, profound, pervasive convulsive mixture occurred, affecting every Indo-European and Dravidian group in India without exception." Reich pointed out that their work does not show that a substantial migration occurred during this time.

Metspalu et al. (2011), representing a collaboration between the Estonian Biocenter and CCMB, confirmed that the Indian populations are characterized by two major ancestry components. One of them is spread at comparable frequency and haplotype diversity in populations of South and West Asia and the Caucasus. The second component is more restricted to South Asia and accounts for more than 50% of the ancestry in Indian populations. Haplotype diversity associated with these South Asian ancestry components is significantly higher than that of the components dominating the West Eurasian ancestry palette.

Additional components

ArunKumar et al. (2015) discern three major ancestry components, which they call "Southwest Asian", "Southeast Asian" and "Northeast Asian". The Southwest Asian component seems to be a native Indian component, while the Southeast Asian component is related to East Asian populations. Brahmin populations "contained 11.4 and 10.6% of Northern Eurasian and Mediterranean components, thereby suggesting a shared ancestry with the Europeans". They note that this fits with earlier studies which "suggested similar shared ancestries with Europeans and Mediterraneans". They further note that

Studies based on uni-parental marker have shown diverse Y-chromosomal haplogroups making up the Indian gene pool. Many of these Y-chromosomal markers show a strong correlation to the linguistic affiliation of the population. The genome-wide variation of the Indian samples in the present study correlated with the linguistic affiliation of the sample.

They conclude that, while there may have been an ancient settlement in the subcontinent, "male-dominated genetic elements shap[ed] the Indian gene pool", and that these elements "have earlier been correlated to various languages", and further note "the fluidity of female gene pools when in a patriarchal and patrilocal society, such as that of India".

Basu et al. (2016) extend the study of Reich et al. (2009) by postulating two other populations in addition to the ANI and ASI: "Ancestral Austro-Asiatic" (AAA) and "Ancestral Tibeto-Burman" (ATB), corresponding to the Austroasiatic and Tibeto-Burman language speakers. According to them, ancestral populations seem to have occupied geographically separated habitats. The ASI and the AAA were early settlers, who possibly arrived via the southern wave out of Africa. The ANI are related to Central South Asians and entered India through the northwest, while the ATB are related to East Asians and entered India through northeast corridors. They further note that

The asymmetry of admixture, with ANI populations providing genomic inputs to tribal populations (AA, Dravidian tribe, and TB) but not vice versa, is consistent with elite dominance and patriarchy. Males from dominant populations, possibly upper castes, with high ANI component, mated outside of their caste, but their offspring were not allowed to be inducted into the caste. This phenomenon has been previously observed as asymmetry in homogeneity of mtDNA and heterogeneity of Y-chromosomal haplotypes in tribal populations of India as well as the African Americans in United States.

Male-mediated migration

Reich et al. (2009), citing Kivisild et al. (1999), indicate that there has been a low influx of female genetic material since 50,000 years ago, but a "male gene flow from groups with more ANI relatedness into ones with less".

ArunKumar et al. (2015) "suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India."

North-south cline

According to Metspalu et al. (2011) there is "a general principal component cline stretching from Europe to south India". This northwest component is shared with populations from the Middle East, Europe and Central Asia, and is thought to represent at least one ancient influx of people from the northwest. According to Saraswathy et al. (2010), there is "a major genetic contribution from Eurasia to North Indian upper castes" and a "greater genetic inflow among North Indian caste populations than is observed among South Indian caste and tribal populations". According to Basu et al. (2003) and Saraswathy et al. (2010) certain sample populations of upper caste North Indians show a stronger affinity to Central Asian caucasians, whereas southern Indian Brahmins show a less stronger affinity.

Scenarios

While Reich notes that the onset of admixture coincides with the arrival of Indo-European language, according to Metspalu (2011), the commonalities of the ANI with European genes cannot be explained by the influx of Indo-Aryans at ca. 3,500 BP alone. They state that the split of ASI and ANI predates the Indo-Aryan migration, both of these ancestry components being older than 3,500 BP." Moorjani (2013) states that "We have further shown that groups with unmixed ANI and ASI ancestry were plausibly living in India until this time." Moorjani (2013) describes three scenarios regarding the bringing together of the two groups:

"migrations that occurred prior to the development of agriculture [8,000–9,000 years before present (BP)]. Evidence for this comes from mitochondrial DNA studies, which have shown that the mitochondrial haplogroups (hg U2, U7, and W) that are most closely shared between Indians and West Eurasians diverged about 30,000–40,000 years BP."
"Western Asian peoples migrated to India along with the spread of agriculture [...] Any such agriculture related migrations would probably have begun at least 8,000–9,000 years BP (based on the dates for Mehrgarh) and may have continued into the period of the Indus civilization that began around 4,600 years BP and depended upon West Asian crops."
"migrations from Western or Central Asia from 3,000 to 4,000 years BP, a time during which it is likely that Indo-European languages began to be spoken in the subcontinent. A difficulty with this theory, however, is that by this time India was a densely populated region with widespread agriculture, so the number of migrants of West Eurasian ancestry must have been extraordinarily large to explain the fact that today about half the ancestry in India derives from the ANI."

Pre-agricultural migrations

Metspalu et al. (2011) detected a genetic component in India, k5, which "distributed across the Indus Valley, Central Asia, and the Caucasus". According to Metspalu et al. (2011), k5 "might represent the genetic vestige of the ANI", though they also note that the geographic cline of this component within India "is very weak, which is unexpected under the ASI-ANI model", explaining that the ASI-ANI model implies an ANI contribution which decreases toward southern India. According to Metspalu et al. (2011), "regardless of where this component was from (the Caucasus, Near East, Indus Valley, or Central Asia), its spread to other regions must have occurred well before our detection limits at 12,500 years." Speaking to Fountain Ink, Metspalu said, "the West Eurasian component in Indians appears to come from a population that diverged genetically from people actually living in Eurasia, and this separation happened at least 12,500 years ago." Moorjani et al. (2013) refer to Metspalu (2011) as "fail[ing] to find any evidence for shared ancestry between the ANI and groups in West Eurasia within the past 12,500 years". CCMB researcher Thangaraj believes that "it was much longer ago", and that "the ANI came to India in a second wave of migration that happened perhaps 40,000 years ago."

Narasimhan et al. (2019) conclude that ANI and ASI were formed in the 2nd millennium BCE. They were preceded by IVC-people, a mixture of AASI (ancient ancestral south Indians, that is, hunter-gatherers related), and people related to but distinct from Iranian agri-culturalists, lacking the Anatolian farmer-related ancestry which was common in Iranian farmers after 6000 BCE. Those Iranian farmers-related people may have arrived in India before the advent of farming in northern India, and mixed with people related to Indian hunter-gatherers ca. 5400 to 3700 BCE, before the advent of the mature IVC. This mixed IVC-population, which probably was native to the Indus Valley Civilisation, "contributed in large proportions to both the ANI and ASI", which took shape during the 2nd millennium BCE. ANI formed out of a mixture of "Indus_Periphery-related groups" and migrants from the steppe, while ASI was formed out of "Indus_Periphery-related groups" who moved south and mixed with hunter-gatherers.

Agricultural migrations

Near-Eastern migrations

Late Harappan phase (1900–1300 BCE)

Early Vedic Culture (1700–1100 BCE)

Kivisild et al. (1999) note that "a small fraction of the 'Caucasoid-specific' mtDNA lineages found in Indian populations can be ascribed to a relatively recent admixture." at ca. 9,300 ± 3,000 years before present, which coincides with "the arrival to India of cereals domesticated in the fertile Crescent" and "lends credence to the suggested linguistic connection between Elamite and Dravidic populations".

According to Gallego Romero et al. (2011), their research on lactose tolerance in India suggests that "the west Eurasian genetic contribution identified by Reich et al. (2009) principally reflects gene flow from Iran and the Middle East." Gallego Romero notes that Indians who are lactose-tolerant show a genetic pattern regarding this tolerance which is "characteristic of the common European mutation". According to Gallego Romero, this suggests that "the most common lactose tolerance mutation made a two-way migration out of the Middle East less than 10,000 years ago. While the mutation spread across Europe, another explorer must have brought the mutation eastward to India – likely traveling along the coast of the Persian Gulf where other pockets of the same mutation have been found." In contrast, Allentoft et al. (2015) found that lactose-tolerance was absent in the Yamnaya culture, noting that while "the Yamnaya and these other Bronze Age cultures herded cattle, goats, and sheep, they couldn't digest raw milk as adults. Lactose tolerance was still rare among Europeans and Asians at the end of the Bronze Age, just 2000 years ago."

According to Lazaridis et al. (2016), "farmers related to those from Iran spread northward into the Eurasian steppe; and people related to both the early farmers of Iran and to the pastoralists of the Eurasian steppe spread eastward into South Asia." They further note that ANI "can be modelled as a mix of ancestry related to both early farmers of western Iran and to people of the Bronze Age Eurasian steppe".

Haplogroup R1a and related haplogroups

R1a origins (Underhill 2010; R1a migration to Eastern Europe; R1a1a diversification (Pamjav 2012); and R1a1a oldest expansion and highest frequency (Underhill 2014)

The distribution and proposed origin of haplogroup R1a, more specifically R1a1a1b, is often being used as an argument pro or contra the Indo-Aryan migrations. It is found in high frequencies in Eastern Europe (Z282) and south Asia (Z93), the areas of the Indo-European migrations. The place of origin of this haplogroup may give an indication of the "homeland" of the Indo-Europeans, and the direction of the first migrations.

Cordeaux et al. (2004), based on the spread of a cluster of haplogroups (J2, R1a, R2, and L) in India, with higher rates in northern India, argue that agriculture in south India spread with migrating agriculturalists, which also influenced the genepool in south India.

Sahoo et al. (2006), in response to Cordeaux et al. (2004), suggest that those haplogroups originated in India, based on the spread of these various haplogroups in India. According to Sahoo et al. (2006), this spread "argue[s] against any major influx, from regions north and west of India, of people associated either with the development of agriculture or the spread of the Indo-Aryan language family". They further propose that "the high incidence of R1* and R1a throughout Central Asian and East European populations (without R2 and R* in most cases) is more parsimoniously explained by gene flow in the opposite direction", which according to Sahoo et al. (2006) explains the "sharing of some Y-chromosomal haplogroups between Indian and Central Asian populations".

Sengupta et al. (2006) also comment on Cordeaux et al. (2004), stating that "the influence of Central Asia on the pre-existing gene pool was minor", and arguing for "a peninsular origin of Dravidian speakers than a source with proximity to the Indus and with significant genetic input resulting from demic diffusion associated with agriculture".

Sharma et al. (2009) found a high frequency of R1a1 in India. They therefore argue for an Indian origin of R1a1, and dispute "the origin of Indian higher most castes from Central Asian and Eurasian regions, supporting their origin within the Indian subcontinent".

Underhill et al. (2014/2015) conclude that R1a1a1, the most frequent subclade of R1a, split into Z282 (Europe) and Z93 (Asia) at circe 5,800 before present. According to Underhill et al. (2014/2015), "[t]his suggests the possibility that R1a lineages accompanied demic expansions initiated during the Copper, Bronze, and Iron ages." They further note that the diversification of Z93 and the "early urbanization within the Indus Valley also occurred at this time and the geographic distribution of R1a-M780 (Figure 3d) may reflect this".

Palanichamy et al. (2015), while responding to Cordeaux et al. (2004), Sahoo et al. (2006) and Sengupta et al. (2006), elaborated on Kivisild et al.'s (1999) suggestion that West Eurasian haplogroups "may have been spread by the early Neolithic migrations of proto-Dravidian farmers spreading from the eastern horn of the Fertile Crescent into India". They conclude that "the L1a lineage arrived from western Asia during the Neolithic period and perhaps was associated with the spread of the Dravidian language to India", indicating that "the Dravidian language originated outside India and may have been introduced by pastoralists coming from western Asia (Iran)." They further conclude that two subhalogroups originated with the Dravidian speaking peoples, and may have come to South India when the Dravidian language spread.

Poznik et al. (2016) note that "striking expansions" occurred within R1a-Z93 at ~4,500–4,000 years ago, which "predates by a few centuries the collapse of the Indus Valley Civilisation". Mascarenhas et al. (2015) note that the expansion of Z93 from Transcaucasia into South Asia is compatible with "the archeological records of eastward expansion of West Asian populations in the 4th millennium BCE culminating in the so-called Kura-Araxes migrations in the post-Uruk IV period".

Indo-European migrations

Genetic impact of Indo-Aryan migrations

Bamshad et al. (2001), Wells et al. (2002) and Basu et al. (2003) argue for an influx of Indo-European migrants into the Indian subcontinent, but not necessarily an "invasion of any kind". Bamshad et al. (2001) notice that the correlation between caste-status and West Eurasian DNA may be explained by subsequent male immigration into the Indian subcontinent. Basu et al. (2003) argue that the Indian subcontinent was subjected to a series of Indo-European migrations about 1500 BCE.

Metspalu et al. (2011) note that "any nonmarginal migration from Central Asia to South Asia should have also introduced readily apparent signals of East Asian ancestry into India" (although this presupposes the unproven assumption that East Asian ancestry was present – to a significant extent – in prehistorical Central Asia), which is not the case, and conclude that if there was a major migration of Eurasians into India, this happened before the rise of the Yamna culture. Based on Metspalu (2011), Lalji Singh, a co-author of Metspalu, concludes that "[t]here is no genetic evidence that Indo-Aryans invaded or migrated to India".

Moorjani et al. (2013) notes that the period of 4,200–1,900 years BP was a time of dramatic changes in northern India, and coincides with the "likely first appearance of Indo-European languages and Vedic religion in the subcontinent". Moorjani further notes that there must have been multiple waves of admixture, which had more impact on higher-caste and northern Indians and took place more recently. This may be explained by "additional gene flow", related to the spread of languages:

...at least some of the history of population mixture in India is related to the spread of languages in the subcontinent. One possible explanation for the generally younger dates in northern Indians is that after an original mixture event of ANI and ASI that contributed to all present-day Indians, some northern groups received additional gene flow from groups with high proportions of West Eurasian ancestry, bringing down their average mixture date.

Palanichamy et al. (2015), elaborating on Kivisild et al. (1999) conclude that "A large proportion of the west Eurasian mtDNA haplogroups observed among the higher-ranked caste groups, their phylogenetic affinity and age estimate indicate recent Indo-Aryan migration to India from west Asia. According to Palanichamy et al. (2015), "the west Eurasian admixture was restricted to caste rank. It is likely that Indo-Aryan migration has influenced the social stratification in the pre-existing populations and helped in building the Hindu caste system, but it should not be inferred that the contemporary Indian caste groups have directly descended from Indo-Aryan immigrants.

Jones et al. (2015) state that Caucasus hunter gatherer(CHG) was "a major contributor to the Ancestral North Indian component". According to Jones et al. (2015), it "may be linked with the spread of Indo-European languages", but they also note that "earlier movements associated with other developments such as that of cereal farming and herding are also plausible".

Basu et al. (2016) note that the ANI are inseparable from Central-South Asian populations in present-day Pakistan. They hypothesise that "the root of ANI is in Central Asia".

According to Lazaridis et al. (2016) ANI "can be modelled as a mix of ancestry related to both early farmers of western Iran and to people of the Bronze Age Eurasian steppe".

Silva et al. (2017) state that "the recently refined Y-chromosome tree strongly suggests that R1a is indeed a highly plausible marker for the long-contested Bronze Age spread of Indo-Aryan speakers into South Asia." Silva et al. (2017) further notes "they likely spread from a single Central Asian source pool, there do seem to be at least three and probably more R1a founder clades within the Subcontinent, consistent with multiple waves of arrival."

Narasimhan et al. (2018) conclude that pastoralists spread southwards from the Eurasian steppe during the period 2300–1500 BCE. These pastoralists during the 2nd millennium BCE, who were likely associated with Indo-European languages, presumably mixed with the descendants of the Indus Valley Civilisation, who in turn were a mix of Iranian agriculturalists and South Asian hunter-gatherers forming "the single most important source of ancestry in South Asia."

Zerjal et al. (2002) argue that "multiple recent events" may have reshaped India's genetic landscape.

Origins of R1a-Z93

Ornella Semino et al. (2000) proposed Ukrainian origins of R1a1, and a postglacial spread of the R1a1 gene during the Late Glacial, subsequently magnified by the expansion of the Kurgan culture into Europe and eastward. Spencer Wells proposes central Asian origins, suggesting that the distribution and age of R1a1 points to an ancient migration corresponding to the spread by the Kurgan people in their expansion from the Eurasian Steppe. According to Pamjav et al. (2012), "Inner and Central Asia is an overlap zone for the R1a1-Z280 and R1a1-Z93 lineages [which] implies that an early differentiation zone of R1a1-M198 conceivably occurred somewhere within the Eurasian Steppes or the Middle East and Caucasus region as they lie between South Asia and Eastern Europe."

A 2014 study by Peter A. Underhill et al., using 16,244 individuals from over 126 populations from across Eurasia, concluded that there was compelling evidence that "the initial episodes of haplogroup R1a diversification likely occurred in the vicinity of present-day Iran."

According to Martin P. Richards, co-author of Silva et al. (2017), "[the prevalence of R1a in India was] very powerful evidence for a substantial Bronze Age migration from central Asia that most likely brought Indo-European speakers to India."

Literary research: similarities, geography, and references to migration

Similarities

Mitanni

The oldest inscriptions in Old Indic, the language of the Rig Veda, is found not in India, but in northern Syria in Hittite records regarding one of their neighbors, the Hurrian-speaking Mitanni. In a treaty with the Hittites, the king of Mitanni, after swearing by a series of Hurrian gods, swears by the gods Mitrašil, Uruvanaššil, Indara, and Našatianna, who correspond to the Vedic gods Mitra, Varuna, Indra, and Nāsatya (Aśvin). Contemporary equestrian terminology, as recorded in a horse-training manual whose author is identified as "Kikkuli", contains Indo-Aryan loanwords. The personal names and gods of the Mitanni aristocracy also bear significant traces of Indo-Aryan. Because of the association of Indo-Aryan with horsemanship and the Mitanni aristocracy, it is presumed that, after superimposing themselves as rulers on a native Hurrian-speaking population about the 15th–16th centuries BCE, Indo-Aryan charioteers were absorbed into the local population and adopted the Hurrian language.

Brentjes argues that there is not a single cultural element of central Asian, Eastern European, or Caucasian origin in the Mitannian area; he also associates with an Indo-Aryan presence the peacock motif found in the Middle East from before 1600 BCE and quite likely from before 2100 BCE.

Scholars reject the possibility that the Indo-Aryans of Mitanni came from the Indian subcontinent as well as the possibility that the Indo-Aryans of the Indian subcontinent came from the territory of Mitanni, leaving migration from the north the only likely scenario. The presence of some Bactria-Margiana loan words in Mitanni, Old Iranian and Vedic further strengthens this scenario.

Iranian Avesta

The religious practices depicted in the Rigveda and those depicted in the Avesta, the central religious text of Zoroastrianism—the ancient Iranian faith founded by the prophet Zoroaster—have in common the deity Mitra, priests called hotṛ in the Rigveda and zaotar in the Avesta, and the use of a ritual substance that the Rigveda calls soma and the Avesta haoma. However, the Indo-Aryan deva 'god' is cognate with the Iranian daēva 'demon'. Similarly, the Indo-Aryan asura 'name of a particular group of gods' (later on, 'demon') is cognate with the Iranian ahura 'lord, god,' which 19th and early 20th century authors such as Burrow explained as a reflection of religious rivalry between Indo-Aryans and Iranians.

Linguists such as Burrow argue that the strong similarity between the Avestan of the Gāthās—the oldest part of the Avesta—and the Vedic Sanskrit of the Rigveda pushes the dating of Zarathustra or at least the Gathas closer to the conventional Rigveda dating of 1500–1200 BCE, i.e. 1100 BCE, possibly earlier. Boyce concurs with a lower date of 1100 BCE and tentatively proposes an upper date of 1500 BCE. Gnoli dates the Gathas to around 1000 BCE, as does Mallory (1989), with the caveat of a 400-year leeway on either side, i.e. between 1400 and 600 BCE. Therefore, the date of the Avesta could also indicate the date of the Rigveda.

There is mention in the Avesta of Airyan Vaejah, one of the '16 the lands of the Aryans'. Gnoli's interpretation of geographic references in the Avesta situates the Airyanem Vaejah in the Hindu Kush. For similar reasons, Boyce excludes places north of the Syr Darya and western Iranian places. With some reservations, Skjaervo concurs that the evidence of the Avestan texts makes it impossible to avoid the conclusion that they were composed somewhere in northeastern Iran. Witzel points to the central Afghan highlands. Humbach derives Vaējah from cognates of the Vedic root "vij", suggesting the region of fast-flowing rivers. Gnoli considers Choresmia (Xvairizem), the lower Oxus region, south of the Aral Sea to be an outlying area in the Avestan world. However, according to Mallory & Mair (2000), the probable homeland of Avestan is, in fact, the area south of the Aral Sea.

Geographical location of Rigvedic rivers

Cluster of Indus Valley Civilization site along the course of the Indus River and in Pakistan and the Ghaggar-Hakra in India and Pakistan. See Sameer et al. (2018) for a more detailed map.

The geography of the Rigveda seems to be centered on the land of the seven rivers. While the geography of the Rigvedic rivers is unclear in some of the early books of the Rigveda, the Nadistuti sukta is an important source for the geography of late Rigvedic society.

The Sarasvati River is one of the chief Rigvedic rivers. The Nadistuti sukta in the Rigveda mentions the Sarasvati between the Yamuna in the east and the Sutlej in the west, and later texts like the Brahmanas and Mahabharata mention that the Sarasvati dried up in a desert.

Scholars agree that at least some of the references to the Sarasvati in the Rigveda refer to the Ghaggar-Hakra River, while the Afghan river Haraxvaiti/Harauvati Helmand is sometimes quoted as the locus of the early Rigvedic river. Whether such a transfer of the name has taken place from the Helmand to the Ghaggar-Hakra is a matter of dispute. Identification of the early Rigvedic Sarasvati with the Ghaggar-Hakra before its assumed drying up early in the second millennium would place the Rigveda BCE, well outside the range commonly assumed by Indo-Aryan migration theory.

A non-Indo-Aryan substratum in the river-names and place-names of the Rigvedic homeland would support an external origin of the Indo-Aryans. However, most place-names in the Rigveda and the vast majority of the river-names in the north-west of the Indian subcontinent are Indo-Aryan. Non-Indo-Aryan names are, however, frequent in the Ghaggar and Kabul River areas, the first being a post-Harappan stronghold of Indus populations.

Textual references to migrations

Rigveda

Probable geographic expansion of late Vedic culture.

Just as the Avesta does not mention an external homeland of the Zoroastrians, the Rigveda does not explicitly refer to an external homeland or to a migration. Later Hindu texts, such as the Brahmanas, Mahabharata, Ramayana, and Puranas, are centered in the Ganges region (rather than Haryana and Punjab) and mention regions still further to the south and east, suggesting a later movement or expansion of the Vedic religion and culture to the east. There is no clear indication of general movement in either direction in the Rigveda itself; searching for indirect references in the text, or by correlating geographic references with the proposed order of composition of its hymns, has not led to any consensus on the issue.

Srauta Sutra of Baudhayana

According to Romila Thapar, the Srauta Sutra of Baudhayana "refers to the Parasus and the arattas who stayed behind and others who moved eastwards to the middle Ganges valley and the places equivalent such as the Kasi, the Videhas and the Kuru Pancalas, and so on. In fact, when one looks for them, there are evidence for migration."

Later Vedic and Hindu texts

Texts like the Puranas and Mahabharata belong to a much later period than the Rigveda, making their evidence less than sufficient to be used for or against the Indo-Aryan migration theory.

Later Vedic texts show a shift of location from the Punjab to the East. According to the Yajurveda, Yajnavalkya (a Vedic ritualist and philosopher) lived in the eastern region of Mithila. Aitareya Brahmana 33.6.1. records that Vishvamitra's sons migrated to the north, and in Shatapatha Brahmana 1:2:4:10 the Asuras were driven to the north. In much later texts, Manu was said to be a king from Dravida. In the legend of the flood he stranded with his ship in Northwestern India or the Himalayas. The Vedic lands (e.g. Aryavarta, Brahmavarta) are located in Northern India or at the Sarasvati and Drishadvati river. However, in a post-Vedic text the Mahabharata Udyoga Parva (108), the East is described as the homeland of the Vedic culture, where "the divine Creator of the universe first sang the Vedas". The legends of Ikshvaku, Sumati and other Hindu legends may have their origin in Southeast Asia.

The Puranas record that Yayati left Prayag (confluence of the Ganges & Yamuna) and conquered the region of Sapta Sindhu. His five sons Yadu, Druhyus, Puru, Anu and Turvashu correspond to the main tribes of the Rigveda.

The Puranas also record that the Druhyus were driven out of the land of the seven rivers by Mandhatr and that their next king Gandhara settled in a north-western region which became known as Gandhara. The sons of the later Druhyu king Prachetas are supposed by some to have 'migrated' to the region north of Afghanistan though the Puranic texts only speak of an "adjacent" settlement.

Ecology

Climate change and drought may have triggered both the initial dispersal of Indo-European speakers, and the migration of Indo-Europeans from the steppes in south-central Asia and India.

Around 4200–4100 BCE a climate change occurred, manifesting in colder winters in Europe. Between 4200 and 3900 BCE many tell settlements in the lower Danube Valley were burned and abandoned, while the Cucuteni-Tripolye culture showed an increase in fortifications, meanwhile moving eastwards towards the Dniepr. Steppe herders, archaic Proto-Indo-European speakers, spread into the lower Danube valley about 4200–4000 BCE, either causing or taking advantage of the collapse of Old Europe.

The Yamna horizon was an adaptation to a climate change which occurred between 3500 and 3000 BCE, in which the steppes became drier and cooler. Herds needed to be moved frequently to feed them sufficiently, and the use of wagons and horse-back riding made this possible, leading to "a new, more mobile form of pastoralism". It was accompanied by new social rules and institutions, to regulate the local migrations in the steppes, creating a new social awareness of a distinct culture, and of "cultural Others" who did not participate in these new institutions.

In the second century BCE widespread aridization led to water shortages and ecological changes in both the Eurasian steppes and south Asia. At the steppes, humidization led to a change of vegetation, triggering "higher mobility and transition to the nomadic cattle breeding". Water shortage also had a strong impact in south Asia:

This time was one of great upheaval for ecological reasons. Prolonged failure of rains caused acute water shortage in a large area, causing the collapse of sedentary urban cultures in south-central Asia, Afghanistan, Iran, and India, and triggering large-scale migrations. Inevitably, the new arrivals came to merge with and dominate the post-urban cultures.

The Indus Valley Civilisation was localised, that is, urban centers disappeared and were replaced by local cultures, due to a climatic change that is also signalled for the neighbouring areas of the Middle East. As of 2016 many scholars believe that drought and a decline in trade with Egypt and Mesopotamia caused the collapse of the Indus Civilisation. The Ghaggar-Hakra system was rain-fed, and water-supply depended on the monsoons. The Indus valley climate grew significantly cooler and drier from about 1800 BCE, linked to a general weakening of the monsoon at that time. The Indian monsoon declined and aridity increased, with the Ghaggar-Hakra retracting its reach towards the foothills of the Himalaya, leading to erratic and less extensive floods that made inundation agriculture less sustainable. Aridification reduced the water supply enough to cause the civilisation's demise, and to scatter its population eastward.

Indigenous Aryanism

The approximate extent of Āryāvarta during the late Vedic period (ca. 1100–500 BCE). Aryavarta was limited to northwest India and the western Ganges plain, while Greater Magadha in the east was habitated by non-Vedic Indo-Aryans, who gave rise to Jainism and Buddhism.

Indian nationalistic opponents of the Indo-Aryan migration question it, and instead promote Indigenous Aryanism, claiming that speakers of Indo-Iranian languages (sometimes called Aryan languages) are "indigenous" to the Indian subcontinent. Indigenous Aryanism has no support in contemporary mainstream scholarship, as it is contradicted by a broad range of research on Indo-European migrations.

Search This Blog

Saturday, April 30, 2022

Radar signal characteristics

The radar signal in the time domain

Carrier

Pulse width

Pulse repetition frequency (PRF)

Staggered PRF

Clutter

Sensitivity time control (STC)

Unambiguous range

The radar signal in the frequency domain

Pulse profiling

Unambiguous velocity

Typical system parameters

Indo-Aryan migrations

Fundamentals

Linguistics: relationships between languages

Archaeology: migrations from the steppe Urheimat

Anthropology: elite recruitment and language shift

Genetics: ancient ancestry and multiple gene flows

Literary research: similarities, geography, and references to migration

Ecological studies: widespread drought, urban collapse, and pastoral migrations

Development of the theory

Similarities between Sanskrit, Persian, Greek

Homeland

Aryan "race"

"Aryan invasion"

Aryan migration

Future directions

Linguistics: relationships between languages

Comparative method

Proto-Indo-European

Arguments against an Indian origin of proto-Indo-European

Diversity

Dialectal variation

Substrate influence

Archaeology: migrations from the steppe Urheimat

Stages of migrations

Diffusion from the "Urheimat"

Sequence of migrations

Central Asia: formation of Indo-Iranians

Sintashta-Petrovka culture

Andronovo culture

Bactria-Margiana culture

Two waves of Indo-Iranian migration

First wave – Indo-Aryan migrations

Mittani

North-India – Vedic culture

Multiple waves of migration into northern India

Gandhara grave culture and Ochre Coloured Pottery culture

Spread of Vedic-Brahmanic culture

Indus Valley Civilization

Decline of Indus Valley Civilisation

Continuity of Indus Valley civilization

Relation with Indo-Aryan migrations

Inner Asia – Wusun and Yuezhi

Second wave – Iranians

Anthropology: elite recruitment and language shift

Elite dominance

Renfrew: models of "linguistic replacement"

David Anthony: elite recruitment

Michael Witzel: small groups and acculturation

Salmons: systematic changes in community structure

Genetics: ancient ancestry and multiple gene flows

Ancestral groups

Common maternal ancestry

"Ancestral North Indians" and "Ancestral South Indians"

Additional components

Male-mediated migration

North-south cline

Scenarios

Pre-agricultural migrations

Agricultural migrations

Near-Eastern migrations

Haplogroup R1a and related haplogroups

Indo-European migrations

Genetic impact of Indo-Aryan migrations

Origins of R1a-Z93

Literary research: similarities, geography, and references to migration