
Wednesday, May 22, 2019

Proto-Indo-European language

From Wikipedia, the free encyclopedia

Proto-Indo-European (PIE) is the linguistic reconstruction of the ancient common ancestor of the Indo-European languages, the most widely spoken language family in the world.

Far more work has gone into reconstructing PIE than any other proto-language, and it is by far the best understood of all proto-languages of its age. The vast majority of linguistic work during the 19th century was devoted to the reconstruction of PIE or its daughter proto-languages (such as Proto-Germanic and Proto-Indo-Iranian), and most of the modern techniques of linguistic reconstruction (such as the comparative method) were developed as a result. These methods supply all current knowledge concerning PIE, since there is no written record of the language.

PIE is estimated to have been spoken as a single language from 4500 BC to 2500 BC during the Late Neolithic to Early Bronze Age, though estimates vary by more than a thousand years. According to the prevailing Kurgan hypothesis, the original homeland of the Proto-Indo-Europeans may have been in the Pontic–Caspian steppe of Eastern Europe. The linguistic reconstruction of PIE has also provided insight into the culture and religion of its speakers.

As speakers of Proto-Indo-European became isolated from each other through the Indo-European migrations, the regional dialects of Proto-Indo-European spoken by the various groups diverged from each other, as each dialect underwent different shifts in pronunciation (the Indo-European sound laws), morphology, and vocabulary. Thus these dialects slowly but eventually transformed into the known ancient Indo-European languages. From there, further linguistic divergence led to the evolution of their current descendants, the modern Indo-European languages. Today, the descendant languages, or daughter languages, of PIE with the most native speakers are Spanish, English, Portuguese, Hindustani (Hindi and Urdu), Bengali, Russian, Punjabi, German, Persian, French, Italian and Marathi. Hundreds of other living descendants of PIE include languages as diverse as Albanian (gjuha shqipe), Kurdish (کوردی‎), Nepali (खस भाषा), Tsakonian (τσακώνικα), Ukrainian (українська мова), and Welsh (Cymraeg).

PIE is believed to have had an elaborate system of morphology that included inflectional suffixes (analogous to English life, lives, life's, lives') as well as ablaut (vowel alternations, for example, as preserved in English sing, sang, sung) and accent. PIE nominals and pronouns had a complex system of declension, and verbs similarly had a complex system of conjugation. The PIE phonology, particles, numerals, and copula are also well reconstructed.

An asterisk is used to mark reconstructed words, such as *wódr̥ 'water', *ḱwṓ 'dog', or *tréyes 'three (masculine)'; these forms are the reconstructed ancestors of the modern English words water, hound, and three.

Development of the hypothesis

No direct evidence of PIE exists – scholars have reconstructed PIE from its present-day descendants using the comparative method.

The comparative method follows the Neogrammarian rule: the Indo-European sound laws apply without exception. The method compares languages and uses the sound laws to find a common ancestor. For example, compare the pairs of words in Italian and English: piede and foot, padre and father, pesce and fish. Since there is a consistent correspondence of the initial consonants that emerges far too frequently to be coincidental, one can assume that these languages stem from a common parent language.
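The correspondence check described above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: it uses just the three word pairs from the text, it compares spelling rather than phonemes, and the function name `initial_correspondences` is hypothetical. Real comparative work relies on far larger word lists and on phonetic transcriptions.

```python
from collections import Counter

# Italian/English word pairs taken from the paragraph above.
PAIRS = [("piede", "foot"), ("padre", "father"), ("pesce", "fish")]


def initial_correspondences(pairs):
    """Count how often each pair of word-initial letters co-occurs."""
    return Counter((a[0], b[0]) for a, b in pairs)


counts = initial_correspondences(PAIRS)
for (italian, english), n in counts.most_common():
    print(f"Italian {italian}- : English {english}- ({n} pairs)")
```

With these three pairs, the only correspondence found is Italian p- against English f-, which is the pattern the paragraph points to.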

Many consider William Jones, an Anglo-Welsh philologist and puisne judge in Bengal, to have begun Indo-European studies in 1786, when he postulated the common ancestry of Sanskrit, Latin, and Greek. However, he was not the first to make this observation. In the 1500s, European visitors to the Indian subcontinent became aware of similarities between Indo-Iranian languages and European languages, and as early as 1653 Marcus Zuerius van Boxhorn had published a proposal for a proto-language ("Scythian") for the following language families: Germanic, Romance, Greek, Baltic, Slavic, Celtic, and Iranian. In a memoir sent to the Académie des Inscriptions et Belles-Lettres in 1767 Gaston-Laurent Coeurdoux, a French Jesuit who spent all his life in India, had specifically demonstrated the analogy between Sanskrit and European languages. From the perspective of current academic consensus, Jones' work was less accurate than his predecessors', as he erroneously included Egyptian, Japanese and Chinese in the Indo-European languages, while omitting Hindi.

In 1818 Rasmus Christian Rask elaborated the set of correspondences to include other Indo-European languages, such as Sanskrit and Greek, and the full range of consonants involved. In 1816 Franz Bopp published On the System of Conjugation in Sanskrit in which he investigated a common origin of Sanskrit, Persian, Greek, Latin, and German. In 1833 he began publishing the Comparative Grammar of Sanskrit, Zend, Greek, Latin, Lithuanian, Old Slavic, Gothic, and German.

In 1822 Jacob Grimm formulated what became known as Grimm's law as a general rule in his Deutsche Grammatik. Grimm showed correlations between the Germanic and other Indo-European languages and demonstrated that sound change systematically transforms all words of a language. From the 1870s the Neogrammarians proposed that sound laws have no exceptions, as shown in Verner's law, published in 1876, which resolved apparent exceptions to Grimm's law by exploring the role that accent (stress) had played in language change.

August Schleicher's A Compendium of the Comparative Grammar of the Indo-European, Sanskrit, Greek and Latin Languages (1874–77) represented an early attempt to reconstruct the proto-Indo-European language.

By the early 1900s Indo-Europeanists had developed well-defined descriptions of PIE which scholars still accept today. Later, the discovery of the Anatolian and Tocharian languages added to the corpus of descendant languages. A new principle won wide acceptance in the laryngeal theory, which explained irregularities in the linguistic reconstruction of Proto-Indo-European phonology as the effects of hypothetical sounds which had disappeared from all documented languages, but which were later observed in excavated cuneiform tablets in Anatolian.

Julius Pokorny's Indogermanisches etymologisches Wörterbuch ("Indo-European Etymological Dictionary", 1959) gave a detailed, though conservative, overview of the lexical knowledge then accumulated. Kuryłowicz's 1956 Apophonie gave a better understanding of Indo-European ablaut. From the 1960s, knowledge of Anatolian became robust enough to establish its relationship to PIE. 

Classification of Indo-European languages. Red: Extinct languages. White: categories or unattested proto-languages. Left half: centum languages; right half: satem languages

Historical and geographical setting

Scholars have proposed multiple hypotheses about when, where, and by whom PIE was spoken. The Kurgan hypothesis, first put forward in 1956 by Marija Gimbutas, has become the most popular of these. It proposes that the people of the Yamna culture, associated with the kurgans (burial mounds) on the Pontic–Caspian steppe north of the Black Sea, were the original speakers of PIE.

According to the theory, PIE became widespread because its speakers from the Kurgan culture could migrate into a vast area of Europe and Asia thanks to technologies such as the domestication of the horse, herding, and the use of wheeled vehicles.

The people of these cultures were nomadic pastoralists, who, according to the model, by the early 3rd millennium BC had expanded throughout the Pontic–Caspian steppe and into Eastern Europe.

Other theories include the Anatolian hypothesis, the Armenia hypothesis, the Paleolithic Continuity Theory, and the indigenous Aryans theory.

Due to early language contact, there are some lexical similarities between the Proto-Kartvelian and Proto-Indo-European languages.

An overview map summarises theories presented above.

Subfamilies (clades)

The following are listed by their theoretical glottochronological development:

Subfamily clades

| Subfamily | Description | Modern descendants |
|---|---|---|
| Proto-Anatolian | All now extinct, the best attested being the Hittite language. | None |
| Proto-Tocharian | An extinct branch known from manuscripts dating from the 6th to the 8th century AD, which were found in north-west China. | None |
| Proto-Italic | This included many languages, but only descendants of Latin survive. | Portuguese and Galician, Occitan, Spanish, Catalan, French, Italian, Romanian, Aromanian, Rhaeto-Romance, Sardinian |
| Proto-Celtic | The ancestor of modern Celtic languages. Once spoken across Europe, but now mostly confined to its northwestern edge. | Irish, Scottish Gaelic, Welsh, Breton, Cornish, Manx |
| Proto-Germanic | The reconstructed proto-language of the Germanic languages. It developed into three branches: West Germanic, East Germanic (now extinct), and North Germanic. | English, German, Afrikaans, Dutch, Norwegian, Danish, Swedish, Frisian, Icelandic, Faroese |
| Proto-Balto-Slavic | Branched into the Baltic languages and the Slavic languages. | Baltic: Latvian and Lithuanian; Slavic: Russian, Ukrainian, Belarusian, Polish, Czech, Slovak, Serbo-Croatian, Bulgarian, Slovenian, Macedonian |
| Proto-Indo-Iranian | Branched into the Indo-Aryan, Iranian and Nuristani languages. | Indic: Hindustani, Bengali, Sinhala, Punjabi, Dardic; Iranic: Persian, Pashto, Balochi, Kurdish, Zaza, Ossetian, Luri, Talyshi, Tati, Gilaki, Mazandarani, Semnani, Old Azeri (extinct); Nuristani |
| Proto-Armenian | | Eastern Armenian, Western Armenian |
| Proto-Greek | | Modern Greek, Romeyka, Tsakonian |
| Proto-Albanian | Albanian is the only modern representative of a distinct branch of the Indo-European language family. | Albanian |

Common subgroups of Indo-European languages which are proposed include Italo-Celtic, Graeco-Aryan, Graeco-Armenian, Graeco-Phrygian, Daco-Thracian, and Thraco-Illyrian.

Marginally attested languages

The Lusitanian language is a marginally attested language found in the area of modern Portugal.

The Paleo-Balkan languages, which occur in or near the Balkan peninsula, do not appear to be members of any of the subfamilies of PIE but are so poorly attested that proper classification of them is not possible. Albanian and Greek are the only surviving Indo-European languages in the group.

Phonology

Proto-Indo-European phonology has been reconstructed in some detail. Notable features of the most widely accepted (but not uncontroversial) reconstruction include:
  • three series of stop consonants reconstructed as voiceless, voiced, and breathy voiced;
  • sonorant consonants that could be used syllabically;
  • three so-called laryngeal consonants, whose exact pronunciation is not well-established but which are believed to have existed in part based on their visible effects on adjacent sounds;
  • the fricative /s/; and
  • a five-vowel system of which /e/ and /o/ were the most frequently occurring vowels.
The Proto-Indo-European accent is reconstructed today as having had variable lexical stress, which could appear on any syllable and whose position often varied among different members of a paradigm (e.g. between singular and plural of a verbal paradigm). Stressed syllables received a higher pitch; therefore it is often said that PIE had a pitch accent. The location of the stress is associated with ablaut variations, especially between normal-grade vowels (/e/ and /o/) and zero-grade (i.e. lack of a vowel), but not entirely predictable from it.

The accent is best preserved in Vedic Sanskrit and (in the case of nouns) Ancient Greek, and indirectly attested in a number of phenomena in other IE languages. To account for mismatches between the accent of Vedic Sanskrit and Ancient Greek, as well as a few other phenomena, a few historical linguists prefer to reconstruct PIE as a tone language where each morpheme had an inherent tone; the sequence of tones in a word then evolved, according to that hypothesis, into the placement of lexical stress in different ways in different IE branches.

Morphology

Root

Proto-Indo-European roots were affix-lacking morphemes which carried the core lexical meaning of a word and were used to derive related words (e.g., "-friend-" in the English words "befriend", "friends", and "friend" by itself). Proto-Indo-European was a fusional language, in which inflectional morphemes signalled the grammatical relationships between words. This dependence on inflectional morphemes means that roots in PIE, unlike those found in English, were rarely found by themselves. A root plus a suffix formed a word stem, and a word stem plus a desinence (usually an ending) formed a word.

Ablaut

Many morphemes in Proto-Indo-European had short e as their inherent vowel; the Indo-European ablaut is the change of this short e to short o, long e (ē), long o (ō), or no vowel. This variation in vowels occurred both within inflectional morphology (e.g., different grammatical forms of a noun or verb may have different vowels) and derivational morphology (e.g., a verb and an associated abstract verbal noun may have different vowels).

Categories that PIE distinguished through ablaut were often also identifiable by contrasting endings, but the loss of these endings in some later Indo-European languages has led them to use ablaut alone to identify grammatical categories, as in the Modern English words sing, sang, sung.

Noun

Proto-Indo-European nouns are declined for eight or nine cases:
  • nominative: marks the subject of a verb, such as They in They ate. Words that follow a linking verb and rename the subject of that verb also use the nominative case. Thus, both They and linguists are in the nominative case in They are linguists. The nominative is the dictionary form of the noun.
  • accusative: used for the direct object of a transitive verb.
  • genitive: marks a noun as modifying another noun.
  • dative: used to indicate the indirect object of a transitive verb, such as Jacob in Maria gave Jacob a drink.
  • instrumental: marks the instrument or means by, or with which, the subject achieves or accomplishes an action. It may be either a physical object or an abstract concept.
  • ablative: used to express motion away from something.
  • locative: corresponds vaguely to the English prepositions in, on, at, and by.
  • vocative: used for a word that identifies an addressee. A vocative expression is one of direct address where the identity of the party spoken to is set forth expressly within a sentence. For example, in the sentence, "I don't know, John", John is a vocative expression that indicates the party being addressed.
  • allative: used as a type of locative case that expresses movement towards something. Only the Anatolian languages maintain this case, and it may not have existed in Proto-Indo-European at all.
There were three grammatical genders:
  • masculine
  • feminine
  • neuter

Pronoun

Proto-Indo-European pronouns are difficult to reconstruct, owing to their variety in later languages. PIE had personal pronouns in the first and second grammatical person, but not the third person, where demonstrative pronouns were used instead. The personal pronouns had their own unique forms and endings, and some had two distinct stems; this is most obvious in the first person singular where the two stems are still preserved in English I and me. There were also two varieties for the accusative, genitive and dative cases, a stressed and an enclitic form.

Personal pronouns

| Case | First person singular | First person plural | Second person singular | Second person plural |
|---|---|---|---|---|
| Nominative | *h₁eǵ(oH/Hom) | *wei | *tuH | *yuH |
| Accusative | *h₁mé, *h₁me | *nsmé, *nōs | *twé | *usmé, *wōs |
| Genitive | *h₁méne, *h₁moi | *ns(er)o-, *nos | *tewe, *toi | *yus(er)o-, *wos |
| Dative | *h₁méǵʰio, *h₁moi | *nsmei, *ns | *tébʰio, *toi | *usmei |
| Instrumental | *h₁moí | *nsmoí | *toí | *usmoí |
| Ablative | *h₁med | *nsmed | *tued | *usmed |
| Locative | *h₁moí | *nsmi | *toí | *usmi |

Verb

Proto-Indo-European verbs, like the nouns, exhibited a system of ablaut. The most basic categorisation for the Indo-European verb was grammatical aspect. Verbs were classed as:
  • stative: verbs that depict a state of being
  • imperfective: verbs depicting ongoing, habitual or repeated action
  • perfective: verbs depicting a completed action or actions viewed as an entire process.
Verbs had at least four grammatical moods:
  • indicative: indicates that something is a statement of fact; in other words, to express what the speaker considers to be a known state of affairs, as in declarative sentences.
  • imperative: forms commands or requests, including the giving of prohibition or permission, or any other kind of advice or exhortation.
  • subjunctive: used to express various states of unreality such as wish, emotion, possibility, judgment, opinion, obligation, or action that has not yet occurred.
  • optative: indicates a wish or hope. It is similar to the cohortative mood and is closely related to the subjunctive mood.
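Verbs had two grammatical voices:
  • active: used in a clause whose subject expresses the agent of the main verb.
  • mediopassive: covering the middle voice and the passive voice.
Verbs had three grammatical persons: first, second and third.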

Verbs had three grammatical numbers:
  • singular
  • dual: referring to precisely two of the entities (objects or persons) identified by the noun or pronoun.
  • plural: a number other than singular or dual.
Verbs were also marked by a highly developed system of participles, one for each combination of tense and voice, and an assorted array of verbal nouns and adjectival formations. 

The following table shows a possible reconstruction of the PIE verb endings from Sihler, which largely represents the current consensus among Indo-Europeanists. 


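Verb endings after Sihler (1995)

| Number | Person | Athematic | Thematic |
|---|---|---|---|
| Singular | 1st | *-mi | *-oh₂ |
| Singular | 2nd | *-si | *-esi |
| Singular | 3rd | *-ti | *-eti |
| Dual | 1st | *-wos | *-owos |
| Dual | 2nd | *-th₁es | *-eth₁es |
| Dual | 3rd | *-tes | *-etes |
| Plural | 1st | *-mos | *-omos |
| Plural | 2nd | *-te | *-ete |
| Plural | 3rd | *-nti | *-onti |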
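
As a rough illustration of how the endings above combine with a stem, the following Python sketch stores them in a lookup table and concatenates one onto a stem. The function name `attach_ending` is hypothetical, and the stem "bʰér" is an illustrative assumption that does not appear in the text above. The sketch only concatenates strings; it does not model ablaut or accent shifts.

```python
# Endings copied from the table above (after Sihler 1995).
ENDINGS = {
    "athematic": {
        ("singular", 1): "mi", ("singular", 2): "si", ("singular", 3): "ti",
        ("dual", 1): "wos", ("dual", 2): "th₁es", ("dual", 3): "tes",
        ("plural", 1): "mos", ("plural", 2): "te", ("plural", 3): "nti",
    },
    "thematic": {
        ("singular", 1): "oh₂", ("singular", 2): "esi", ("singular", 3): "eti",
        ("dual", 1): "owos", ("dual", 2): "eth₁es", ("dual", 3): "etes",
        ("plural", 1): "omos", ("plural", 2): "ete", ("plural", 3): "onti",
    },
}


def attach_ending(stem, conjugation, number, person):
    """Concatenate a stem with the tabulated ending (no ablaut or accent shifts)."""
    return "*" + stem + ENDINGS[conjugation][(number, person)]


# "bʰér" is an illustrative stem, not taken from the text above.
print(attach_ending("bʰér", "thematic", "singular", 3))  # *bʰéreti
```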

Numbers

Proto-Indo-European numerals are generally reconstructed as follows.


| Numeral | Reconstruction (Sihler) |
|---|---|
| one | *(H)óynos/*(H)óywos/*(H)óyk(ʷ)os; *sḗm |
| two | *d(u)wóh₁ |
| three | *tréyes (full grade), *tri- (zero grade) |
| four | *kʷetwóres (o-grade), *kʷ(e)twr̥- (zero grade) |
| five | *pénkʷe |
| six | *s(w)éḱs; originally perhaps *wéḱs |
| seven | *septḿ̥ |
| eight | *oḱtṓ(w) or *h₃eḱtṓ(w) |
| nine | *h₁néwn̥ |
| ten | *déḱm̥(t) |

Rather than specifically 100, *ḱm̥tóm may originally have meant "a large number".

Particle

Proto-Indo-European particles could be used both as adverbs and postpositions, like *upo "under, below". The postpositions became prepositions in most daughter languages. Other reconstructible particles include negators (*ne, *mē), conjunctions (*kʷe "and", *wē "or" and others) and an interjection (*wai!, an expression of woe or agony).

Derivational morphology

Proto-Indo-European employed various means of deriving words from other words, or directly from verb roots.

Internal derivation

Internal derivation was a process that derived new words through changes in accent and ablaut alone. It was not as productive as external (affixing) derivation, but is firmly established by the evidence of various later languages.

Possessive adjectives

Possessive or associated adjectives could be created from nouns through internal derivation. Such words could be used directly as adjectives, or they could be turned back into a noun without any change in morphology, indicating someone or something characterised by the adjective. They could also be used as the second element of a compound. If the first element was a noun, this created an adjective that resembled a present participle in meaning, e.g. "having much rice" or "cutting trees". When turned back into nouns, such compounds were Bahuvrihis or semantically resembled agent nouns.

In thematic stems, creating a possessive adjective involved shifting the accent one syllable to the right, for example:
  • *tómh₁-o-s "slice" (Greek tómos) > *tomh₁-ó-s "cutting" (i.e. "making slices"; Greek tomós) > *dr-u-tomh₁-ó-s "cutting trees" (Greek drutómos "woodcutter" with irregular accent).
  • *wólh₁-o-s "wish" (Sanskrit vára-) > *wolh₁-ó-s "having wishes" (Sanskrit vará- "suitor").
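
The accent shift in the thematic examples above can be mimicked on the written forms. The sketch below is a string manipulation under stated assumptions, not a linguistic model: it treats each hyphen-separated morpheme as a stand-in for a syllable, it assumes the accent is written as an acute, and the function name `shift_accent_right` is hypothetical.

```python
import unicodedata

ACUTE = "\u0301"  # combining acute accent
VOWELS = set("aeiou")


def shift_accent_right(form):
    """Move the acute accent to the first vowel of the next hyphen-separated segment.

    Assumes a form such as "tómh₁-o-s"; raises ValueError if no shift is possible.
    """
    parts = unicodedata.normalize("NFD", form).split("-")
    for i, part in enumerate(parts[:-1]):
        if ACUTE not in part:
            continue
        nxt = parts[i + 1]
        for j, ch in enumerate(nxt):
            if ch in VOWELS:
                parts[i] = part.replace(ACUTE, "")
                parts[i + 1] = nxt[: j + 1] + ACUTE + nxt[j + 1:]
                return unicodedata.normalize("NFC", "-".join(parts))
        break  # accented segment found, but the next segment has no vowel
    raise ValueError(f"cannot shift accent in {form!r}")


print(shift_accent_right("*tómh₁-o-s"))
print(shift_accent_right("*wólh₁-o-s"))
```

Applied to the two base nouns from the list, this yields the written forms of the derived adjectives given there.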
In athematic stems, there was a change in the accent/ablaut class. The known four classes followed an ordering, in which a derivation would shift the class one to the right:
acrostatic → proterokinetic → hysterokinetic → amphikinetic
The reason for this particular ordering of the classes in derivation is not known. Some examples:
  • Acrostatic *krót-u-s ~ *krét-u-s "strength" (Sanskrit krátu-) > proterokinetic *krét-u-s ~ *kr̥t-éw-s "having strength, strong" (Greek kratús).
  • Hysterokinetic *ph₂-tḗr ~ *ph₂-tr-és "father" (Greek patḗr) > amphikinetic *h₁su-péh₂-tōr ~ *h₁su-ph₂-tr-és "having a good father" (Greek eupátōr).
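
The class ordering used in athematic derivation amounts to a one-step shift along a fixed list, which can be written as a minimal Python sketch. The function name `derive_class` is hypothetical, and raising an error for the last class is an assumption of this sketch: the text does not say what happens when an amphikinetic stem is the base.

```python
# Accent/ablaut classes in the order given above.
CLASSES = ["acrostatic", "proterokinetic", "hysterokinetic", "amphikinetic"]


def derive_class(base_class):
    """Return the accent/ablaut class one step to the right of `base_class`."""
    i = CLASSES.index(base_class)
    if i == len(CLASSES) - 1:
        # Assumption of this sketch: the text gives no rule for this case.
        raise ValueError("no class to the right of amphikinetic")
    return CLASSES[i + 1]


print(derive_class("acrostatic"))      # proterokinetic
print(derive_class("hysterokinetic"))  # amphikinetic
```

The two calls correspond to the two examples in the list: the acrostatic noun for "strength" yields a proterokinetic adjective, and the hysterokinetic noun for "father" yields an amphikinetic compound.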

Vrddhi

A vrddhi derivation, named after the Sanskrit grammatical term, signified "of, belonging to, descended from". It was characterised by "upgrading" the root grade, from zero to full (e) or from full to lengthened (ē). When upgrading from zero to full grade, the vowel could sometimes be inserted in the "wrong" place, creating a different stem from the original full grade.

Examples:
  • full grade *swéḱuro-s "father-in-law" (Vedic Sanskrit śváśura-) > lengthened grade *swēḱuró-s "relating to one's father-in-law" (Vedic śvāśura-, Old High German swāgur "brother-in-law").
  • (*dyḗw-s ~) zero grade *diw-és "sky" > full grade *deyw-o-s "god, sky god" (Vedic devás, Latin deus, etc.). Note the difference in vowel placement, *dyew- in the full-grade stem of the original noun but *deyw- in the vrddhi derivative.

Nominalization

Adjectives with accent on the thematic vowel could be turned into nouns by moving the accent back onto the root. A zero grade root could remain so, or be "upgraded" to full grade like in a vrddhi derivative. Some examples:
  • PIE *ǵn̥h₁-tó-s "born" (Vedic jātá-) > *ǵénh₁-to- "thing that is born" (German Kind).
  • Greek leukós "white" > leũkos "a kind of fish", literally "white one".
  • Vedic kṛṣṇá- "dark" > kṛ́ṣṇa- "dark one", also "antelope".
This kind of derivation is likely related to the possessive adjectives, and can be seen as essentially the reverse of it.

Syntax

The syntax of the older Indo-European languages has been studied in earnest since at least the late nineteenth century, by such scholars as Hermann Hirt and Berthold Delbrück. In the second half of the twentieth century, interest in the topic increased and led to reconstructions of Proto-Indo-European syntax.

Since all the early attested IE languages were inflectional, PIE is thought to have relied primarily on morphological markers, rather than word order, to signal syntactic relationships within sentences. Still, a default (unmarked) word order is thought to have existed in PIE. This was reconstructed by Jacob Wackernagel as being subject–verb–object (SVO), based on evidence in Vedic Sanskrit, and the SVO hypothesis still has some adherents, but as of 2015 the "broad consensus" among PIE scholars is that PIE would have been a subject–object–verb (SOV) language.

The SOV default word order with other orders used to express emphasis (e.g., verb–subject–object to emphasise the verb) is attested in Old Indic, Old Iranian, Old Latin and Hittite, while traces of it can be found in the enclitic personal pronouns of the Tocharian languages. A shift from OV to VO order is posited to have occurred in late PIE since many of the descendant languages have this order: modern Greek, Romance and Albanian prefer SVO, Insular Celtic has VSO as the default order, and even the Anatolian languages show some signs of this word order shift. The inconsistent order preference in Baltic, Slavic and Germanic can be attributed to contact with outside OV languages.

In popular culture

The Ridley Scott film Prometheus features an android named "David" (played by Michael Fassbender) who learns Proto-Indo-European to communicate with the "Engineer", an extraterrestrial whose race may have created humans. David practices PIE by reciting Schleicher's fable and goes on to attempt communication with the Engineer through PIE. Linguist Dr Anil Biltoo created the film's reconstructed dialogue and had an onscreen role teaching David Schleicher's fable.

Proto-Indo-Europeans

From Wikipedia, the free encyclopedia

The Proto-Indo-Europeans were a hypothetical prehistoric people of Eurasia who spoke Proto-Indo-European (PIE), the ancestor of the Indo-European languages according to linguistic reconstruction.
 
Knowledge of them comes chiefly from that linguistic reconstruction, along with material evidence from archaeology and archaeogenetics. The Proto-Indo-Europeans likely lived during the late Neolithic, or roughly the 4th millennium BC. Mainstream scholarship places them in the Pontic–Caspian steppe zone in Eastern Europe (present day Ukraine and Russia). Some archaeologists would extend the time depth of PIE to the middle Neolithic (5500 to 4500 BC) or even the early Neolithic (7500 to 5500 BC), and suggest alternative location hypotheses.

By the early second millennium BC, offshoots of the Proto-Indo-Europeans had reached far and wide across Eurasia, including Anatolia (Hittites), the Aegean (the ancestors of Mycenaean Greece), the north of Europe (Corded Ware culture), the edges of Central Asia (Yamnaya culture), and southern Siberia (Afanasievo culture).

Culture

Using linguistic reconstruction, hypothetical features of the Proto-Indo-European language are deduced. Assuming that these linguistic features reflect culture and environment of the Proto-Indo-Europeans, the following cultural and environmental traits are widely proposed:
The Proto-Indo-Europeans had domesticated horses (*eḱwos, cf. Latin equus). The cow (*gʷous) played a central role in religion and mythology as well as in daily life. A man's wealth would have been measured by the number of his animals (small livestock), *peḱu (cf. English fee, Latin pecunia).

As for technology, reconstruction indicates a culture of the late Neolithic bordering on the early Bronze Age, with tools and weapons very likely composed of "natural bronze" (i.e., made from copper ore naturally rich in silicon or arsenic). Silver and gold were known, but not silver smelting (as PIE has no word for lead, a by-product of silver smelting), thus suggesting that silver was imported. Sheep were kept for wool, and textiles were woven.

Burials in barrows or tomb chambers apply to the Kurgan culture, in accordance with the original version of the Kurgan hypothesis, but not to the previous Sredny Stog culture, which is also generally associated with PIE. Important leaders would have been buried with their belongings in kurgans.

Many Indo-European societies know a threefold division of priests, a warrior class, and a class of peasants or husbandmen. Georges Dumézil has suggested such a division for Proto-Indo-European society. 

If there was a separate class of warriors, traces of initiation rites in several Indo-European societies suggest that this group would have identified with wolves.

History of research

Researchers have made many attempts to identify particular prehistoric cultures with the Proto-Indo-European-speaking peoples, but all such theories remain speculative. Any attempt to identify an actual people with an unattested language depends on a sound reconstruction of that language that allows identification of cultural concepts and environmental factors associated with particular cultures (such as the use of metals, agriculture vs. pastoralism, geographically distinctive plants and animals, etc.).
 
The scholars of the 19th century who first tackled the question of the Indo-Europeans' original homeland (also called Urheimat, from German), had essentially only linguistic evidence. They attempted a rough localization by reconstructing the names of plants and animals (importantly the beech and the salmon) as well as the culture and technology (a Bronze Age culture centered on animal husbandry and having domesticated the horse). The scholarly opinions became basically divided between a European hypothesis, positing migration from Europe to Asia, and an Asian hypothesis, holding that the migration took place in the opposite direction.

In the early 20th century, the question became associated with the expansion of a supposed "Aryan race," a fallacy promoted during the expansion of European empires and the rise of "scientific racism." The question remains contentious within some flavours of ethnic nationalism (see also Indigenous Aryans). 

A series of major advances occurred in the 1970s due to the convergence of several factors. First, the radiocarbon dating method (invented in 1949) had become sufficiently inexpensive to be applied on a mass scale. Second, through dendrochronology (tree-ring dating), prehistorians could calibrate radiocarbon dates to a much higher degree of accuracy. Finally, before the 1970s, parts of Eastern Europe and Central Asia had been off limits to Western scholars, while non-Western archaeologists did not have access to publication in Western peer-reviewed journals. The pioneering work of Marija Gimbutas, assisted by Colin Renfrew, at least partly addressed this problem by organizing expeditions and arranging for more academic collaboration between Western and non-Western scholars.

The Kurgan hypothesis, as of 2017 the most widely held theory, depends on linguistic and archaeological evidence, but is not universally accepted. It suggests PIE origin in the Pontic-Caspian steppe during the Chalcolithic. A minority of scholars prefer the Anatolian hypothesis, suggesting an origin in Anatolia during the Neolithic. Other theories (Armenian hypothesis, Out of India theory, Paleolithic Continuity Theory, Balkan hypothesis) have only marginal scholarly support.

In regard to terminology, in the 19th and early 20th centuries, the term Aryan was used to refer to the Proto-Indo-Europeans and their descendants. However, Aryan more properly applies to the Indo-Iranians, the Indo-European branch that settled parts of the Middle East and South Asia, as only Indic and Iranian languages explicitly affirm the term as a self-designation referring to the entirety of their people, whereas the same Proto-Indo-European root (*aryo-) is the basis for Greek and Germanic word forms which seem only to denote the ruling elite of Proto-Indo-European (PIE) society. In fact, the most accessible evidence available confirms only the existence of a common, but vague, socio-cultural designation of "nobility" associated with PIE society, such that Greek socio-cultural lexicon and Germanic proper names derived from this root remain insufficient to determine whether the concept was limited to the designation of an exclusive, socio-political elite, or whether it could possibly have been applied in the most inclusive sense to an inherent and ancestral "noble" quality which allegedly characterized all ethnic members of PIE society. Only the latter could have served as a true and universal self-designation for the Proto-Indo-European people. 

By the early twentieth century this term had come to be widely used in a racist context referring to a hypothesized white, blond and blue-eyed master race, culminating with the pogroms of the Nazis in Europe. Subsequently, the term Aryan as a general term for Indo-Europeans has been largely abandoned by scholars (though the term Indo-Aryan is still used to refer to the branch that settled in Southern Asia).

Urheimat hypotheses

Scheme of Indo-European migrations from ca. 4000 to 1000 BC according to the Kurgan hypothesis. The magenta area corresponds to the assumed Urheimat (Samara culture, Sredny Stog culture). The red area corresponds to the area which may have been settled by Indo-European-speaking peoples up to ca. 2500 BC; the orange area to 1000 BC.
 
According to some archaeologists, PIE speakers cannot be assumed to have been a single, identifiable people or tribe, but were a group of loosely related populations ancestral to the later, still partially prehistoric, Bronze Age Indo-Europeans. This view is held especially by those archaeologists who posit an original homeland of vast extent and immense time depth. However, this view is not shared by linguists, as proto-languages, like all languages before modern transport and communication, occupied small geographical areas over a limited time span, and were spoken by a set of close-knit communities—a tribe in the broad sense.

Researchers have put forward a great variety of proposed locations for the first speakers of Proto-Indo-European. Few of these hypotheses have survived scrutiny by academic specialists in Indo-European studies sufficiently well to be included in modern academic debate.

Steppe theory

In 1956 Marija Gimbutas (1921–1994) first proposed the Kurgan hypothesis. The name originates from the kurgans (burial mounds) of the Eurasian steppes. The hypothesis suggests that the Indo-Europeans, a nomadic culture of the Pontic-Caspian steppe (now part of Eastern Ukraine and Southern Russia), expanded in several waves during the 3rd millennium BC. Their expansion coincided with the taming of the horse. Leaving archaeological signs of their presence, they subjugated the peaceful European Neolithic farmers of Gimbutas' Old Europe. As Gimbutas' beliefs evolved, she put increasing emphasis on the patriarchal, patrilinear nature of the invading culture, sharply contrasting it with the supposedly egalitarian, if not matrilinear, culture of the invaded, to the point of formulating what was essentially a feminist archaeology. A modified form of this theory by J. P. Mallory (born 1945), dating the migrations earlier (to around 3500 BC) and putting less insistence on their violent or quasi-military nature, remains the most widely accepted view of the Proto-Indo-European expansion.

Near-Eastern origins

Armenian hypothesis

The Armenian hypothesis, based on the glottalic theory, suggests that the Proto-Indo-European language was spoken during the 4th millennium BC in the Armenian Highland. This Indo-Hittite model does not include the Anatolian languages in its scenario. The phonological peculiarities of PIE proposed in the glottalic theory would be best preserved in the Armenian language and the Germanic languages, the former assuming the role of the dialect which remained in situ, implied to be particularly archaic in spite of its late attestation. Proto-Greek would be practically equivalent to Mycenaean Greek and would date to the 17th century BC, closely associating Greek migration to Greece with the Indo-Aryan migration to India at about the same time (viz., Indo-European expansion at the transition to the Late Bronze Age, including the possibility of Indo-European Kassites). The Armenian hypothesis argues for the latest possible date of Proto-Indo-European (sans Anatolian), a full millennium later than the mainstream Kurgan hypothesis. In this, it figures as an opposite to the Anatolian hypothesis, in spite of the geographical proximity of the respective Urheimaten suggested, diverging from the time-frame suggested there by a full three millennia.

Zagros mountains

Bernard Sergent associates the Indo-European language family with certain archaeological cultures in Southern Russia, and he reconstructs an Indo-European religion (relying on the method of Georges Dumézil). He writes that the lithic assemblage of the first Kurgan culture in Ukraine (Sredni Stog II), which originated from the Volga and South Urals, recalls that of the Mesolithic-Neolithic sites to the east of the Caspian Sea, Dam Dam Chesme II and the cave of Djebel. Thus, he places the roots of Gimbutas' Kurgan cradle of the Indo-Europeans in a more southern cradle, and adds that the Djebel material is related to a Paleolithic material of Northwestern Iran, the Zarzian culture, dated 10,000-8,500 BC, and to the more ancient Kebaran of the Near East. He concludes that more than 10,000 years ago the Indo-Europeans were a small people grammatically, phonetically and lexically close to Semitic-Hamitic populations of the Near East.

Anatolian hypothesis

The Anatolian hypothesis proposes that the Indo-European languages spread peacefully into Europe from Asia Minor from around 7000 BC with the advance of farming (wave of advance). The leading propagator of the theory is Colin Renfrew. The culture of the Indo-Europeans as inferred by linguistic reconstruction raises difficulties for this theory, since early Neolithic cultures had neither the horse, nor the wheel, nor metal, terms for all of which are securely reconstructed for Proto-Indo-European. Renfrew dismisses this argument, comparing such reconstructions to a theory that the presence of the word "café" in all modern Romance languages implies that the ancient Romans had cafés too. The linguistic counter-argument is that, whereas there can be no clear Proto-Romance reconstruction of the word "café" according to historical linguistic methodology, words such as "wheel" in the Indo-European languages clearly point to an archaic form of the protolanguage. Another argument against Renfrew is that ancient Anatolia is known to have been inhabited by non-Indo-European Caucasian-speaking peoples, namely the Hattians, the Chalybes, and the Hurrians.

Genetics

The rise of archaeogenetic evidence, which uses genetic analysis to trace migration patterns, has also added new elements to the origins puzzle.

Kurgan hypothesis

R1b and R1a

According to three autosomal DNA studies, haplogroups R1b and R1a, now the most common in Europe (R1a is also very common in South Asia), would have expanded from the Russian steppes along with the Indo-European languages. The studies also detected an autosomal component present in modern Europeans which was not present in Neolithic Europeans, and which would have been introduced with the paternal lineages R1b and R1a, as well as the Indo-European languages. Studies which analysed ancient human remains in Ireland and Portugal suggest that R1b was introduced in these places along with autosomal DNA from the Eastern European steppes.

R1a1a

The subclade R1a1a (R-M17 or R-M198) is most commonly associated with Indo-European speakers, although the subclade R1b1a (P-297) has also been linked to the Centum branch of Indo-European. Data so far collected indicate that there are two widely separated areas of high frequency, one in Eastern Europe, around Poland and the Russian core, and the other in South Asia, around the Indo-Gangetic Plain. The possible historical and prehistoric reasons for this are the subject of ongoing discussion among population geneticists and genetic genealogists, and are considered to be of potential interest to linguists and archaeologists as well.

A large 2014 study by Underhill et al., using 16,244 individuals from over 126 populations from across Eurasia, concluded that there was compelling evidence that R1a-M420 originated in the vicinity of Iran. The mutations that characterize haplogroup R1a occurred ~10,000 years BP. Its defining mutation (M17) occurred about 10,000 to 14,000 years ago.

Ornella Semino et al. propose a postglacial (Holocene) spread of the R1a1 haplogroup from north of the Black Sea during the time of the Late Glacial Maximum, which was subsequently magnified by the expansion of the Kurgan culture into Europe and eastward.

Yamnaya culture

According to Jones et al. (2015) and Haak et al. (2015), the Yamnaya culture was exclusively R1b. Autosomal tests indicate that the Yamnaya people were the result of admixture between two different hunter-gatherer populations: distinctive "Eastern European hunter-gatherers" with high affinity to the Mal'ta-Buret' culture or other, closely related Ancient North Eurasian (ANE) people from Siberia and to Western Hunter Gatherers (WHG), and a population of "Caucasus hunter-gatherers" who probably arrived from somewhere in the Near East, probably the Caucasus or Iran. Each of those two populations contributed about half the Yamnaya DNA. According to co-author Dr. Andrea Manica of the University of Cambridge:

The question of where the Yamnaya come from has been something of a mystery up to now [...] we can now answer that, as we've found that their genetic make-up is a mix of Eastern European hunter-gatherers and a population from this pocket of Caucasus hunter-gatherers who weathered much of the last Ice Age in apparent isolation.

Eastern European hunter-gatherers

According to Haak et al. (2015), "Eastern European hunter-gatherers" who inhabited Russia were a distinctive population of hunter-gatherers with high affinity to a ~24,000-year-old Siberian from the Mal'ta-Buret' culture, or other, closely related Ancient North Eurasian (ANE) people from Siberia, and to the Western Hunter Gatherers (WHG). Remains of the "Eastern European hunter-gatherers" have been found in Mesolithic or early Neolithic sites in Karelia and Samara Oblast, Russia, and have been analysed. Three such male hunter-gatherer individuals have had their DNA results published. Each was found to belong to a different Y-DNA haplogroup: R1a, R1b, and J. R1b is also the most common Y-DNA haplogroup found among both the Yamnaya and modern-day Western Europeans.

Near East population

The Near East population was most likely hunter-gatherers from the Caucasus (CHG) or people related to the Iran Chalcolithic population with a CHG component.

Jones et al. (2015) analyzed genomes from males from western Georgia, in the Caucasus, from the Late Upper Palaeolithic (13,300 years old) and the Mesolithic (9,700 years old). These two males carried Y-DNA haplogroups J* and J2a. The researchers found that these Caucasus hunters were probably the source of the farmer-like DNA in the Yamnaya, as the Caucasians were distantly related to the Middle Eastern people who introduced farming in Europe. Their genomes showed that a continued mixture of the Caucasians with Middle Eastern populations took place up to 25,000 years ago, when the coldest period in the last Ice Age started.

According to Lazaridis et al. (2016), "a population related to the people of the Iran Chalcolithic contributed ~43% of the ancestry of early Bronze Age populations of the steppe." According to the same study, these Iranian Chalcolithic people were a mixture of "the Neolithic people of western Iran, the Levant, and Caucasus Hunter Gatherers." Lazaridis et al. (2016) also note that farming spread from two places in the Near East, namely the Levant and Iran, with people from Iran spreading to the steppe and South Asia.

Corded Ware

Haak et al. (2015) studied DNA from 94 skeletons from Europe and Russia aged between 3,000 and 8,000 years old. They concluded that about 4,500 years ago there was a major influx into Europe of Yamnaya culture people originating from the Pontic-Caspian steppe north of the Black Sea and that the DNA of Copper Age Europeans matched that of the Yamnaya. The genetic basis of a number of features of the Yamnaya people was ascertained: they were genetically tall (phenotypic height is determined by both genetics and environmental factors), overwhelmingly dark-eyed (brown), dark-haired and had a skin colour that was moderately light, though somewhat darker than that of the average modern European.

The four Corded Ware people could trace an astonishing three-quarters of their ancestry to the Yamnaya, according to the paper. That suggests a massive migration of Yamnaya people from their steppe homeland into eastern Europe about 4500 years ago when the Corded Ware culture began, perhaps carrying an early form of Indo-European language.

Andronovo

From the Corded Ware culture the Indo-Europeans spread eastward again, forming the Andronovo culture. Most researchers associate the Andronovo horizon with early Indo-Iranian languages, though it may have overlapped the early Uralic-speaking area at its northern fringe. According to Allentoft et al. (2015), the Sintashta culture and Andronovo culture are derived from the Corded Ware culture. According to Keyser et al. (2009), out of 10 human male remains assigned to the Andronovo horizon from the Krasnoyarsk region, nine possessed the R1a Y-chromosome haplogroup and one had the C-M130 haplogroup (xC3). Furthermore, 90% of the Bronze Age period mtDNA haplogroups were of west Eurasian origin, and the study determined that at least 60% of the individuals overall (out of the 26 Bronze and Iron Age human-remains samples from the study that could be tested) had dark hair and brown or green eyes.

A 2004 study also established that during the Bronze Age/Iron Age period, the majority of the population of Kazakhstan (part of the Andronovo culture during the Bronze Age) was of west Eurasian origin (with mtDNA haplogroups such as U, H, HV, T, I and W), and that prior to the 13th–7th centuries BC, all samples from Kazakhstan belonged to European lineages.

Anatolian hypothesis

Luigi Luca Cavalli-Sforza and Alberto Piazza argue that Renfrew and Gimbutas reinforce rather than contradict each other. Cavalli-Sforza (2000) states that "It is clear that, genetically speaking, peoples of the Kurgan steppe descended at least in part from people of the Middle Eastern Neolithic who immigrated there from Turkey." Piazza and Cavalli-Sforza (2006) state that
if the expansions began at 9,500 years ago from Anatolia and at 6,000 years ago from the Yamnaya culture region, then a 3,500-year period elapsed during their migration to the Volga-Don region from Anatolia, probably through the Balkans. There a completely new, mostly pastoral culture developed under the stimulus of an environment unfavourable to standard agriculture, but offering new attractive possibilities. Our hypothesis is, therefore, that Indo-European languages derived from a secondary expansion from the Yamnaya culture region after the Neolithic farmers, possibly coming from Anatolia and settled there, developing pastoral nomadism.

Spencer Wells suggests in a 2001 study that the origin, distribution and age of the R1a1 haplotype point to an ancient migration, possibly corresponding to the spread of the Kurgan people in their expansion across the Eurasian steppe around 3000 BC.

About his old teacher Cavalli-Sforza's proposal, Wells (2002) states that "there is nothing to contradict this model, although the genetic patterns do not provide clear support either", and instead argues that the evidence is much stronger for Gimbutas' model:
While we see substantial genetic and archaeological evidence for an Indo-European migration originating in the southern Russian steppes, there is little evidence for a similarly massive Indo-European migration from the Middle East to Europe. One possibility is that, as a much earlier migration (8,000 years old, as opposed to 4,000), the genetic signals carried by Indo-European-speaking farmers may simply have dispersed over the years. There is clearly some genetic evidence for migration from the Middle East, as Cavalli-Sforza and his colleagues showed, but the signal is not strong enough for us to trace the distribution of Neolithic languages throughout the entirety of Indo-European-speaking Europe.

Armenian hypothesis/Caucasus

David Reich (2018) argues that the most likely location of the Proto-Indo-European homeland is south of the Caucasus, because "ancient DNA from people who lived there matches what we would expect for a source population both for the Yamnaya and for ancient Anatolians". 

Computer-aided software engineering

From Wikipedia, the free encyclopedia ...