Map 2 Geographic limits of historically known Oceanic speakers and of presently documented Lapita sites (after Kirch_1997: 17, 54)
This is the third in a series of volumes bringing together the results of recent work on the lexicon of the Proto Oceanic (POc) language.1 POc is the immediate ancestor of the Oceanic subgroup of the Austronesian language family (see Map 1). This subgroup consists of all the Austronesian languages of Melanesia east of 136°E, together with those of Polynesia and (with two exceptions) those of Micronesia-more than 450 languages in all.2 Extensive arguments for the existence of Oceanic as a clearly demarcated branch of Austronesian were first put forward by Otto Dempwolff in the 1920s, and the validity of the subgroup is now recognised by virtually all scholars working in Austronesian historical linguistics.
The development and break-up of the POc language and speech community were stages in a truly remarkable chapter in human prehistory-the colonisation by Austronesian speakers of the Indo-Pacific region in the period after about 2000 BC. The outcome was the largest of the world’s well-established language families and (until the expansion of Indo-European after Columbus) the most widespread. The Austronesian family comprises more than 1,000 distinct languages. Its eastern and western outliers, Madagascar and Easter Island, are two-thirds of a world apart, and its northernmost extensions, Hawaii and Taiwan, are separated by 70 degrees of latitude from its southernmost outpost, Stewart Island in New Zealand.
It is likely that the divergence of Oceanic from its nearest relatives, which are the Austronesian languages spoken around Cenderawasih Bay and in South Halmahera (Blust 1978a), began when Austronesian speakers from the Cenderawasih Bay area moved eastwards along the north coast of New Guinea and into the Bismarck Archipelago. There is a strong school of opinion that associates the subsequent break-up of POc with the rapid colonisation of Island Melanesia and the central Pacific by bearers of the Lapita culture between about 1200 and 900 BC (see Map 2 and volume 2, chapter 2).
The present project aims to bring together a large corpus of lexical reconstructions for POc, with supporting cognate sets, organised according to semantic fields and using a stan- dard orthography for POc. We hope that this thesaurus will be a useful resource for culture historians, archaeologists and others interested in the prehistory of the Pacific region. The comparative lexical material should also be a rich source of data for various kinds of purely linguistic research, e.g. on semantic change and subgrouping in the more than 400 daughter languages.
Volume 1 of The lexicon of Proto Oceanic dealt with material culture. Volumes 2, 3 and 4 examine relevant sets of cognate terms in order to gain insights into how POc speakers viewed and exploited their environment. Volume 2 deallt with the geophysical or inanimate environ- ment, the present volume treats plants and volume 4 animals. Volume 5, as it is planned at the time of writing, will deal with terminologies centring on people and society, including the body and human conditions and activities, and social organisation, belief systems, rituals, recreation and other elements of non-material culture. Volume 6 will treat closed categories including adjectives, pronouns, and number. It will also include a review of the main findings of the project, especially comparing the evidence from archaeology and historical linguistics concerning the culture and dispersal of Austronesian speakers into and across the Pacific Is- lands. Volume 6 will also provide an index to lthe POc and other reconstructions presented in the whole work, as well as an English-to-POc finderlist and a list of all languages cited, together with their subgroups.3
The organisation of the present volume is as follows. Chapter 2 is an introduction to the present volume, discussing plant classifications and plant names. It also provides a rationale for the organisation of this volume. Chapter 3 reconstructs major categories of POc ethnob- otanical classification. Chapter 4 deals with POc terms for the parts of trees and other plants. Chapters 5-8 present reconstructed names for wild trees and plants of the bush. They are or- ganised according to vegetation habitat. Chapter 5 treats plants of the coastal strand, chapter 6 plants of the mangrove swamp, chapter 7 primary lowland rain forest plants and chapter 8 plants that are found mainly in secondary lowland rain forest and grassland. Chapters 9-12 present reconstructed names for cultivated food plants.4 Chapter 9 deals with staple foods, chapter 10 with green vegetables, chapter 11 with nut- and fruit-bearing trees and plants and chapter 12 with the coconut palm and its products. Finally, chapter 13 treats trees and plants that are cultivated for purposes other than food production.
Reconstructions of POc phonology and lexicon began with Dempwolff’s pioneering work in the 1920s and 1930s. Dempwolff’s dictionary of reconstructions attributed to Proto Austronesian (PAn) (1938) - but equivalent in modern terms to Proto Malaya-Polynesian (PMP) - contains some 600 reconstructions with reflexes in Oceanic languages.
Since the 1950s, POc and other early Oceanic interstage languages have been the subject of a considerable body of research. However, relatively few new reconstructions safely attributable to POc were added to Dempwolff’s material until the 1970s. In 1969 George Grace made available as a working paper a compilation of reconstructions from various sources amounting to some 700 distinct items, attributed either to POc or to early Oceanic interstages. These materials were presented in a new orthography for POc, based largely on Biggs’ (1965) orthography for an interstage he called Proto Eastern Oceanic. Updated compilations of Oceanic cognate sets were produced at the University of Hawaii in the period 1977-1983 as part of a project directed by Grace and Pawley. These compilations and the supporting data are problematic in various respects and we have made only limited use of them.
Comparative lexical studies have been carried out for several lower-order subgroups of Oceanic: for Proto Polynesian by Biggs (resulting in Walsh & Biggs 1966, Biggs, Walsh & Waqa 1970 and subsequent versions of the POLLEX file, including Biggs & Clark (1993) and Clark & Biggs (2006);5 for Proto Micronesian by scholars at the University of Hawaii (Bender et al. 1983, Bender et al. 2003); for Proto North-Central Vanuatu by Clark (1996, 2009); for Proto Southern Vanuatu by Lynch (1978, 1996b, 2001); for New Caledonia by Ozanne-Rivierre (1992), Haudricourt & Ozanne-Rivierre (1982) and Geraghty (1989); for Proto SE Solomonic by Levy (1980) and Frantisek Lichtenberk (1988); for Proto Central Pacific by Hockett (1976), Geraghty (1983, 1986, 1996 together with a number of unpublished papers); for Proto Eastern Oceanic by Biggs (1965), Cashmore (1969), Levy & Smith (1970), and Geraghty (1990); and for Proto Central Papuan by Pawley (1975), Lynch (1978, 1980), and Ross (1994a).
Robert Blust of the University of Hawaii has, in a series of papers (1970, 1980, 1983-84, 1986, 1989) published extensive, alphabetically ordered, lexical reconstructions (with supporting cognate sets) for interstages earlier than POc, especially for Proto Austronesian, Proto Malaya-Polynesian and Proto Eastern Malaya-Polynesian. He has also written several papers investigating specific semantic fields (1980, 1982, 1987, 1994). At the time of writing, Blust has a major work in progress, the Austronesian Comparative Dictionary (ACD), which will bring together all his reconstructions for Proto Austronesian and lower-order stages. This is stored in electronic form at the University of Hawaii. The version to which we refer dates from 1998.
Several papers predating our project systematically investigated particular semantic domains in the lexicon of POc, e.g. Milke (1958), French-Wright (1983), Pawley (1982, 1985), Pawley & Green (1985), Frantisek Lichtenberk (1986), R. Walter (1989), and the various papers in Pawley & Ross (1994). Ross (1988) contains a substantial number of new POc lexical reconstructions, as well as proposed modifications to the reconstructed POc sound system and the orthography.
These earlier works have provided valuable points of reference, both inside and outside the Oceanic group, and we are indebted particularly to Biggs and Clark (1993), Clark (1996), Lynch (2001c) and Blust (ACD). However, previous Oceanic lexical studies were limited both by large gaps in the data, with a distinct bias in favour of ‘Eastern Oceanic’ languages, and by the technical problems of collating large quantities of data. Although most languages in Melanesia remain poorly described, there are now many more dictionaries and extended word lists, particularly for Papua New Guinea, than there were in the 1980s. And developments in computing hardware and software now permit much faster and more precise handling of data than was possible then. A list of sources and a summary of the Project’ s collation procedures is found in Appendix 1.
As the present project proceeded, we came to realise that the form in which preliminary publications were presented-namely as essays, each discussing cognate sets for a particular semantic field at some length-would also be the best form for the presentation of our final synthesis. A discursive treatment of individual terminologies, as opposed, say, to a dictionary-type listing of reconstructions with supporting cognate sets, makes it easier to relate the linguistic comparisons to relevant issues of culture history, language change, and methodology. Hence each of the present volumes has as its core a collection of analytic essays. Some of these have been published or presented elsewhere, but are printed here in revised form. In some cases we have updated the earlier versions in the light of subsequent research, and, where appropriate, have inserted cross-references between contributions. Authorship is in some cases something of a problem, as a number of people have had a hand in collating the data, doing the reconstructions, and (re)writing for publication here. In most chapters, however, one person did the research which determined the structure of the terminology, and that person appears as the first or only author, and where another or others had a substantial part in putting together the paper itself, they appear as the second and further authors.
The lexical reconstructions presented in these volumes are arrived at using the standard methods of comparative linguistics, which require as preliminaries a theory of sub grouping (§3.2) and the working out of systematic sound correspondences among cognate vocabulary in contemporary languages (§3 .3). As well as cognate: sets clearly attributable to POc, we have included some cognate sets which at this stage are attributable to various interstage languages, particularly Proto Western and Proto Eastern Oceanic (but see §3.2 for definitions). We have set out to pay more careful attention to reconstmcting the semantics of POc forms than has generally been done in earlier work, treating words not as isolates but as parts of terminologies.
Our method of doing ‘terminological reconstruction’ is as follows. First, the terminologies of present-day speakers of Oceanic languages are used as the basis for constructing a hypothesis about the semantic structure of a corresponding POc terminology, taking account of (i) ethnographic evidence, i.e. descriptions of the lifestyles of Oceanic communities and (ii) the geographical and physical resources of particular regions of Oceania. For example, by comparing terms in several languages for parts of an outrigger canoe, or for growth stages of a coconut, one can see which concepts recur and so are likely to have been present in POc. Secondly, a search is made for cognate sets from which forms can be reconstructed to match each meaning in this hypothesised terminology. The search is not restricted to mem- bers of the Oceanic subgroup; if a term found in an Oceanic language proves to have external (non-Oceanic) cognates, the POc antiquity of that term will be confirmed and additional evidence concerning its meaning will be provided. Thirdly, the hypothesised terminology is re-examined to see if it needs modification in the light of the reconstructions. There are cases, highlighted in the various contributions to these volumes, where we were able to reconstruct a term where we did not expect to do so and conversely, often more significantly, where we were unable to reconstruct a term where we had believed we should be able to. In each case, we have discussed the reasons why our expectations were not met and what this may mean for Oceanic culture history.
Blust (1987: 81) distinguishes between conventional ‘semantic reconstruction’, which asks, “What was the probable meaning of protomorpheme X?”, and Dyen and Aberle’s (1974) ‘lexical reconstruction’, where one asks, “What was the protomorpheme which probably meant ‘X’?” At first sight, it might appear that terminological reconstruction is aversion of lexical reconstruction. However, there are sharp differences. Lexical reconstruction applies a formal procedure: likely protomeanings are selected from among the glosses of words in available cognate sets, then an algorithm is applied to determine which meaning should be attributed to each set. This procedure may have unsatisfactory results, as Blust points out. Several reconstructions may end up with the same meaning; or no meaning may be reconstructed for a form because none of the glosses of its reflexes is its protomeaning.
Terminological reconstruction is instead similar to the semantic reconstruction approach. In terminological reconstruction the meanings ofprotomorphemes are not determined in ad- vance. Instead, cognate sets are collected and their meanings are compared with regard to:
For example, it proved possible to reconstruct the following POc terms for tying with cords (vol.1, ch.9, §10):
In each of the supporting cognate sets from contemporary languages there are a number of items whose glosses in the dictionaries or word lists are too vague to tell the analyst anything about the specific denotation of the item, and in the case of *kiti this prevents the assignment of a more specific meaning. The verb *buku can be identified as the generic term for tying a knot because of its derivational relationship (by zero derivation) with a noun whose denotation is clearly generic, *buku ‘node (as in bamboo or sugarcane); joint; knuckle; knot in wood, string or rope’ (vol.1, ch.4, §3.2). Reconstruction of the meaning of *pʷita as ‘tie by encircling’ is supported by the meanings of the Lukep, Takia and Longgu reflexes, respectively ‘tie by encircling’, ‘tie on (as grass-skirt)’, and ‘trap an animal’s leg; tie s.t. around ankle or wrist’: Lukep and Takia are North New Guinea languages, whilst Longgu is SE Solomonic. Reconstruction of the meaning of *paqu(s), *paqus-i- as ‘bind, lash; construct (canoe +) by tying together’ is supported by the meanings of the Takia, Kiribati and Samoan reflexes, respectively ‘tie, bind; construct (a canoe)’, ‘construct (canoe, house)’, and ‘make, construct (wooden objects, canoes + )’: Takia is a North New Guinea language, Kiribati is Microne- sian, and Samoan is Polynesian. The meaning of *pisi is similarly reconstructed by reference to the meanings of its Mono-Alu, Mota, Port Sandwich, Nguna and Fijian reflexes.
Often, however, the contributors have often been less fortunate in the information available to them. For example, Osmond (vol.1, ch.8, §9) reconstructs six POc terms broadly glossed as ‘spear’. Multiple terms for implements within one language imply that these items were used extensively and possibly in specialised ways. Can we throw light on these specialised ways? Unfortunately, some of the word lists and dictionaries available give minimal glosses, e.g. ‘spear’, for reflexes of the six reconstructions. What we need to know for each reflex is: what is the level of reference? Is it a term for all spears, or perhaps all pointed projectiles including arrows and darts? Or does it refer to a particular kind of spear? Is it noun or verb or both? If a noun, does it refer to both the instrument and the activity? Most word lists are frustratingly short on detail. For this kind of detail, ethnographies have proved a more fruitful source of information than many word lists.
Another problem is inherent in the dangers of sampling from over 450 languages. The greater the number of languages, the greater are the possible variations in meaning of any given term, and the greater the chances of two languages making the same semantic leaps quite independently. Does our (sometimes quite limited) cognate set provide us with a clear unambiguous gloss, or have we picked up an accidental bias, a secondary or distantly related meaning? Did etymon x refer to fishhook or the material from which the fishhook was made? Did etymon y refer to the slingshot or to the action of turning round and round?
Although the subgrouping of Austronesian languages and questions about which protolanguage was spoken where remain in some cases somewhat controversial, it is impossible to proceed without making some assumptions about these matters. Figures 1.1 and 1.2 are approximate renderings of our subgrouping assumptions. The upper part of the tree, shown in Figure 1.1, is due to Blust, originally presented in Blust (1977b) and repeated with additional supporting evidence in subsequent publications (1978, 1982, 1983-84, 1993).6 The diagram of the lower (Oceanic) part of the tree in Figure 1.2 shows nine primary subgroups of Oceanic. Its rake-like structure indicates that no convincing body of shared innovations has been found to allow any of the nine subgroups to be combined into higher-order groupings. Sections 3.2.2, 3.2.3, and 3.2.4 offer some commentary on our subgrouping, and in §3.2.3 we explain how we handle the rake-like structure in making reconstructions.
In Figures 1.1 and 1.2 each node is either a single language,7 usually a reconstructed proto language, or, in italics, a group of languag1es.
Where a node is a proto language, its descendants form a proper subgroup (in the technical sense in which historical linguists use the term ‘subgroup’ ). A proper subgroup is identified by innovations shared by its member languages, i.e. it is ‘innovation-defined’ in the terminology of Pawley & Ross (1995). These innovations are assumed to have occurred just once in the subgroup’ s protolanguage, i.e. the exclusively shared ancestor of its members. Thus languages of the large Oceanic subgroup of Austronesian share a set of innovations relative to the earlier Austronesian stages shown in Figure 1.1 (Dempwolff 1934).8 By inference these innovations occurred in their common ancestor, POc, and the claim that they are innovations is based on a comparison of reconstructed POc with reconstructed PMP. The innovations may be phonological (e.g. PMP *e, pronounced [g], and PMP *aw both became POc *o), morphological (e.g. POc acquired a morphological distinction between three kinds of possessive relationship: food, drink and default), or lexical (e.g. PMP *limaw ‘citrus fruit’ was replaced by POc *molis).
Proto Oceanic
Italics are used in Figures 1.1 and 1.2 to indicate a group of languages which is not a proper subgroup, i.e. has no identifiable exclusively shared parent. Thus Formosan languages in Figure 1.1 indicates a collection of languages descended (along with PMP) from PAn. They are spoken in Taiwan, but do not form a subgroup. There was no ‘Proto Formosan’, as Formosan languages and language groups are all descended directly from PAn.
Some of the italicised labels in Figures 1.1 and 1.2 include the term linkage. A linkage (an ‘innovation-linked group’ in the terminology of Pawley & Ross 1995) is a collection of usually quite closely related languages or dialects,9 speakers of which were in sufficient contact at one time or another during their history for innovations to pass from one language to the next, often resulting in a pattern such that the domains of various innovations overlap but are not coterminous.10 A number of Oceanic linkages were recognised during the 1980s (Geraghty 1983, Pawley & Green 1984, Ross 1988).11 A linkage may arise in at least three ways, but distinguishing between them is often impossible.
First, what would otherwise be a proper subgroup may happen to lack exclusively shared innovations, perhaps because the parent did not exist as a unit for long enough to undergo any innovations of its own.12
Second, a linkage may consist of some but not all of the languages descended from a single parent. The Western Oceanic linkage reflects the innovations of POc, but no innovation is common to the whole of Western Oceanic (although the merger of POc *r and *R comes close). However, the languages of its three component linkages - North New Guinea, Papuan Tip and Meso-Melanesian - display complex patterns of overlapping innovations. The Western Oceanic linkage appears to be descended from the dialects of POc that were left behind in the Bismarck Archipelago after speakers of the languages ancestral to the other eight primary subgroups in Figure 1.2 had moved away to the north or east. After these departures various innovations occurred. Each arose somewhere in the Western Oceanic dialect network and spread to neighbouring dialects without reaching every dialect in the network.
The third type of linkage is the result of contact among languages descended from more than one immediate parent. An example is the Fijian linkage, which represents a reintegration of parts of earlier Western and Eastern Fijian linkages (Geraghty & Pawley 1981, Geraghty 1983, Pawley 1996b).13 Geraghty reconstructed the history of the Fijian linkage by painstaking analysis of innovations from at least two stages in its history. From the earlier period Western Fijian languages share innovations with Rotuman and Eastern Fijian with Polynesian. From a more recent period Western Fijian and Eastern Fijian languages share innovations with each other, reflecting their reintegration into a single linkage, within which the present Western/Eastern boundary has shifted relative to the (fuzzy) boundary of the earlier period.
For most of the linkages noted in Figures 1.1 and 1.2 this kind of analysis is not available. For example, Blust (1993) indicates that CEMP was a linkage. But its history is far from clear. Was there perhaps a PCEMP that was so shortlived that it underwent no innovations of its own? Or does CEMP perhaps include some languages that share history with languages to their west and others that share history with those to their north? The North/Central Vanuatu linkage, long assumed to be some sort of genealogical unit, appears to reflect the partial reintegration of at least two dialect networks, North Vanuatu and Central Vanuatu, that probably never diverged greatly from each other, but the details of this history are difficult to elucidate (Lynch 2000c).14
The languages of a linkage have no identifiable exclusively shared parent. Yet we have found many instances in which a cognate set is limited to one of the linkages in Figures 1.1 and 1.2: CEMP, Western Oceanic, New Guinea Oceanic, Southern Oceanic or the reintegrated North and Central Vanuatu linkage. As with PEOc and PROc (§3.2.3), we think it is preferable to attribute these reconstructions to a hypothetical protolanguage rather than to a higher node in the tree. Hence there are reconstructions labelled PCEMP, PWOc and so on. Again these apparent lexical innovations offer only the weakest evidence for the protolanguage to which they are attributed. In addition to the explanations of the kinds offered for PEOc and PROc etyma in §3.2.3 it is possible, for example, that an innovatory ‘PWOc’ etymon arose when the Western Oceanic dialect network was still close-knit, and spread from dialect to dialect before the network broke into the two networks ancestral to its present-day first-order subgroups.
The strength of a lexical reconstruction rests crucially on the distribution of the supporting cognate set across subgroups. The distribution of cognate forms and agreements in their meanings is much more important than the number of cognates. It is enough to make a secure reconstruction if a cognate set occurs in just two languages in a family, with agreement in meaning, provided that the two languages belong to different primary subgroups and provided that there is no reason to suspect that the resemblances are due to borrowing or chance. The PMP term *apij ‘twins’ is reflected in several western Malaya-Polynesian languages (e.g. Batak apid ‘twins, double (fused) banana’) but only a single Oceanic reflex is known, namely Roviana avisi ‘twins of the same sex’. Because Roviana belongs to a different first-order branch of Malaya-Polynesian from the western Malaya-Polynesian witnesses and because there is virtually no chance that the agreement is due to borrowing or chance similarity, this distribution is enough to justify the reconstruction of PMP *apij, POc *apic ‘twins’.
The rake-like form of Figure 1.2 almost certainly reflects the very rapid settlement of Oceania out of the Bismarcks,15 but it confronts us with a methodological question. If we follow the rubric that we make a reconstruction if a cognate set occurs in languages of just two primary subgroups, then reflexes of an etymon in, say, a SE Solomonic language and a Micronesian language would be sufficient evidence for a POc reconstruction and the absence of reflexes in Admiralty and Western Oceanic would be irrelevant. Given what we know about the location of the POc homeland (in the Bismarcks; vol.2, ch.2) and the early eastward spread of Oceanic speakers, this is too loose a criterion. Instead, we assume two hypothetical nodes not shown in the tree in Figure 1.2.16 These are
If a cognate set occurs in two or all three of the groups in Remote Oceanic, the reconstruction is attributed to PROc (PROc ). If a cognate set occurs in one or more of the groups in Remote Oceanic and in SE Solomonic, it is attributed to Proto Eastern Oceanic (PEOc). In this way we acknowledge that such reconstructions may represent an innovation that postdates the spread of the early Oceanic speech community. There are enough PROc and PEOc reconstructions to suggest that such lexical innovations indeed occurred. This in turn provides weak evidence for Remote Oceanic and Eastern Oceanic subgroups, but evidence that is too weak to be relied on, for at least two reasons. First, it is quite possible that some of our PROc and PEOc reconstructions will be promoted to POc as more Admiralty and Western Oceanic data become available. Second, it is reasonable to assume that some of our PROc and PEOc etyma are of POc antiquity but happen to have been lost in Proto Admiralty and Proto Western Oceanic. Without supporting phonological or morphological evidence we are unwilling to treat PROc or PEOc as anything other than convenient hypotheses which allow us to retain rigorous criteria for a POc reconstruction.
In volumes 1 and 2 a reconstruction here labelled ‘PROc’ would have been labelled ‘PEOc’, but the absence of SE Solomonic reflexes from among its reflexes indicates that it has the same status as a PROc reconstruction in the present volume. Two factors have led to the distinction between PEOc and PROc here. One is that, because the primary biogeographic divide in Oceania is between Near and Remote Oceania (see vol. 2, Map 5), i.e. between the Solomon Islands and Vanuatu, the question of whether or not a plant name includes a SE Solomonic reflex is significant, and there are many plant names that do not (and are thus attributed to PROc ). The other is that the historical separateness of SE Solomonic from both Western Oceanic and the groups treated as Remote Oceanic has become increasingly clear through recent research (Pawley 2007).
Our criterion for recognising a reconstruction as POc is that the cognate set must occur in at least two out of four criteria! groupings: Admiralties (or Yapese or Mussau), Western Oceanic, Temotu and our hypothetical Eastern Oceanic. Both here and at the hypothetical interstages defined above, no reconstruction is made if there are grounds to infer borrowing from one of these groupings to another.18 We also reconstruct an etymon to POc if it is reflected in just one of the four criteria! groupings and in a non-Oceanic Austronesian language (a member of one of the subgroups on the left branches in Figure 1.1), as illustrated above by the reconstruction of POc *apic ‘twins’.
These criteria are identical to those appli1ed in volumes 1 and 2 except for the addition of Temotu (which figures in very few cognate sets). The establishment of Temotu as a primary subgroup (Ross & Nᴂss 2007) postdates the publication of volumes 1 and 2. Temotu comprises the languages of the Reef Islands, Santa Cruz, Utupua and Vanikoro, located 400 km east of the main Solomons archipelago and to the north of Vanuatu (Map 3).
There are indications that Yapese (a single-language subgroup) and Mussau and Tench (a subgroup with two closely related languages) may be more closely related to Admiralty than to any other Oceanic subgroup,19 and for this reason they are treated as Admiralty languages for the purposes of reconstruction. That is, the presence of a reflex in one or more of these languages and in Admiralty does not support a POc reconstruction, but the presence of of a reflex in one or more of these languages and one of Western Oceanic, Temotu and Eastern Oceanic does support one.
In chapter 2 (§4) of volume 2 Pawley discusses Blust’s (1998a) proposal that the primary split in Oceanic divides Admiralty from a subgroup embracing all other Oceanic languages. Pawley dubs the latter ‘Nuclear Oceanic’. If Blust’s subgrouping were accepted, then an etymon which lacked cognates outside Oceanic would need to be reflected both in an Admiralties language and in a non-Admiralties language for a POc reconstruction to be made. Etyma with reflexes in both Western and Eastern Oceanic, but not in the Admiralties, would be reconstructed as Proto Nuclear Oceanic. Under the criteria outlined above, however, we attribute these reconstructions to POc. These criteria were used in volumes 1 and 2, and we have thought it wise to maintain them throughout the volumes of this work. The reader who wishes to single out reconstructions attributable to a putative Proto Nuclear Oceanic (rather than to POc) can easily recognise them, however. They are those POc reconstructions for which (i) there are no Admiralties reflexes, and (ii) there is no higher-order reconstruc- tion (i.e. PEMP, PCEMP, PMP or PAn), since the latter would be based on cognates outside Oceanic.
This section brings together brief notes on the subgroups in Figure 1.2 beyond those mentioned in the discussion in §§3.2.3-3.2.2.
Admiralty is a proper subgroup (Ross 1988: ch.9).
Western Oceanic consists of the North New Guinea (NNG), Papuan Tip (PT), Meso-Melanesian (MM) linkages and the Sarmi/Jayapura (SJ) group (see Map 4). The last-named may belong to the NNG linkage, but this is uncertain (Ross 1996b). It is not shown in Figure 1.2 and its languages do not play a crucial role in reconstruction. It is likely that the
NNG and PT groups form a super-group, the New Guinea Oceanic linkage, and so etyma reflected only in NNG and PT languages are attributed to a putative Proto New Guinea Oceanic (Milke 1958, Pawley 1978), and etyma reflected in either NNG or PT (or both) and in MM are labelled PWOc.
SE Solomonic was established as a proper subgroup by Pawley (1972: 98-110). Further support was provided by Levy (1979, 1980, n.d.), Tryon & Hackman (1983) and Frantisek Lichtenberk (1988).
The Southern Oceanic linkage as proposed by Lynch (1999, 2000c, 2001, 2004) is characterised by complex overlapping innovations, but by none that are reflected in all its member languages and would qualify it as a proper subgroup (see discussion in Lynch et al. 2002: 112-114).20
Micronesian is a proper subgroup (Jackson 1983, 1986, Bender et al. 2003).
Central Pacific is a proper subgroup, but one defined by only a handful of shared innovations, indicating that the period of unity was short (Geraghty 1996). The high-order sub-grouping of Central Pacific is due to Geraghty (1983), except for the position of Rotuman, due to Pawley (1996b). Within Central Pacific is another long recognised proper subgroup, Polynesian, for which Pawley (1996a) lists diagnostic innovations.
As we noted above, reconstruction depends on working out the systematic sound correspondences among cognate vocabulary in contemporary languages and on having a working hypothesis about how the sounds of POc have: changed and are reflected in modern Oceanic languages. Working out sound correspondences even for twenty languages is a large task, and so we have relied heavily on our own previous work and the work of others. The sound correspondences we have used are those given by Ross (1988) for Western Oceanic and Admiralties; by Levy (1979, 1980) and Frantisek Lichtenberk (1988) for Cristobal-Malaitan, by Pawley (1972) and Tryon & Hackman (1983) for SE Solomonic; by Ross & Nᴂss (2007) for Temotu; by Tryon (1976) and Clark (1996, 2009) for North and Central Vanuatu; by Lynch (1978, 2001) for Southern Vanuatu; by Geraghty (1989) and Ozanne-Rivierre (1992) for New Caledonia; by Jackson (1986) and Ross (1996a) for Nuclear Micronesian; by Geraghty (1986) for Central Pacific; by Biggs (1978) for Polynesian; by Ross (1996a) for Yapese; and by Ross (1996b) for Oceanic languages of Irian Jaya.
For non-Oceanic languages we have referred to sound correspondences given by Tsuchida (1976) for Formosan languages; by Zorc (1977, 1986) and Reid (1982) for the Philippines; by Adelaar (1992b) and Nothofer (1975) for Malay and Javanese; by Sneddon (1984) for Sulawesi; by Collins (1983) for Central Maluku; and by Blust (1978a) for South Halmahera and Irian Jaya.
We are well aware that regular sound correspondences can be interfered with in various ways: by phonetic conditioning that the analyst has not identified (see, e.g., Blust 1996a), by borrowing (for an extreme Oceanic case, see Grace 1996), or, as recent research suggests, by the frequency of an item’s use (Bybee 1994). We have tried at least to note, and sometimes to account for, irregularities in cognate sets.
*pʷ | *p | *t | *c | *k | *q |
*bʷ | *b | *d | *j | *g | |
*s | |||||
*mʷ | *m | *n | *ñ | *ŋ | |
*r | *R | ||||
*dr | |||||
*l | |||||
*w | *y |
*i | *u | |
*e | *o | |
*a |
Work based on the sound correspondences of both Oceanic and non-Oceanic languages has resulted in the reconstructed paradigm of POc phonemes shown in Table 1.1. The orthography used here and in the POc reconstructions in this work is from Ross (1988), with the addition of *pʷ. POc phonology and its relationship to PMP is discussed in greater detail in Chapter 2 (§2) of volume 1. Since the publication of volume 1, articles by John Lynch have appeared on POc stress (2000) and POc labiovelar phonemes (2002).
Table 1.2 shows two POc orthographies. The first was established by Biggs (1965), for PEOc, and Grace (1969), who applied it to POc. It has been used with a number of variants, separated by a slash in Table 1.2. The second, introduced by Ross (1988), is the one generally used in this work. The terms ‘oral grade’ and ‘nasal grade’ were used by Grace (1969) and have become conventional among Oceanic linguists to refer to the outcomes of certain sound changes that occurred between PMP and POc (vol.1, ch.2, §2.4).
Grace | oral grade | *p | *pw | *t | *d/*r | *s | *j | *k | |
Ross | *p | *pʷ | *t | *r | *s | *c | *k | ||
Grace | nasal grade | *mp | *ŋp/*mpw | *nt | *nd/*nr | *nj | *ŋk | ||
Ross | *b | *bʷ | *d | *dr | *j | *g |
Grace | *m | *ŋm/*mw | *n | *ñ | *ŋ | *w | *y | *l | *q | *R |
Ross | *m | *mʷ | *n | *ñ | *ŋ | *w | *y | *l | *q | *R |
Grace | *i | *o | *e | *a | *u |
Ross | *i | *o | *e | *a | *u |
Each of the contributions to these volumes concerns a particular POc ‘terminology’. Generally, each contribution begins with an introduction to the issues raised by the reconstruction of its particular terminology, and the bulk of each contribution consists of reconstructed etyma with supporting data and a commentary on matters of meaning and form.
The reconstruction of POc *kayu ‘tree’ below, adapted from Chapter 3, §4.2, shows how reconstructions and supporting cognate sets are presented. Above it is Dempwolff’s earlier superordinate (PMP) reconstruction. Below it are supporting reflexes. Contributors vary in the degree to which they insert lower-order reconstructions like PPn *kau below. Lower-order reconstructions are sometimes given to clarify the relationship of reflexes to the higher-order reconstruction: Southern Vanuatu languages, for example, have undergone so much phonological change that a Proto Southern Vanuatu reconstruction helps explicate the relationship between Southern Vanuatu reflexes and the POc reconstruction. In the set below the Proto Polynesian (PPn) reconstruction shows an extension in its meaning.
PMP | *kayu | ‘tree, wood, timber’ (Dempwolff 1938) |
POc | *kayu | ‘tree or shrub: generic name for plants with woody stems and branches, probably not including palms or tree-ferns; wood, stick’ |
Adm: Loniu | ke | ‘tree, wood’ |
Adm: Titan | kei | ‘firewood’ |
NNG: Lukep (Pono) | kai | ‘tree, wood’ |
NNG: Takia | ai | ‘tree (generic), wood, firewood, plant’ |
PT: Iduna | ai | ‘tree, plant, wood, fire, light’ |
PT: Motu | au | ‘tree, firewood’ |
MM: Patpatar | ai- | ‘tree species, followed by name of species’ |
MM: Mono | au | ‘tree’ |
SES: Bugotu | yai- | ‘tree, shaft of spear’ |
SES: Kwaio | ʔai | ‘branch, tree, stick; woody plant (shrub, tree)’ |
NCV: NE Ambae | kai | ‘tree, wood’ |
NCV: Nguna | na-kau | ‘tree’ |
SV: Anejom | in-ɣai | ‘tree, wood, often used for relatively small bushes’ |
NCal: Xaracuu | kʷãã | ‘wood, tree (general term)’ |
Mic: Kiribati | kai | ‘wood (in general), tree, plant, stick’ |
Mic: Ulithian | -xᴂy | ‘counting classifier for trees’ |
Fij: Kadavu | kaðu | ‘tree, piece of wood, stick’ |
Fij: Wayan | kai | ‘wood; generic for trees and shrubs, and occasionally also low bushy plants; used in certain compounds as generic for all plants; piece of wood, stick’ |
PPn | *kau | ’wood, timber, stalk, stem, handle” (POLLEX) |
Pn: Tongan | kau | ‘stalk, stem’ |
Pn: Hawaiian | ʔau | ‘handle, staff, stem, bone of lower arm or leg’ |
Because our supporting data are drawn from such a wide range of languages, the convention is adopted of prefixing each language name with the abbreviation for the genealogical or geographic group to which the language belongs, so that the distribution of a cognate set is more immediately obvious. Table 1.3 is a key to the labels. Figure § 1.2 shows the positions of these groups in the Oceanic tree. We have sought to be consistent in always listing these groups in the same order, but contributors vary in the ordering of languages within groups.
Lynch’s recent research on Southern Oceanic (§3.2.4) renders the NCV group mildly anomalous, although there is no doubt that it is an integrated dialect network. There are a number of etyma whose reflexes are confined to North and Central Vanuatu, and so we continue to make ‘Proto North/Central Vanuatu’ reconstructions, even though these perhaps represent a Southern Oceanic term that has been lost in southern Vanuatu and New Caledonia. Where the distribution of reflexes requires it, the chapters in this volume include reconstructions for PROc and for PSOc. Etyma with these distributions were attributed to PEOc in volumes 1 and 2, but the distributions are transparent, thanks to the presence of the group labels in cognate sets.
In the interests of space we have not given the history of the reconstructions themselves, as this would often require commentary on the modifications made by others and by us, and on why we have made them. Where a reconstruction is not new, we have tried to give its earliest source, e.g. ‘Dempwolff 1938’ above, but this is difficult when earlier reconstructions differ in form and meaning.
In general, the contributions to these volumes are concerned with items reconstructable in POc, PWOc, PEOc, PROc and occasionally Prolto New Guinea Oceanic (PNGOc). Etyma for PWOc, PNGOc and PEOc are reconstructed because these may well also be POc etyma for which known reflexes are not well distributed (see discussion in §3.2.3). The contributors vary in the degree to which they reconstruct etyma for interstages further down the tree.
Yap: | Yapese (one language) |
Adm: | Admiralty and Mussau/Tench |
SJ: | Sarmi/Jayapura |
NNG: | North New Guinea |
PT: | Papuan Tip |
MM: | Meso-Melanesian |
SES: | Southeast Solomonic |
TM: | Temotu |
NCV: | North/Central Vanuatu, i.e. the reintegrated network formed by the North and Central Vanuatu linkages |
SV: | Southern Vanuatu |
NCal: | Loyalty and New Caledonia |
Mic: | Micronesian |
Fij: | Fijian, i.e. the reintegrated network formed by Western and Eastern Fijian dialects |
Pn: | Polynesian |
Reconstructions for lower-order interstages are decreasingly likely to reflect POc etyma and may be the results of cultural change as Oceanic speakers moved further out into the Pacific.
Contributors have usually not sought to make fresh reconstructions at interstages superordinate to POc. What they have done, however, is to cite other scholars’ reconstructions for higher-order interstages, as these represent a summary of the non-Oceanic evidence in support of a given POc reconstruction. These interstages are shown in Figure 1.1, together with their abbreviations.
Sometimes non-Oceanic evidence has been found to support a POc reconstruction where no reconstruction at a higher-level interstage has previously been made. In this case a new higher-order reconstruction is made, and the non-Oceanic evidence is given in a footnote.
Whilst we have tried to use the internal organisation of the lexicons of Oceanic languages themselves as a guide in setting the boundaries of each terminology, we have inevitably taken decisions which differ from those that others might have made. There are, obviously, overlaps and connections between various semantic domains and therefore between the contributions here. We have done our best to provide cross-references, but we have sometimes duplicated information rather than ask the reader repeatedly to look elsewhere in the book. Indexes at the end of each volume and in the final volume are intended to make it easier to use the volumes collectively as a work of reference.
Data sources are listed in Appendix 1.
For some reconstructed etyma only a representative sample of reflexes is given. We have endeavoured to ensure, however, that in each case this sample not only is geographically and genetically representative, but also provides evidence to justify the shape of the reconstruction. Where only a few reflexes are known to us, this is usually noted.
Although there are accepted or standard orthographies for a number of the languages from which data are cited here, all data are transcribed into a standard orthography based on that used by Ross (1988: 3-4) in order to facilitate comparison.21 This means, for example, that the j of the German-based orthographies of Yabem and Gedaged becomes y, Yabem c becomes ʔ; Gedaged z becomes ɬ and so on; the ng of English-based orthographies becomes ŋ and Fijian g, q and c become ŋ, g and ð respectively.
Unless otherwise indicated, the following symbols have their usual phonetic values: ð, g, ɢ, ɣ, h, k, l, ɬ, ʎ, m, n, ŋ, ñ, p, q, χ, ɾ, r, s, t, w, x, z, ʔ, a, ᴂ, e, ɛ, ə, i, ɨ, o, œ, ɔ, ʌ, u, ɯ. The voiced stops b, d, g and the voiced bilabial trill ʙ are prenasalised in some languages, but prenasalisation is not written unless it is phonemically distinctive. Other orthographic symbols (with values in IPA) are:
f | [ɸ, f] | voiceless bilabial or (less often) labio-dental fricative |
v | [β, v] | voiced bilabial or (less often) labio-dental fricative |
c | [ts], [tʃ] | voiceless alveolar or palatal affricate |
j | [dz], [dʒ] | voiced alveolar or palatal affricate |
y | [j] | palatal glide |
dr | [ⁿr] | prenasalised voiced alveolar trill (as in Fijian) |
ö | [ø] | rounded mid front vowel |
ü | [y] | rounded high front vowel |
Other superscripts and diacritics are as follows:
Except for inflexional morphemes, non-cognate portions of reflexes, i.e. derivational morphemes and non-cognate parts of compounds, are shown in parentheses (…). Where an inflexional morpheme is an affix or clitic and can readily be omitted, its omission is indicated by a hyphen at the beginning or end of the base. This applies particularly to possessor suffixes on directly possessed nouns (vol.1, ch.2, §3.2). Where an inflexional morpheme cannot readily be omitted, then it is separated from its base by a hyphen. This may happen because of complicated morphophonemics or because the morpheme is always present, like the adjectival -n in some NNG and Admiralties languages and prefixed reflexes of the POc article *na in scattered languages. When a reflex is itself polymorphemic (i.e. the morphemes reflect morphemes present in the reconstructed etymon) or contains a reduplication, the morphemes or reduplicates are also separated by a hyphen.
Languages from which data are cited in this volume are listed in Appendix B in their subgroups, together with an index allowing the reader to find the subgroup to which a given language belongs. Appendix B also includes alternative language names. The difficulty of deciding where the borderline between dialect and language lies, combined with the fact that these volumes contain work by a number of contributors, has resulted in some inconsistency in the naming of dialects in the cognate sets (cf the cognate set supporting POc *kayu on ch.3, §4.2). Some occur in the form ‘Kara (E)’, i.e. the East dialect of the Kara language, or ‘Lukep (Pono)’, i.e. the Pono dialect of the Lukep language, whilst others are represented simply by the dialect name, e.g. Iduna, noted in Appendix B as ‘Iduna (= dialect of Bwaidoga)’ .
Reconstructions are marked with an asterisk, e.g. *kayu ‘tree’, in keeping with the standard convention in historical linguistics. POc reconstructions, and also PWOc, PEOc and PNGOc reconstructions, are given in the orthography of §3.4. For reconstructions at higher-order interstages the orthographies are those used by Blust in his various publications and the ACD. Reconstructions at lower-order interstages are given in the standard orthography adopted for data (§4.2). Geraghty’s (1986) PCP orthography, for example, is based on Standard Fijian spelling, and is converted into our standard orthography in the same way as Fijian spelling is. In practice, this means that the orthographies for PROc and PCP are the same as for POc, except that a distinction between *p and *v is recognised and *R is absent. Biggs and Clark’s PPn reconstructions are in any case written in an orthography identical to our standard. Bracketing and segmentation conventions in protoforms are shown in Table 1.4.
(x) | it cannot be determined whether x was present |
(x,y) | either x or y was present |
[x] | the item is reconstructable in two forms, one with and one without x |
[x,y] | the item is reconstructable in two forms, one with x and one withy |
x-y | x and y are separate morphemes |
x- | x takes an enclitic or a suffix |
⟨X⟩ | x is an infix |
PMP final consonants are usually retained in POc. However, it happens fairly often that the final consonant in a higher-order reconstmcted etymon (e.g. *-R in PMP *kamaliR ‘men’s house’) is not evidenced in any Oceanic refllex because POc final consonants are regularly lost in all the daughter languages from which reflexes are drawn, and we therefore have no evidence as to whether or not the final consonant was retained in the POc etymon in question. In such cases the consonant is reconstructed in brackets (e.g. POc *kamali(R)).
In presenting words that display anomalies of form, it is often necessary to posit an expected form. For example, in ch.2, §5, the Gela form ao is presented in support of PEOc *wao ‘forest’. Given the reconstruction, however, we would expect the Gela form to be wao. In this volume we use a less widely employed convention and mark expected forms with a dagger, e.g. †wao, to distinguish them both from reconstructions and real data.22
There are occasions on which we need to posit a hypothetical form in a reconstructed protolanguage. In such cases the dagger and asterisk conventions are used together. For example in ch.4, §2.4, POc *lali(c,t) ‘buttress roots’ is reconstructed. This is a reflex of PMP *dalij, but the first consonant has undergone assimilation to the second in POc: the expected (but unattested) POc form would be †*ralic.
When historical linguists compile cognate sets they commonly retain word for word the glosses given in the sources from which the items are taken. However, again in the interests of standardisation, we have often reworded (and sometimes abbreviated) the glosses of our sources, while preserving the meaning. Where glosses were in a language other than English we have translated them. In the interests of space and legibility, and because data often have multiple sources, we have given the source of a reflex only when it is not included in the listings in Appendix A.
In glosses we use the conventional abbreviations ‘k.o.’ (as in ‘k.o. yam’) for ‘kind of, ’s.o.’ for ‘someone’, ‘s.t.’ for ‘something’, ‘sp.’ for a species and ‘spp.’ for more than one species.
In putting together cognate sets we have quite often found apparent reflexes which do not quite ‘fit’ the set: either they display an unexplained phonological irregularity or their meaning is just a little too different from the rest of the set for us to assume cognacy. Rather than eliminate them our authors often include them below the cognate set under the rubric ‘cf. also’.
As explained in §4.1, a cognate set is almost always headed by a reconstruction for the highest-order Oceanic protolanguage that the set allows us to infer. In this volume these reconstructions are mostly for Proto Oceanic, Proto Western Oceanic, Proto Eastern Oceanic or PROc, and less often for Proto Southern Oceanic or Proto Central Pacific. Since Oceanic speakers spread across the Pacific from the Bismarck Archipelago to Fiji with remarkable speed, reconstructions at any of these interstages are likely to be early. Proto Admiralty, Proto New Guinea Oceanic and PMM reconstructions are also listed in the few cases where they occur, as there is a fair probability that they are also of early Oceanic antiquity. ‘Proto North/Central Vanuatu’ reconstructions are given where reflexes are restricted to North/Central Vanuatu, even though it is improbable that these languages form an exclusive subgroup (§3.2.2). A PNCV reconstruction is more likely to represent a PSOc term that has not survived in southern Vanuatu and New Caledonia.
Proto Polynesian and Proto Micronesian terms that are not reconstructable to an earlier interstage are not given here, partly because they are likely to represent significantly later developments than the interstages enumerated in the previous paragraph, and partly because they are available elsewhere (Proto Polynesian in POLLEX and Proto Micronesian in Bender et al. 2003).
Lower-order reconstructions are also given within a cognate set in certain circumstances. If a term has undergone formal change or has been reapplied to a different plant at a particular point in its history, then a reconstruction is given for the relevant interstage. Also often included are Proto Southern Vanuatu reconstructions from Lynch (2001c, 2004a), as the phonological histories of Southern Vanuatu languages are far from transparent and the reconstruction often helps to illuminate the cognacy of SV items.
Sources of the Oceanic plant-name data used to make the reconstructions in this volume are listed in Appendix 1 after general data sources. Four works which provide Austronesian plant names in non-Oceanic languages have been of particular use in the search for cognates which allow the reconstruction of higher-order (PCEMP and PMP) plant names. These works are:
Certain works which catalogue plants and give information about their forms, habitats and uses have been consulted over and over again, as will be obvious from the recurrence of certain references. They are:
All six works contain vernacular plant names. Since the Bismarck Archipelago is central to the study of POc plant names, it is a pity that there is no modern work dealing with the plants of the region parallel to, say Henderson & Hancock (1988) for the Solomons or Wheatley (1992) for Vanuatu. The Guide to the trees of Papua New Guinea website (Conn & Damas 2006) has been helpful, but still has many gaps in its coverage of the Bismarck Archipelago. Peekel’s coverage is excellent and includes all plants (even seagrasses). Indeed his coverage is considerably better than that of either Henderson & Hancock (1988), which covers ‘useful plants’, or Wheatley (1992), which is restricted to trees. The ethnobotanical works on the Solomons, Kwa’ioloa & Burt (2001) and Hviding (2005), also provide good coverage, but this is balanced by rather thin information about many plants.
Pacific food plants have a literature of their own, and here three works were extensively consulted in addition to those listed above:
Of these only the last provides vernacular terms, with a strong bias towards Vanuatu.
The scientific names of plants are a minefield for the non-botanist, as there are often synonyms and the accepted name for a plant may change over time. In order to keep track of synonyms frequent use was made of the International plant names index (IPNI = IPNI 2004) and the Australian plant names index (APNI = APNI 1991).
Linnaean families have undergone recent changes. For present purposes the most important is that Leguminosae are subdivided into Cisalpiniaceae (formerly Cisalpinioideae), Fabaceae (formerly Papilionatae) and Mimosaceae (formerly Mimosoideae). Other changes relevant here are that Gramineae (grasses) are now Poaceae, and Palmae are now Arecaceae.
Wherever plant names in English and/or one of the three major Pacific pidgins are available, they are included in the section titles in chapters 5 to 13. Names in the three pidgins are marked with an abbreviation. Together with major sources of names, these are:
The pidgin names are given to help some readers to recognise the plant under discussion. In Papua New Guinea it is common for both indigenous people and expatriates to know a plant by its Tok Pisin name but to know neither an English nor a scientific name. The situation is presumably similar in the Solomon Islands and Vanuatu.
The volume has four indexes. The first, as in previous volumes, is an index of reconstructions arranged by their protolanguages. The second is an alphabetical list of reconstructions, the third an index of plants by genus and species, the fourth by botanical family.