The lexicon of Proto Oceanic -

Chapter 4.1 Introduction

Malcolm Ross and Andrew Pawley and Meredith Osmond

Map 1: The Austronesian language family and major subgroups — **Map 1:** The Austronesian language family and major subgroups

1. Aims⇫

This is the fourth in a series of volumes on the lexicon of the Proto Oceanic (POc) language.¹ POc is the immediate ancestor of the Oceanic subgroup of the Austronesian language family. This subgroup consists of all the Austronesian languages of Melanesia east of 136° E, together with those of Polynesia and (with two exceptions) those of Micronesia—more than 450 languages in all (see Map 1).² Extensive arguments for the existence of Oceanic as a clearly demarcated branch of Austronesian were first put forward by Otto Dempwolff in the 1920s, and the validity of the subgroup is now recognised by virtually all scholars working in Austronesian historical linguistics.

The development and break-up of the POc language and speech community were stages in a truly remarkable chapter in human prehistory—the colonisation by Austronesian speakers of the Indo-Pacific region in the period after about 2000 BC. The outcome was the largest of the world’s well-established language families and (until the expansion of Indo-European after Columbus) the most widespread. The Austronesian family comprises more than 1,000 distinct languages. Its eastern and western outliers, Madagascar and Easter Island, are two-thirds of a world apart, and its northernmost extensions, Hawai‘i and Taiwan, are separated by 70 degrees of latitude from its southernmost outpost, Stewart Island in New Zealand. It is likely that the divergence of Oceanic from its nearest relatives, which are the Austronesian languages spoken around Cenderawasih Bay and in South Halmahera (Blust 1978a), began when Austronesian speakers from the Cenderawasih Bay area moved eastwards along the north coast of New Guinea and into the Bismarck Archipelago. There is a strong school of opinion that associates the subsequent break-up of POc with the rapid colonisation of Island Melanesia and the central Pacific by bearers of the Lapita culture between about 1200 and 900 BC (see Map 2 and volume 2, chapter 2).

Map 2: Geographic limits of historically known Oceanic speakers and of presently documented Lapita sites (after Kirch_1997:17, 54) — **Map 2:** Geographic limits of historically known Oceanic speakers and of presently documented Lapita sites (after Kirch_1997:17, 54)

The present project aims to bring together a large corpus of lexical reconstructions for POc, with supporting cognate sets, organised according to semantic fields and using a standard orthography for POc. We hope that this thesaurus will be a useful resource for culture historians, archaeologists and others interested in the prehistory of the Pacific region. The comparative lexical material should also be a rich source of data for various kinds of purely linguistic research, e.g. on semantic change and subgrouping in the more than 400 daughter languages.

Volume 1 of The lexicon ofProto Oceanic deals with material culture. Volumes 2, 3 and 4 examine relevant sets of cognate terms in order to gain insights into how POc speakers viewed and exploited their environment. Volume 2 deals with the geophysical or inanimate environment, volume 3 treats plants and the present volume animals. Volume 5, as it is planned at the time of writing, will investigate terminologies centring on people, including the body and human conditions and activities, and volume 6 will concern social organisation, belief systems, rituals, recreation and other elements of non-material culture. The seventh and final volume will perform a number of functions. It will treat certain lexical categories, e.g. closed classes of lexical roots, not dealt with in earlier volumes. It will review the main findings of the project concerning the culture and environment of Proto Oceanic speakers and will compare these findings with what archaeology tells us about the way of life and environment of the bearers of the Lapita culture. Volume 7 will also provide an index to the POc and other reconstructions presented in the whole work, as well as an English-to-POc finderlist and a list of all languages cited, together with their subgroups.³

The major headings under which animals are divided in this volume, e.g. Fish, Birds, Aquatic invertebrates, etc., largely follow English-language rather than Oceanic categories. There are several reasons for this choice. One is that descriptions of Oceanic taxonomies of animals are few and imperfect. Another is that although the best-described Oceanic taxonomies typically agree with one another in some respects, they vary in others. Yet another is that we assume that readers would be more likely to look, say, for ‘bats’ under Mammals than under Birds.

In keeping with the exceptionally rich diversity of marine fauna in Oceania and its economic and cultural importance to Oceanic societies, three (chapters 2-4) of the seven chapters that follow this introduction are largely devoted to creatures of the sea, as are portions of two others (chapters 5 and 8). Chapters 2 and 3 both concern fish terminologies. Chapter 2 presents supporting cognate sets for more than 140 POc fish names and for many additional names attributable to other major interstages below POc. A good many of the names are identifiable at the level of genus or species, others only to family or class. Chapter 3 investigates the retention rates of a sample of 52 POc fish names and asks why the total number of fish names reconstructed for POc is so much smaller than the number typically distinguished by contemporary Oceanic languages. Chapter 4 presents reconstructions of terms for aquatic invertebrates and their body parts. Chapter 5 examines terms for mammals, reptiles and amphibians. Again it is partly about creatures of the sea. Other than New Guinea, the islands of Oceania have few native land mammals. Chapter 6 deals with names for bird taxa and other terms associated with birds. Chapter 7 is entitled ‘Insects and other creepy-crawlies’, the latter including non-insect terrestrial invertebrates: spiders, centipedes, worms, leeches and grubs. The final chapter of the volume, chapter 8, investigates the semantic histories of several terms that may have been high-level generics or life-forms in the POc taxonomy of animals. It looks for recurrent patterns in the way different languages have extended or reduced the referential range of each of these terms.

2. The relation of the current project to previous work⇫

Reconstructions of POc phonology and lexicon began with Dempwolff’s pioneering work in the 1920s and 1930s. Dempwolff’s dictionary of reconstructions attributed to Proto Austronesian (PAn) (1938) — but equivalent in modern terms to Proto Malayo-Polynesian (PMP) — contains some 600 reconstructions with reflexes in Oceanic languages.

Since the 1950s, POc and other early Oceanic interstage languages have been the subject of a considerable body of research. However, relatively few new reconstructions safely attributable to POc were added to Dempwolff’s material until the 1970s. In 1969 George Grace made available as a working paper a compilation of reconstructions from various sources amounting to some 700 distinct items, attributed either to POc or to early Oceanic interstages. These materials were presented in a new orthography for POc, based largely on Biggs’ (1965) orthography for an interstage he called Proto Eastern Oceanic. Updated compilations of Oceanic cognate sets were produced at the University of Hawai‘i in the period 1977-1983 as part of a project directed by Grace and Pawley. These compilations and the supporting data are problematic in various respects and we have made only limited use of them.

Comparative lexical studies have been carried out for several lower-order subgroups of Oceanic: for Proto Polynesian by Biggs (resulting in Walsh & Biggs 1966, Biggs, Walsh & Waqa 1970 and subsequent versions of the pollex file, including Biggs & Clark (1993) and Clark & Biggs (2006); for Proto Micronesian by scholars associated with the University of Hawai‘i (Bender et al. 1983, Bender et al. 2003); for the ancestor of the Banks and Torres languages by Alexandre François (several unpublished manuscripts); for Proto North and Central Vanuatu by Clark (1996, 2009); for Proto Southern Vanuatu by Lynch (1978, 1996b, 2001); for New Caledonia by Ozanne-Rivierre (1992), Haudricourt & Ozanne-Rivierre (1982) and Geraghty (1989); for PSES by Levy (1980) and Lichtenberk (1988); for Proto Central Pacific by Hockett (1976), Geraghty (1983, 1986, 1996 together with a number of unpublished papers); for Proto Eastern Oceanic by Biggs (1965), Cashmore (1969), Levy & Smith (1970), and Geraghty (1990); and for Proto Central Papuan by Pawley (1975), Lynch (1978, 1980), and Ross (1994a).

Robert Blust of the University of Hawai‘i has, in a series of papers (1970, 1980, 1983-84, 1986, 1989) published extensive, alphabetically ordered, lexical reconstructions (with supporting cognate sets) for interstages earlier than POc, especially for Proto Austronesian, Proto Malayo-Polynesian and Proto Eastern Malayo-Polynesian. He has also written several papers investigating specific semantic fields (Blust 1980b, 1982, 1987, 1994). Blust has a major work in progress, the Austronesian Comparative Dictionary (ACD), which will bring together all his reconstructions for Proto Austronesian and lower-order stages. This is stored in electronic form at the University of Hawai‘i. The version to which we refer dates from 1995.

Several papers predating our project systematically investigated particular semantic domains in the lexicon of POc, e.g. Milke (1958), French-Wright (1983), Pawley (1982a, 1985), Pawley & Green (1985), Lichtenberk (1986), Walter (1989), and the various papers in Pawley & Ross (1994). Ross (1988) contains a substantial number of new POc lexical reconstructions, as well as proposed modifications to the reconstructed POc sound system and the orthography. However, previous Oceanic lexical studies were limited both by large gaps in the data, with a distinct bias in favour of ‘Eastern Oceanic’ languages, and by the technical problems of collating large quantities of data. Although most languages in Melanesia remain poorly described, there are now many more dictionaries and extended word lists, particularly for Papua New Guinea, than there were in the 1980s. And developments in computing hardware and software now permit much faster and more precise handling of data than was possible then. A list of sources and a summary of the Project’s collation procedures is found in Appendix 1.

Several compilations of reconstructions have provided valuable points of reference, both inside and outside the Oceanic group. We are indebted particularly to Bender et al. (2003), two editions of pollex (Biggs & Clark 1993 and Clark & Biggs 2006), Blust (ACD), Clark (2009) and Lynch (2001c).

In the course of planning the several volumes for the present project, we came to realise that the form in which preliminary publications were presented—namely as essays, each discussing cognate sets for a particular semantic field at some length—would also be the best form for the presentation of our final synthesis. A discursive treatment of individual terminologies, as opposed, say, to a dictionary-type listing of reconstructions with supporting cognate sets, makes it easier to relate the linguistic comparisons to relevant issues of culture history, language change, and methodology. Hence each of the present volumes has as its core a collection of analytic essays. Some of these have been published or presented elsewhere, but are printed here in revised form. In some cases we have updated the earlier versions in the light of subsequent research, and, where appropriate, have inserted cross-references between contributions. Authorship is in some cases something of a problem, as a number of people have had a hand in collating the data, doing the reconstructions, and (re)writing for publication here. In most chapters, however, one person did the research which determined the structure of the terminology, and that person appears as the first or only author, and where another or others had a substantial part in putting together the paper itself, they appear as the second and further authors.

3. Reconstructing the lexicon⇫

The lexical reconstructions presented in these volumes are arrived at using the standard methods of comparative linguistics, which require as preliminaries a subgrouping or internal classification of the languages in question (§3.2) and the working out of systematic sound correspondences among cognate vocabulary in contemporary languages (§3.3). As well as cognate sets clearly attributable to POc, we have included some cognate sets which at this stage are attributable to various interstage languages, particularly Proto Western and Proto Eastern Oceanic (but see §3.2 for definitions). We have set out to pay more careful attention to reconstructing the semantics of POc forms than has generally been done in earlier work, treating words not as isolates but as parts of terminologies.

3.1. Terminological reconstruction⇫

Our method of doing ‘terminological reconstruction’ is as follows. First, the terminologies of present-day speakers of Oceanic languages are used as the basis for constructing a hypothesis about the semantic structure of a corresponding POc terminology, taking account of (i) ethnographic evidence, i.e. descriptions of the lifestyles of Oceanic communities and (ii) the geographical and physical resources of particular regions of Oceania. For example, by comparing terms in several languages for parts of an outrigger canoe, or for growth stages of a coconut, one can see which concepts recur and so are likely to have been present in POc. Secondly, a search is made for cognate sets from which forms can be reconstructed to match each meaning in this hypothesised terminology. The search is not restricted to members of the Oceanic subgroup; if a term found in an Oceanic language proves to have external (non-Oceanic) cognates, the POc antiquity of that term will be confirmed and additional evidence concerning its meaning will be provided. Thirdly, the hypothesised terminology is re-examined to see if it needs modification in the light of the reconstructions. There are cases, highlighted in the various contributions to these volumes, where we were able to reconstruct a term where we did not expect to do so and conversely, often more significantly, where we were unable to reconstruct a term where we had believed we should be able to. In each case, we have discussed the reasons why our expectations were not met and what this may mean for Oceanic culture history.

Blust (1987: 81) distinguishes between conventional ‘semantic reconstruction’, which asks, “What was the probable meaning of protomorpheme X?”, and Dyen and Aberle’s (1974) ‘lexical reconstruction’, where one asks, “What was the protomorpheme which probably meant ‘X’?” At first sight, it might appear that terminological reconstruction is a version of lexical reconstruction. However, there are sharp differences. Lexical reconstruction applies a formal procedure: likely protomeanings are selected from among the glosses of words in available cognate sets, then an algorithm is applied to determine which meaning should be attributed to each set. This procedure may have unsatisfactory results, as Blust points out. Reconstructions may end up with crude and overly simple glosses; or no meaning may be reconstructed for a form because none of the glosses of its reflexes is its protomeaning.

Terminological reconstruction is instead similar to the semantic reconstruction approach. In terminological reconstruction the meanings of protomorphemes are not determined in advance. Instead, cognate sets are collected and their meanings are compared with regard to:

their specific denotations, where these are known;
the geographic and genetic distribution of these denotations (i.e. are the glosses from which the protogloss is reconstructed well distributed?);
any derivational relationships to other reconstructions;
their place within a working hypothesis of the relevant POc terminology (e.g., are terms complementary —‘bow’ implies ‘arrow’; ‘seine net’ implies ‘floats’ and ‘weights’? Are there different levels of classification—generic, specific, and so on?).

For example, it proved possible to reconstruct the following POc terms for tying with cords (vol.1, ch.9, §10):

POc *buku ‘tie (a knot); fasten’
POc *pʷita ‘tie by encircling’
POc *paqu(s), *paqus-i- ‘bind, lash; construct (canoe +) by lashing together’
POc *pisi ‘bind up, tie up, wind round, wrap’
POc *kiti ‘tie, bind’

In each of the supporting cognate sets from contemporary languages there are a number of items whose glosses in the dictionaries or word lists are too vague to tell the analyst anything about the specific denotation of the item, and in the case of *kiti this prevents the assignment of a more specific meaning. The verb *buku can be identified as the generic term for tying a knot because of its derivational relationship (by zero derivation) with a noun whose denotation is clearly generic, *buku ‘node (as in bamboo or sugarcane); joint; knuckle; knot in wood, string or rope’ (vol.1, ch.4, §3.2). Reconstruction of the meaning of *pʷita as ‘tie by encircling’ is supported by the meanings of the Lukep, Takia and Longgu reflexes, respectively ‘tie by encircling’, ‘tie on (as grass-skirt)’, and ‘trap an animal’s leg; tie s.t. around ankle or wrist’: Lukep and Takia are North New Guinea languages, whilst Longgu is SE Solomonic. Reconstruction of the meaning of *paqu(s), *paqus-i- as ‘bind, lash; construct (canoe +) by tying together’ is supported by the meanings of the Takia, Kiribati and Samoan reflexes, respectively ‘tie, bind; construct (a canoe)’, ‘construct (canoe, house)’, and ‘make, construct (wooden objects, canoes +)’: Takia is a North New Guinea language, Kiribati is Micronesian, and Samoan is Polynesian. The meaning of *pisi is similarly reconstructed by reference to the meanings of its Mono-Alu, Mota, Port Sandwich, Nguna and Fijian reflexes.

Often, however, the authors have been less fortunate in the information available to them. For example, Osmond (vol.1, ch.8, §9) reconstructs six POc terms broadly glossed as ‘spear’. Multiple terms for implements within one language imply that these items were used extensively and possibly in specialised ways. Can we throw light on these specialised ways? Unfortunately, some of the word lists and dictionaries available give minimal glosses, e.g. ‘spear’, for reflexes of the six reconstructions. What we need to know for each reflex is: what is the level of reference? Is it a term for all spears, or perhaps all pointed projectiles including arrows and darts? Or does it refer to a particular kind of spear? Is it noun or verb or both? If a noun, does it refer to both the instrument and the activity? Most word lists are frustratingly short on detail. For this kind of detail, ethnographies have proved a more fruitful source of information than many word lists.

Another problem is inherent in the dangers of sampling from over 450 languages. The greater the number of languages, the greater are the possible variations in meaning of any given term, and the greater the chances of two languages making the same semantic leaps quite independently. Does our (sometimes quite limited) cognate set provide us with a clear unambiguous gloss, or have we picked up an accidental bias, a secondary or distantly related meaning? Did etymon x refer to fishhook or the material from which the fishhook was made? Did etymon y refer to the slingshot or to the action of turning round and round?

3.2. Subgrouping and reconstruction⇫

3.2.1. Subgrouping⇫

Although the subgrouping of Austronesian languages, and hypotheses about which protolanguage was spoken where, remain in certain cases somewhat controversial, it is impossible to proceed without making some assumptions about these matters. Figures 1.1 and 1.2 are approximate renderings of our subgrouping assumptions. The upper part of the tree, shown in Figure 1.1, is due to Blust, originally presented in Blust (1977b) and repeated with additional supporting evidence in subsequent publications (1978, 1982, 1983-84, 1993, 2009a).⁴ ⁵ The diagram of the lower (Oceanic) part of the tree in Figure 1.2 shows nine primary subgroups of Oceanic. Its rake-like structure indicates that no convincing body of shared innovations has been found to allow any of the nine subgroups to be combined into higher-order groupings. Sections 3.2.2, 3.2.3 and 3.2.4 offer some commentary on our subgrouping, and in §3.2.4 we explain how we handle the rake-like structure in making reconstructions.

Figure 1.1: Schematic diagram showing higher-order subgroups of Austronesian languages. — **Figure 1.1:** Schematic diagram showing higher-order subgroups of Austronesian languages.

3.2.2. Kinds of subgroup⇫

In Figures 1.1 and 1.2 each node is either a single language,⁶ usually a reconstructed protolanguage, or, in italics, a group of languages.

Figure 1.2: Schematic diagram showing the subgrouping of Oceanic Austronesian languages. — **Figure 1.2:** Schematic diagram showing the subgrouping of Oceanic Austronesian languages.

Where a node is a protolanguage, its descendants form a proper subgroup (in the technical sense in which historical linguists use the term ‘subgroup’). A proper subgroup is identified by innovations shared by its member languages, i.e. it is ‘innovation-defined’ in the terminology of Pawley & Ross (1995). These innovations are assumed to have occurred just once in the subgroup’s protolanguage, i.e. the exclusively shared ancestor of its members. Thus languages of the large Oceanic subgroup of Austronesian share a set of innovations relative to the earlier Austronesian stages shown in Figure 1.1 (Dempwolff 1934).⁷ By inference these innovations occurred in their common ancestor, POc, and the claim that they are innovations is based on a comparison of reconstructed POc with reconstructed PMP The innovations may be phonological (e.g. PMP *e, pronounced [ə], and PMP *aw both became POc *o), morphological (e.g. POc acquired a morphological distinction between three kinds of possessive relationship: food, drink and default), or lexical (e.g. PMP *limaw ‘citrus fruit’ was replaced by POc *molis).

Italics are used in Figures 1.1 and 1.2 to indicate a group of languages which is not a proper subgroup, i.e. has no identifiable exclusively shared parent. Thus Formosan languages in Figure 1.1 indicates a collection of languages descended (along with PMP) from PAn. They are spoken in Taiwan, but do not form a subgroup. There was no ‘Proto Formosan’, as Formosan languages and language groups are all descended directly from PAn.

Some of the italicised labels in Figures 1.1 and 1.2 include the term linkage. A linkage (an ‘innovation-linked group’ in the terminology of Pawley & Ross 1995) is a collection of usually quite closely related languages or dialects,⁸ speakers of which were in sufficient contact at one time or another during their history for innovations to pass from one language to the next, often resulting in a pattern such that the domains of various innovations overlap but are not coterminous.⁹ A number of Oceanic linkages were recognised during the 1980s (Geraghty 1983, Pawley & Green 1984, Ross 1988).¹⁰ A linkage may arise in at least three ways, but distinguishing between them is often impossible.

First, what would otherwise be a proper subgroup may happen to lack exclusively shared innovations, perhaps because the parent did not exist as a unit for long enough to undergo any innovations of its own.¹¹

Second, a linkage may consist of some but not all of the languages descended from a single parent. The Western Oceanic linkage reflects the innovations of POc, but no innovation is common to the whole of Western Oceanic (although the merger of POc *r and *R comes close). However, the languages of its three component linkages—North New Guinea, Papuan Tip and Meso-Melanesian—display complex patterns of overlapping innovations. The Western Oceanic linkage appears to be descended from the dialects of POc that were left behind in the Bismarck Archipelago after speakers of the languages ancestral to the other eight primary subgroups in Figure 1.2 had moved away to the north or east. After these departures various innovations occurred. Each arose somewhere in the Western Oceanic dialect network and spread to neighbouring dialects without reaching every dialect in the network.

The third type of linkage is the result of contact among languages descended from more than one immediate parent, indicated in Figure 1.2 by a dashed line around the relevant groups of languages. An example is the Fijian linkage, which represents the partial resynthesis of the Fiji-based descendants of earlier Western Central Pacific and Eastern Central Pacific linkages after Rotuman and Polynesian had split off from them (Geraghty & Pawley 1981, Geraghty 1983, Pawley 1996c).¹² Geraghty reconstructed the history of the Fijian linkage by painstaking analysis of innovations from at least two stages in its history. From the earlier period Western Fijian languages share innovations with Rotuman and Eastern Fijian with Polynesian. From a more recent period Western Fijian and Eastern Fijian languages share innovations with each other, reflecting their reintegration into a single linkage, within which the present Western/Eastem boundary has shifted relative to the (fuzzy) boundary of the earlier period.

For most of the linkages noted in Figures 1.1 and 1.2 this kind of analysis is not available. For example, Blust (1993) indicates that CEMP was a linkage. But its history is far from clear. Does CEMP perhaps include some languages that share history with languages to their west and others that share history with those to their north? The North/Central Vanuatu linkage, long assumed to be some sort of genealogical unit, appears to reflect the partial reintegration of at least two dialect networks, North Vanuatu and Central Vanuatu, that probably had not diverged greatly from each other, but the details of this history are difficult to elucidate (Lynch 2000c).¹³

The languages of a linkage have no identifiable exclusively shared parent. Yet we have found many instances in which a cognate set is limited to one of the linkages in Figures 1.1 and 1.2: CEMP, Western Oceanic, New Guinea Oceanic, Southern Oceanic or the reintegrated North and Central Vanuatu linkage. As with PEOc and PROc (§3.2.4), we think it is preferable to attribute these reconstructions to a hypothetical protolanguage rather than to a higher node in the tree. Hence there are reconstructions labelled PCEMP, PWOc and so on. Again these apparent lexical innovations offer only the weakest evidence for the protolanguage to which they are attributed. In addition to the explanations of the kinds offered for PEOc and PROc etyma in §3.2.4 it is possible, for example, that an innovatory ‘PWOc’ etymon arose when the Western Oceanic dialect network was still close-knit, and spread from dialect to dialect before the network broke into the two networks ancestral to its present-day first-order subgroups.

3.2.3. Further notes on subgroups⇫

This section brings together brief notes on the subgroups in Figure 1.2 beyond those mentioned in the discussion in §3.2.2.

Admiralty is a proper subgroup (Ross 1988: ch.9).

Western Oceanic consists of the North New Guinea (NNG), Papuan Tip (PT), MesoMelanesian (MM) linkages and the Sarmi/Jayapura (SJ) group (see Map 4). The last-named may belong to the NNG linkage, but this is uncertain (Ross 19967?). It is not shown in Figure 1.2 and its languages do not play a crucial role in reconstruction. It is likely that the NNG and PT groups form a super-group, the New Guinea Oceanic linkage, and so etyma reflected only in NNG and PT languages are attributed to a putative Proto New Guinea Oceanic (Milke 1958, Pawley 1978), and etyma reflected in either NNG or PT (or both) and in MM are labelled PWOc.

Introduction

Map 3: Groups of Oceanic languages used in cognate sets — **Map 3:** Groups of Oceanic languages used in cognate sets

SE Solomonic was established as a proper subgroup by Pawley (1972: 98-110). Further support was provided by Levy (1979, 1980, n. d,), Tryon & Hackman (1983) and Lichtenberk (1988).

Temotu comprises the languages of the Reef Islands, Santa Cruz, Utupua and Vanikoro, located 400 km east of the main Solomons archipelago and to the north of Vanuatu (Map 3). It was established as a proper subgroup by (Ross & Nᴂss 2007).

The Southern Oceanic linkage as proposed by Lynch (1999, 2000, 2001, 2004) is characterised by complex overlapping innovations, but by none that are reflected in all its member languages and would qualify it as a proper subgroup (see discussion in Lynch et al. 2002: 112-114).¹⁴

Micronesian is a proper subgroup (Jackson 1983, 1986, Bender et al. 2003).

Central Pacific is a proper subgroup, but one defined by only a handful of shared innovations, indicating that the period of unity was short (Geraghty 1996). The high-order subgrouping of Central Pacific is due to Geraghty (1983), except for the position of Rotuman, due to Pawley (1996c). Within Central Pacific is another long recognised proper subgroup, Polynesian, for which Pawley (1996a) lists diagnostic innovations.

3.2.4. Criteria for reconstruction⇫

The strength of a lexical reconstruction rests crucially on the distribution of the supporting cognate set across subgroups. The distribution of cognate forms and agreements in their meanings is much more important than the number of cognates. It is enough to make a secure reconstruction if a cognate set occurs in just two languages in a family, with agreement in meaning, provided that the two languages belong to different primary subgroups and provided that there is no reason to suspect that the resemblances are due to borrowing or chance. The PMP term *apij ‘twins’ is reflected in several western Malayo-Polynesian languages (e.g. Batak apid ‘twins, double (fused) banana’) but only a single Oceanic reflex is known, namely Roviana avisi ‘twins of the same sex’. Because Roviana belongs to a different first- order branch of Malayo-Polynesian from the western Malayo-Polynesian witnesses and because there is virtually no chance that the agreement is due to borrowing or chance similarity, this distribution is enough to justify the reconstruction of PMP *apij, POc *apic ‘twins’.

The rake-like form of Figure 1.2 almost certainly reflects the very rapid settlement of Oceania out of the Bismarcks,¹⁵ but it confronts us with a methodological question. If we follow the rubric that we make a reconstruction if a cognate set occurs in languages of just two primary subgroups, then reflexes of an etymon in, say, a SE Solomonic language and a Micronesian language would be sufficient evidence for a POc reconstruction and the absence of reflexes in Admiralty and Western Oceanic would be irrelevant. Given what we know about the location of the POc homeland (in the Bismarcks; vol.2, ch.2) and the early eastward spread of Oceanic speakers, this is too loose a criterion. Instead, we assume two hypothetical nodes not shown in the tree in Figure 1.2.¹⁶ These are

Remote Oceanic, comprising Southern Oceanic, Micronesian and Central Pacific;
Eastern Oceanic, comprising SE Solomonic and Remote Oceanic.¹⁷

If a cognate set occurs in two or all three of the groups in Remote Oceanic, the reconstruction is attributed to PROc (PROc). If a cognate set occurs in one or more of the groups in Remote Oceanic and in SE Solomonic, it is attributed to Proto Eastern Oceanic (PEOc). In this way we acknowledge that such reconstructions may represent an innovation that postdates the spread of the early Oceanic speech community. There are enough PROc and PEOc reconstructions to suggest that such lexical innovations indeed occurred. This in turn provides weak evidence for Remote Oceanic and Eastern Oceanic subgroups, but evidence that is too weak to be relied on, for at least two reasons. First, it is quite possible that some of our PROc and PEOc reconstructions will be promoted to POc as more Admiralty and Western Oceanic data become available. Second, it is reasonable to assume that some of our PROc and PEOc etyma are of POc antiquity but happen to have been lost in Proto Admiralty and Proto Western Oceanic. Without supporting phonological or morphological evidence we are unwilling to treat PROc or PEOc as anything other than convenient hypotheses which allow us to retain rigorous criteria for a POc reconstruction.

In volumes 1 and 2 a reconstruction here labelled ‘PROc’ would have been labelled ‘PEOc’, but the absence of SE Solomonic reflexes from among its reflexes indicates that it has the same status as a PROc reconstruction in volume 3 and the present volume. Two factors have led to the distinction between PEOc and PROc here. One was particularly relevant to volume 3: because the primary biogeographic divide in Oceania is between Near and Remote Oceania (see vol. 2, Map 5), i.e. between the Solomon Islands and Vanuatu, the question of whether or not a plant name includes a SE Solomonic reflex is significant, and there are many plant names that do not (and are thus attributed to PROc). The other is that the historical separateness of SE Solomonic from both Western Oceanic and the groups treated as Remote Oceanic has become increasingly clear through recent research (Pawley 2009).

Our criterion for recognising a reconstruction as POc is that the cognate set must occur in at least two out of four criterial groupings: Admiralties (or Yapese or Mussau), Western Oceanic, Temotu and our hypothetical Eastern Oceanic. Both here and at the hypothetical interstages defined above, no reconstruction is made if there are grounds to infer borrowing from one of these groupings to another.¹⁸ We also reconstruct an etymon to POc if it is reflected in just one of the four criterial groupings and in a non-Oceanic Austronesian language (a member of one of the subgroups on the left branches in Figure 1.1), as illustrated above by the reconstruction of POc *apic ‘twins’.

These criteria are identical to those applied in volumes 1 and 2 except for the addition of Temotu (which figures in very few cognate sets). The establishment of Temotu as a primary subgroup (Ross & Nasss 2007) postdates the publication of volumes 1 and 2.

There are indications that Yapese (a single-language subgroup) and Mussau and Tench (a subgroup with two closely related languages) may be more closely related to Admiralty than to any other Oceanic subgroup,¹⁹ and for this reason they are treated as Admiralty languages for the purposes of reconstruction. That is, the presence of a reflex in one or more of these languages and in Admiralty does not support a POc reconstruction, but the presence of of a reflex in one or more of these languages and one of Western Oceanic, Temotu and Eastern Oceanic does support one.

In chapter 2 (§4) of volume 2 Pawley discusses Blust’s (1998a) proposal that the primary split in Oceanic divides Admiralty from a subgroup embracing all other Oceanic languages. Pawley dubs the latter ‘Nuclear Oceanic’. If Blust’s subgrouping were accepted, then an etymon which lacked cognates outside Oceanic would need to be reflected both in an Admiralties language and in a non-Admiralties language for a POc reconstruction to be made. Etyma with reflexes in both Western and Eastern Oceanic, but not in the Admiralties, would be reconstructed as Proto Nuclear Oceanic. Under the criteria outlined above, however, we attribute these reconstructions to POc. These criteria were used in volumes 1 and 2, and we have thought it wise to maintain them throughout the volumes of this work. The reader who wishes to single out reconstructions attributable to a putative Proto Nuclear Oceanic (rather than to POc) can easily recognise them, however. They are those POc reconstructions for which (i) there are no Admiralties reflexes, and (ii) there is no higher-order reconstruction (i.e. PEMP, PCEMP, PMP or PAn), since the latter would be based on cognates outside Oceanic.

3.3. Sound correspondences⇫

As we noted above, reconstruction depends on working out the systematic sound correspondences among cognate vocabulary in contemporary languages and on having a working hypothesis about how the sounds of POc have changed and are reflected in modern Oceanic languages. Working out sound correspondences even for twenty languages is a large task, and so we have relied heavily on our own previous work and the work of others. The sound correspondences we have used are those given by Ross (1988) for Western Oceanic and Admiralties; by Levy (1979, 1980) and Lichtenberk (1988) for Cristobal-Malaitan, by Pawley (1972) and Tryon & Hackman (1983) for SE Solomonic; by Ross & Nasss (2007) for Temotu; by Tryon (1976) and Clark (2009) for North and Central Vanuatu; by Lynch (1978, 2001) for Southern Vanuatu; by Geraghty (1989), Haudricourt & Ozanne-Rivierre (1982), Ozanne-Rivierre (1992) and Ozanne-Rivierre (1995) for New Caledonia; by Jackson (1986) and Ross (1996a) for Nuclear Micronesian; by Geraghty (1986) for Central Pacific; by Biggs (1978) for Polynesian; by Ross (1996a) for Yapese; and by Ross (1996b) for Oceanic languages of Irian Jaya.

Map 4: Oceanic language groups in northwest Melanesia: the Admiralties and St Matthias groups and the subgroups of Western Oceanic — **Map 4:** Oceanic language groups in northwest Melanesia: the Admiralties and St Matthias groups and the subgroups of Western Oceanic

**Table 1.1** Reconstructed paradigm of POc phonemes
*pʷ	*p	*t	*c	*k	*q
*bʷ	*b	*d	*j	*g
		*s
*mʷ	*m	*n	*ñ	*ŋ
		*r			*R
		*dr
		*l
*w			*y

*i		*u
*e		*o
	*a

For non-Oceanic languages we have referred to sound correspondences given by Tsuchida (1976) for Formosan languages; by Zorc (1977, 1986) and Reid (1982) for the Philippines; by Adelaar (1992b) and Nothofer (1975) for Malay and Javanese; by Sneddon (1984) for Sulawesi; by Collins (1983) for Central Maluku; and by Blust (1978a) for South Halmahera and Irian Jaya.

We are well aware that regular sound correspondences can be interfered with in various ways: by phonetic conditioning that the analyst has not identified (see, e.g., Blust 1996a), by borrowing (for an extreme Oceanic case, see Grace 1996), or, as recent research suggests, by the frequency of an item’s use (Bybee 1994). We have tried at least to note, and sometimes to account for, irregularities in cognate sets.

3.4. POc phonology and orthography⇫

Work based on the sound correspondences of both Oceanic and non-Oceanic languages has resulted in the reconstructed paradigm of POc phonemes shown in Table 1.1. The orthography used here and in the POc reconstructions in this work is from Ross (1988), with the addition of *pʷ and *kʷ. POc phonology and its relationship to PMP are discussed in greater detail in Chapter 2 (§2) of volume 1. Since the publication of volume 1, articles by John Lynch have appeared on POc stress (2000) and POc labiovelar phonemes (2002). The putative protophoneme *kʷ is added in this volume for the first time as a result of taking stock of reconstructions in volumes 1-3 and the current volume. It must have had a very low functional load, as it occurs only word-initially before *a and then only rarely (Ross 2010).

Table 1.2 shows two POc orthographies. The first was established by Biggs (1965), for PEOc, and Grace (1969), who applied it to POc. It has been used with a number of variants, separated by a slash in Table 1.2. The second, introduced by Ross (1988), is the one generally used in this work. The terms ‘oral grade’ and ‘nasal grade’ were used by Grace (1969) and have become conventional among Oceanic linguists to refer to the outcomes of certain sound changes that occurred between PMP and POc (vol.1, ch.2, §2.4).

**Table 1.2** POc orthographies after Grace (1969) and Ross (1988)
Grace	oral grade	*p	-	*t	d/r	*s	*j	*k	-
Ross		*p	*pʷ	*t	*r	*s	*c	*k	*kʷ

Grace	nasal grade	*mp	ŋp/mpw	*nt	nd/nr	*nj	*ŋk
Ross		*b	*bʷ	*d	*dr	*j	*g

Grace	*m	ŋm/mw	*n	*ñ	*ŋ	*w	*y	*l	*q_	*R
Ross	*m	*mʷ	*n	*ñ	*ŋ	*w	*y	*l	*q_	*R

Grace	*i	*o	*e	*a	*u
Ross	*i	*o	*e	*a	*u

4. Conventions common to the series⇫

4.1. Presentation of reconstructions⇫

Each of the contributions to these volumes concerns a particular POc ‘terminology’. Generally, each contribution begins with an introduction to the issues raised by the reconstruction of its particular terminology, and the bulk of each contribution consists of reconstructed etyma with supporting data and a commentary on matters of meaning and form.

The reconstruction of POc *taRaqan ‘Holocentrus spp., squirrelfish’ below, adapted from Chapter 2, §14, shows how reconstructions and supporting cognate sets are presented. Above it is a superordinate (PMP) reconstruction drawn from Blust’s Austronesian Comparative Dictionary (ACD; see §2). Below it are supporting reflexes. Contributors vary in the degree to which they insert lower-order reconstructions like PMic *tard and PPn *taʔa below. Lower-order reconstructions are sometimes given to clarify the relationship of reflexes to the higher-order reconstruction: Southern Vanuatu languages, for example, have undergone so much phonological change that a Proto Southern Vanuatu reconstruction helps explicate the relationship between Southern Vanuatu reflexes and the POc reconstruction. Sometimes a lower-order reconstruction displays an extension of meaning or some other semantic change.

PMP	*taRaqan	‘Holocentrus spp., squirrelfish’ (ACD)
POc	*taRaqan	‘Scirgocentron spp., squirrelfish, including A spiniferum’ (Geraghty 1990: PEOc *taRciʔa)
Adm: Loniu	taʔay	‘squirrelfish or silver biddy’ (metathesis: expected form †taya(ʔ))
NNG: Yabem	talaŋ	‘a red sea fish’
PT: Gumasi	tayawana	‘squirrel fish’
PT: Misima	talayan	‘Sargocentron spp., spiny and blue-striped squirrelfish’
PT: Motu	tara	‘Sargocentron violaceum, violet squirrelfish’
MM: Nakanai	talaha	‘k.o. fish’
MM: Lihir	taran	‘Sargocentron spiniferum, spinecheek squirrelfish’
MM: Roviana	ta-tara	‘small reddish fish, easily hooked’
SES: Gela	talā	‘Sargocentron spiniferum, spinecheek squirrelfish’
NCV: S Efate	tra(kap)	‘squirrelfish’
NCal: Pije	jalā	‘Holocentrus spinifer’ [Sargocentron spiniferum]
NCal: Jawe	jarak	‘Holocentrus spinifer’ [Sargocentron spiniferum]
PMic	*tarā	‘squirrelfish’ (Bender et al. 2003)
Mic: Kiribati	tā	‘Sargocentron spp.’
Mic: Marshallese	cera	‘Sargocentron, Myripristis spp., squirrelfish’
Mic: Ponapean	sara	‘Sargocentron spiniferum, spinecheek squirrelfish’
Mic: Chuukese	sarā	‘Sargocentron sp., yellow-lined squirrel fish’
Mic: Woleaian	sezā	‘Adioryx spinifer, spiny squirrel fish’ [Sargocentron spiniferum]
Mic: Puluwatese	hara	‘k.o. red fish’
PPn	*taʔa	‘Sargocentron spiniferum, armoured soldierfish’ (Hooper 1994)
Pn: Tongan	taʔa	‘Sargocentron spiniferum, red, edible’
Pn: Niuean	tā	‘Myripristis violacea, lattice soldierfish’
Pn: Pukapukan	tā	‘Sargocentron spiniferum’
Pn: Samoan	tā (malau)	‘name given to certain fishes of genus Sargocentron when about 30 cm in length’. (See also under *malau below.)
Pn: K’marangi	tā	‘Sargocentron ruber, red squirrelfish’ [Sargocentron rubrum]
Pn: Tokelauan	tā	‘Sargocentron spiniferum’
Pn: Tuvaluan	tā (malau)	‘Adioryx spp.’ [Sargocentron]
Pn: Tikopia	ta	‘sea fish, app. related to large squirrelfish’

Because our supporting data are drawn from such a wide range of languages, the convention is adopted of prefixing each language name with the abbreviation for the genealogical or geographic group to which the language belongs, so that the distribution of a cognate set is more immediately obvious. Table 1.3 is a key to the labels. Figure 1.2 shows the positions of these groups in the Oceanic tree. We have sought to be consistent in always listing these groups in the same order, but contributors vary in the ordering of languages within groups.

Lynch’s recent research on Southern Oceanic (§3.2.3) renders the NCV group mildly anomalous, although there is no doubt that it reflects an integrated dialect network. There are a number of etyma whose reflexes are confined to North and Central Vanuatu, and so we continue to make ‘Proto North/Central Vanuatu’ reconstructions, even though these perhaps represent a Southern Oceanic term that has been lost in southern Vanuatu and New Caledonia. Where the distribution of reflexes requires it, the chapters in this volume include reconstructions for PROc and for PSOc. Etyma with these distributions were attributed to PEOc in volumes 1 and 2, but the distributions are transparent, thanks to the presence of the group labels in cognate sets.

In the interests of space we have not given the history of the reconstructions themselves, as this would often require commentary on the modifications made by others and by us, and on why we have made them. Where a reconstruction is not new, we have tried to give its earliest source, e.g. ‘ACD’ above, but this is difficult when earlier reconstructions differ in form and meaning.

In general, the contributions to these volumes are concerned with items reconstructable in POc, PWOc, PEOc, PROc and occasionally Proto New Guinea Oceanic (PNGOc). Etyma for PWOc, PNGOc and PEOc are reconstructed because these may well also be POc etyma for which known reflexes are not well distributed (see discussion in §3.2.4). The contributors vary in the degree to which they reconstruct etyma for interstages further down the tree. Reconstructions for lower-order interstages are decreasingly likely to reflect POc etyma and may be the results of cultural change as Oceanic speakers moved further out into the Pacific.

Contributors have usually not sought to make fresh reconstructions at interstages superordinate to POc. What they have done, however, is to cite other scholars’ reconstructions for higher-order interstages, as these represent a summary of the non-Oceanic evidence in support of a given POc reconstruction. These interstages are shown in Figure 1.1, together with their abbreviations.

Sometimes non-Oceanic evidence has been found to support a POc reconstruction where no reconstruction at a higher-level interstage has previously been made. In this case a new higher-order reconstruction is made, and the non-Oceanic evidence is given in a footnote.

Whilst we have tried to use the internal organisation of the lexicons of Oceanic languages themselves as a guide in setting the boundaries of each terminology, we have inevitably taken decisions which differ from those that others might have made. There are, obviously, overlaps and connections between various semantic domains and therefore between the contributions here. We have done our best to provide cross-references, but we have sometimes duplicated information rather than ask the reader repeatedly to look elsewhere in the book. Indexes at the end of each volume and in the final volume are intended to make it easier to use the volumes collectively as a work of reference.

**Table 1.3** Abbreviations for the genealogical or geographic groups
Yap:	Yapese (one language)
Adm:	Admiralty and Mussau/Tench
SJ:	Sarmi/Jayapura
NNG:	North New Guinea
PT:	Papuan Tip
MM:	Meso-Melanesian
SES:	Southeast Solomonic
TM:	Temotu
NCV:	North/Central Vanuatu, i.e. the reintegrated network formed by the North and Central Vanuatu linkages
SV:	Southern Vanuatu
NCal:	Loyalty and New Caledonia
Mic:	Micronesian
Fij:	Fijian, i.e. the reintegrated network formed by Western and Eastern Fijian dialects
Pn:	Polynesian

4.2. Data⇫

Data sources are listed in Appendix 1.

For some reconstructed etyma only a representative sample of reflexes is given. We have endeavoured to ensure, however, that in each case this sample not only is geographically and genetically representative, but also provides evidence to justify the shape of the reconstruction. Where only a few reflexes are known to us, this is usually noted.

Although there are accepted or standard orthographies for a number of the languages from which data are cited here, all data are transcribed as far as possible into a standard phonemic orthography based on that used by Ross (1988: 3–4) in order to facilitate comparison.²⁰ This means, for example, that the j of the German-based orthographies of Yabem and Gedaged becomes ʃ, Yabem c becomes ʔ, Gedaged z becomes ɬ and so on; the ng of English-based orthographies becomes ŋ, and Fijian g, q and c become ŋ, g and ð respectively.

The following symbols have more or less their usual IPA values: ð, g, ɢ, ɣ, h, k, l, ɬ, ʎ, m, n, ŋ, ñ, p, q, χ, ɾ, r, s, t, w, x, z, ʔ, a, ᴂ, e, ɛ, ə, i, ɨ, o, œ, ɔ, ʌ, u, ɯ. As far as possible, however, our orthography is phonemic and does not show allophonic variation, so that there are instances where a symbol does not have its usual phonetic value. For example, Wayan Fijian k is a voiceless stop word-initially but [k] is in free or stylistic variation with [y] word-medially. The voiced stops b, d, g and the voiced bilabial trill ʙ are prenasalised in some languages, but prenasalisation is not written unless it is phonemically distinctive. Where a language has just one rhotic, we usually write r, despite the fact that that rhotic is sometimes a flap. Other orthographic symbols (with values in IPA) are:

f	[ɸ, f]	voiceless bilabial or (less often) labio-dental fricative
v	[β, v]	voiced bilabial or (less often) labio-dental fricative
c	[ts], [tʃ]	voiceless alveolar or palatal affricate
j	[dz], [dʒ]	voiced alveolar or palatal affricate
y	[j]	palatal glide
dr	[ⁿr]	prenasalised voiced alveolar trill (as in Fijian)
ö	[ø]	rounded mid front vowel
ü	[y]	rounded high front vowel

Other superscripts and diacritics are as follows:

contrastive long vowels are represented by a macron, e.g. ā;
contrastive vowel nasalisation in New Caledonian languages is represented by a circumflex, e.g. â;
labialisation is marked by a superscript w, e.g. pʷ;
velarisation is marked by a superscript ɯ, e.g. pᵚ;
contrastive aspiration is marked by a superscript h, e.g. pʰ;
apicolabials are represented by the corresponding apical symbol and the linguolabial diacritic (the ‘seagull’ ), e.g. tᫀ;
retroflexes are represented by the corresponding apical symbol with a dot beneath, e.g. ṛ.

Except for inflexional morphemes, non-cognate portions of reflexes, i.e. derivational morphemes and non-cognate parts of compounds, are shown in parentheses ( … ). Where an inflexional morpheme is an affix or clitic and can readily be omitted, its omission is indicated by a hyphen at the beginning or end of the base. This applies particularly to possessor suffixes on directly possessed nouns (vol. 1, ch.2, §3.2). Where an inflexional morpheme cannot readily be omitted, then it is separated from its base by a hyphen. This may happen because of complicated morphophonemics or because the morpheme is always present, like the adjectival -n in some NNG and Admiralties languages and prefixed reflexes of the POc article *na in scattered languages. When a reflex is itself polymorphemic (i.e. the morphemes reflect morphemes present in the reconstructed etymon) or contains a reduplication, the morphemes or reduplicates are also separated by a hyphen.

Languages from which data are cited in this volume are listed in Appendix B in their subgroups, together with an index allowing the reader to find the subgroup to which a given language belongs. Appendix B also includes alternative language names. The difficulty of deciding where the borderline between dialect and language lies, combined with the fact that these volumes contain work by a number of contributors, has resulted in some inconsistency in the naming of dialects in the cognate sets. Some occur in the form ‘Kara (E)’, i.e. the East dialect of the Kara language, or ‘Lukep (Pono)’, i.e. the Pono dialect of the Lukep language, whilst others are represented simply by the dialect name, e.g. Iduna, noted in Appendix B as ‘Iduna (= dialect of Bwaidoga)’.

4.3. Conventions used in representing reconstructions⇫

Reconstructions are marked with an asterisk, e.g. *taRaqan ‘squirrelfish’, in keeping with the standard convention in historical linguistics. POc reconstructions, and also PWOc and PNGOc reconstructions, are given in the orthography of §3.4. For reconstructions at higher-order interstages the orthographies are those used by Blust in his various publications and the ACD. Reconstructions at lower-order interstages are given in the standard orthography adopted for data (§4.2). Geraghty’s (1986) PCP orthography, for example, is based on Standard Fijian spelling, and is converted into our standard orthography in the same way as Fijian spelling is. In practice, this means that the orthographies for PEOc, PROc and PCP are the same as for POc, except that a distinction between *p and *v is recognised and *R is absent from PCP. Biggs and Clark’s PPn reconstructions are in any case written in an orthography identical to our standard. Bracketing and segmentation conventions in protoforms are shown in Table 1.4.

PMP final consonants are usually retained in POc in absolute word-final position. In many cases decisive evidence for retention or loss can be found in those Oceanic languages that usually retain final consonants. However, there are some cases where it is uncertain whether POc kept the PMP finals. This is so when a PMP etymon is not attested in an Oceanic language that consistently retains POc final consonants. An example is *-R in PMP *kamaliR ‘men’s house’, a term reflected in Oceanic only in certain languages that regularly lose POc final consonants. In such cases the consonant is reconstructed in parentheses (e.g. POc *kamali(R)).

In presenting words that display anomalies of form, it is often necessary to posit an expected form. For example, in the cognate set supporting POc *taRaqan ‘squirrelfish’ in §4.1, the Loniu form taʔay is presented in support of PEOc *taRaqan. Given the reconstruction, however, we would expect the Loniu form to be taya(ʔ). In this volume we use a less widely employed convention and mark expected forms with a dagger, e.g. †taya(ʔ), to distinguish them both from reconstructions and real data.²¹

**Table 1.4** Bracketing and segmentation conventions in protoforms
(x)	it cannot be determined whether x was present
(x,y)	either x or y was present
[x]	the item is reconstructable in two forms, one with and one without x
[x,y]	the item is reconstructable in two forms, one with x and one with y
x-y	x and y are separate morphemes
x-	x takes an enclitic or a suffix
⟨x⟩	x is an infix

Sometimes we need to refer to a reconstructed form that one would expect as the regular reflex of an established POc etymon, but which does not occur because an irregular sound change has occurred. In such cases the dagger and asterisk conventions are used together. For example, in ch.7, §4, we reconstruct POc *simuk ‘mosquito, small biting fly’. Its Proto SE Solomonic reflex is reconstructed as *simi (this is the form reflected by the SE Solomonic data), but the expected (regular) PSES form, referred to in our discussion, is †*simu, the regular reflex of POc *simuk. The dagger marks it as expected but not attested to.

Sometimes a hypothetical POc form is marked in this way. In ch.7, §5, we cite PMP *nik-nik, *ñik-ñik ‘tiny biting insect: gnat, sandfly, fruitfly’. The expected POc form is †*ninik, marked with a dagger because it does not actually occur. Instead, the Oceanic data require us to reconstruct the irregular form POc *niku-niku ‘small biting fly’.

When historical linguists compile cognate sets they commonly retain word for word the glosses given in the sources from which the items are taken. However, again in the interests of standardisation, we have often reworded (and sometimes abbreviated) the glosses of our sources, while preserving the meaning. Where glosses were in a language other than English we have translated them. In the interests of space and legibility, and because data often have multiple sources, we have given the source of a reflex only when it is not included in the listings in Appendix A.

Sometimes we use the convention of providing no gloss beside the items in a cognate set whose gloss is identical to that of the POc (or other lower-order) reconstruction at the head of the set, i.e. the reconstruction which they reflect.

Where necessary, we use ‘(v)’, ‘(vi)’, or ‘(vt)’ to indicate that a gloss is a verb, intransitive verb or transitive verb, ‘(n)’ to indicate that it is a noun.

Within glosses we use the conventional abbreviations ‘k.o.’ (as in ‘k.o. yam’) for ‘kind of’, ‘s.o.’ for ‘someone’, ‘s.t.’ for ‘something’, ‘sp.’ for a species and ‘spp.’ for more than one species.

In putting together cognate sets we have quite often found possible cognates which do not quite ‘fit’ the set: they display unexplained phonological irregularities or their meaning is just a little too different from the rest of the set for us to assume cognacy. Rather than eliminate them we often include them below the cognate set under the rubric ‘cf. also’.

5. Indexes⇫

This volume has three indexes. The first, as in volumes 1 and 2, is an index of reconstructions arranged by their protolanguages. The second is an alphabetical list of reconstructions. The third is an index of animals by their English and Linnaean glosses.

Notes⇫

The project has been jointly directed by Andrew Pawley and Malcolm Ross, with research assistance from Meredith Osmond, in the Department of Linguistics, Research School of Pacific and Asian Studies at the Australian National University. Originally, five volumes were planned, but the large amount of material has required this to be increased to seven (see p2).↩︎
The listing in Tryon (1995) contains 466 Oceanic languages, many of which are subdivisible into dialects.↩︎
This Introduction incorporates much of the material in the Introductions to Volume 1, 2 and 3. We replicate this material here in order that each volume can be used independently. The introduction to volume 3, however, introduced a fresh presentation of the subgrouping of Oceanic languages, and this is retained here.↩︎
For critical overviews of the literature on Austronesian subgrouping, see Ross (1995b), Adelaar (2005) and Blust (2009a). The CEMP and Central Malayo-Polynesian linkages, and also PEMP, have been called into question recently, but these issues lie beyond our present scope (Blust 2009b, Donohue & Grimes 2008, Ross 20086).↩︎
The two very closely related languages Mussau and Tench form a minor exception.↩︎
The two very closely related languages Mussau and Tench form a minor exception.↩︎
Chapter 4 of Lynch et al. (2002) gives a recent account of these innovations.↩︎
In what follows, ‘language’ is used to mean ‘language or dialect’.↩︎
One or more innovations may spread right across the languages of the linkage. In this case it becomes virtually impossible to distinguish it from a proper subgroup.↩︎
Recent work in Indo-European appeals to the concept of linkage: Garrett (2006) suggests that the dialects ancestral to Greek were not dialects of ‘Proto Greek’ but a collection of Nuclear Indo-European dialects drawn together by relations between the communities ancestral to the Greek city states, across which spread the innovations which characterise Ancient Greek.↩︎
A situation in which a subgroup is both proper (i.e. defined by exclusive innovations) and a linkage (displaying overlapping patterns of innovations) is of course possible, the exclusively shared innovations having occurred in the parent, the others after the break-up of the parent. It so happens that we have no need of this construct here.↩︎
‘Eastern Fijian languages’ in Figure 1.2 is our label for Geraghty’s (1983) ‘Tokalau Fijian’.↩︎
For a history of scholarly views of the subgrouping of North and Central Vanuatu languages see Clark (2009: §1.3). For arguments supporting a NCV grouping, see Clark (2009: ch.4).↩︎
Because they have only been recently proposed, Temotu and Southern Oceanic do not appear in Figure 1 of volumes 1 and 2.↩︎
Bearers of the Lapita culture had settled various parts of the Bismarck Archipelago by around 1400 BC (Specht 2007) and colonised the Reefs and Santa Cruz Is. in the Temotu Archipelago by 1200-1100 BC (Green 2003, Green et al. 2008). There were Lapita settlements in New Caledonia by 1100-1000 BC (Sand 2001), in Fiji by about 1000 BC (Clark & Anderson 2009, Nunn et al. 2004), in Tonga by 900 BC (Burley & Connaughton 2007), and in Samoa and the smaller islands between Tonga and Samoa by 800-700 BC (Clark & Anderson 2009).↩︎
We included these nodes in the corresponding tree in Figure 1 of volumes 1 and 2, but this was too easily interpreted as a statement of our views on subgrouping, so we abandon it here and in Appendix 2.↩︎
The term ‘Eastern Oceanic’ and the search for evidence of an Eastern Oceanic subgroup has a relatively long pedigree in Oceanic linguistics (Biggs 1965, Pawley 1972, Pawley 1977, Lynch & Tryon 1985, Geraghty 1990). However, by the time volume 1 of the present work was published in 1998 it was already evident that innovations supporting the existence of an Eastern Oceanic subgroup were not forthcoming, as Pawley & Ross (1995:79) had mentioned in a footnote. Our use of the term here is more inclusive than most, resembling more closely the ‘Central/Eastern Oceanic’ of Lynch & Tryon (1983) (the published version of which, Lynch & Tryon 1985, presents a less inclusive version of Central/Eastern Oceanic) and of Lynch et al. (2002:94-96). who also express reservations about its status.↩︎
Cases where such an inference can be made occur mostly at the boundary (in the Solomon Islands) between Western and Eastern Oceanic. Borrowing is likely (and is often reflected in unexpected sound correspondences) where an etymon occurs (i) in Western Oceanic and only in SE Solomonic languages or (ii) in SE Solomonic languages and only in the NW Solomonic languages (a subgroup within the Meso-Melanesian linkage of Western Oceanic).↩︎
On the positions of Yapese and Mussau. see respectively Ross (1996a) and Ross (1988:315-316, 331).↩︎
The main reason for retaining Ross’ orthography was that the electronic files initially used in this project were drawn in large part from those used in the research reported in Ross (1988).↩︎
Another convention sometimes used for this purpose is a double asterisk, e.g. **taya(ʔ). we prefer the dagger on aesthetic grounds.↩︎

Contents

The lexicon of Proto Oceanic: 4 Animals

Chapter 4.1 Introduction

Aims
The relation of the current project to previous work
Reconstructing the lexicon
Conventions common to the series
Indexes

Cognatesets

References

ACD “Austronesian comparative dictionary (ACD)”
Adelaar 1992b “Proto-Malayic”
Bender et al. 1983 “Micronesian cognate sets”
Bender et al. 2003 “Proto-Micronesian reconstructions”
Biggs 1965 “Direct and indirect inheritance in Rotuman”
Biggs 1978 “The history of Polynesian phonology”
Biggs and Clark 1993 “POLLEX (Polynesian lexicon)”
Blust 1977b “The Proto-Austronesian pronouns and Austronesian subgrouping: a preliminary report”
Blust 1978a “Eastern Malayo-Polynesian: a subgrouping argument”
Blust 1980b “Austronesian etymologies”
Blust 1982 “The linguistic value of the Wallace Line”
Blust 1987 “Lexical reconstruction and semantic reconstruction: the case of Austronesian ‘house’ words”
Blust 1993 “Central and Central-Eastern Malayo-Polynesian”
Blust 1994 “Proto Malayo-Polynesian sibling terms”
Blust 1996a “The Neogrammarian Hypothesis and pandemic irregularity”
Blust 1998a “A note on higher-order subgroups in Oceanic”
Blust 2009a “The Austronesian languages”
Blust 2009b “The position of the languages of Eastern Indonesia: A reply to Donohue and Grimes”
Burley and Connaughton 2007 “First Lapita settlement and its chronology in Vava’u, Kingdom of Tonga”
Bybee 1994 “A view of phonology from a cognitive and functional perspective”
Cashmore 1969 “Some Proto-Eastern Oceanic reconstructions with reflexes in Southeast Solomon Islands languages”
Clark 1996 “North and Central Vanuatu languages: a comparative study”
Clark 2009 “Leo tuai: A comparative lexical study of North and Central Vanuatu languages”
Clark and Anderson 2009 “Colonisation and culture change in Fiji”
Clark and Biggs 2006 “POLLEX: Polynesian lexicon”
Collins 1983 “The historical relationships of the languages of Central Maluku”
Dempwolff 1934 “Vergleichende Lautlehre des Austronesischen Wortschatzes. Vol. 1”
Donohue and Grimes 2008 “Yet more on the position of the languages of eastern Indonesia”
Dyen and Aberle 1974 “Lexical reconstruction: the case of the Proto-Athapaskan kinship system”
French-Wright 1983 “Proto-Oceanic horticultural practices”
Garrett 2006 “Convergence in the formation of Indo-European subgroups: Phylogeny and chronology”
Geraghty 1983 “The history of the Fijian languages”
Geraghty 1986 “The sound system of Proto Central Pacific”
Geraghty 1989 “The reconstruction of Proto-Southern Oceanic”
Geraghty 1990 “Proto-Eastern Oceanic *R and its reflexes”
Geraghty 1996 “Problems with Proto Central Pacific”
Geraghty and Pawley 1981 “The relative chronology of some innovations in the Fijian languages”
Grace 1969 “A Proto-Oceanic finder list”
Grace 1996 “Regularity of change in what?”
Green 2003 “The Lapita horizon and traditions: signature for one set of oceanic migrations”
Green et al. 2008 “The reconstructed environment and absolute dating of SE-SZ-8 Lapita site on Nendo, Santa Cruz, Solomon Islands”
Haudricourt and Ozanne-Rivierre 1982 “Dictionnaire thématique des langues de la région de Hienghène (Nouvelle Caledonie), Pije, Fwâi, Nemi, Jawe (Thematic dictionary of the Hienghène languages, New Caledonia)”
Hockett 1976 “The reconstruction of Proto Central Pacific”
Hooper 1994 “Reconstructing Proto Polynesian fish names”
Jackson 1983 “The internal and external relationships in the Trukic languages of Micronesia”
Jackson 1986 “On determining the external relationships of the Micronesian languages”
Levy 1979 “The phonological history of the Bugotu-Nggelic languages and its implications for Eastern Oceanic”
Levy 1980 “Languages of the southeast Solomon Islands and the reconstruction of Proto-Eastern-Oceanic”
Levy and Smith 1970 “Proto-Eastern Oceanic reconstructions”
Lichtenberk 1986 “Leadership in Proto Oceanic society: linguistic evidence”
Lichtenberk 1988 “The Cristobal–Malaitan subgroup of Southeast Solomonic”
Lynch 1999 “Southern Oceanic linguistic history”
Lynch 2000c “Linguistic subgrouping in Vanuatu and New Caledonia”
Lynch 2001c “The linguistic history of southern Vanuatu”
Lynch and Tryon 1983 “Central Oceanic: a subgrouping hypothesis”
Lynch and Tryon 1985 “Central-Eastern Oceanic: a subgrouping hypothesis”
Lynch et al. 2002 “The Oceanic languages”
Milke 1958 “Zur inneren Gliederung und geschichtlichen Stellung der ozeanisch-Austronesischen Sprachen”
Nothofer 1975 “The reconstruction of Proto-Malayo-Javanic”
Nunn et al. 2004 “Early Lapita settlement at Bourewa, southwest Fiji”
Ozanne-Rivierre 1992 “The Proto-Oceanic consonantal system and the languages of New Caledonia”
Ozanne-Rivierre 1995 “Structural changes in the languages of Northern New Caledonia”
Pawley 1972 “On the internal relationships of Eastern Oceanic languages”
Pawley 1975 “The relationships of the Austronesian languages of Central Papua”
Pawley 1977 “On redefining 'Eastern Oceanic'”
Pawley 1978 “The New Guinea Oceanic hypothesis”
Pawley 1982a “Rubbishman commoner, big-man chief: linguistic evidence for hereditary chieftainship in Proto-Oceanic society”
Pawley 1985 “Proto-Oceanic terms for ‘person’: a problem in semantic reconstruction”
Pawley 1996a “On the Polynesian subgroup as a problem for Irwin's continuous settlement hypothesis”
Pawley 1996c “On the position of Rotuman”
Pawley 2009 “The role of the Solomon Islands in the first settlement of Remote Oceania: bringing linguistic evidence to an archaeological debate”
Pawley and Green 1984 “The Proto-Oceanic language community”
Pawley and Green 1985 “The Proto-Oceanic language community”
Pawley and Ross 1994 “Austronesian terminologies: continuity and change”
Pawley and Ross 1995 “The prehistory of Oceanic languages: a current view”
Reid 1982 “The demise of Proto-Philippines”
Ross 1988 “Proto Oceanic and the Austronesian languages of western Melanesia”
Ross 1994a “Central Papuan culture history: some lexical evidence”
Ross 1995b “Some current issues in Austronesian linguistics”
Ross 1996a “Is Yapese Oceanic?”
Ross 1996b “On the genetic affiliations of the Oceanic languages of Irian Jaya”
Ross 2010 “Proto Oceanic *kw”
Sand 2001 “Evolutions in the Lapita cultural complex: a view from the Southern Lapita Province”
Sneddon 1984 “Proto-Sangiric and the Sangiric languages”
Specht 2007 “Small islands in the big picture: the formative period of Lapita in the Bismarck Archipelago”
Tryon 1976 “New Hebrides languages: an internal classification”
Tryon 1995 “Comparative Austronesian dictionary”
Tryon and Hackman 1983 “Solomon Islands languages: an internal classification”
Tsuchida 1976 “Reconstruction of Proto-Tsouic phonology”
Walsh and Biggs 1966 “Proto-Polynesian word list 1”
Walter 1989 “Lapita fishing strategies: a review of the archaeological and linguistic evidence”
Zorc 1977 “The Bisayan dialects of the Philippines: subgrouping and reconstruction”
Zorc 1986 “The genetic relationships of Philippine languages”