An open-ended computational construction grammar for Spanish verb conjugation

This web demonstration accompanies the following paper:

Beuls, K. (2017). An open-ended computational construction grammar for Spanish verb conjugation. Constructions and Frames.

Abstract: The Spanish verb phrase can take on many forms, depending on the temporal, aspectual and modal interpretation that a speaker wants to convey. At least half a dozen constructions work together to build or analyze even the simplest verb form such as hablo 'I speak'. This paper documents how the complete Spanish verb conjugation system can be operationalised in a computational construction grammar formalism, namely Fluid Construction Grammar. Moreover, it shows how one can, starting from a seed grammar handling regular morphology and grammar, create a productive grammar that can capture systematicity in Spanish verb conjugation that can expand its construction inventory when new verbs are encountered

Explanations on how to interact with an FCG web demonstration can be found here.


Menu:

WD-1. Grammar design

WD-2. Verb conjugation.

WD-3. Stem changes.


WD-1. Grammar design

The grammar fragment that this demo describes targets the Spanish verb phrase, uniquely focussing on the conjugated verb form. Rather than storing all verb forms individually as holistic chunks, the FCG grammar contains productive rules in the form of constructions, working together to build or analyse a single conjugation. For a simple form such as cenabas , 'you had dinner', the grammar makes use of six constructions to map the verb form to the following meaning representation:

gstruct0(event-perspectiveunbounddinner-event-1)struct2(simultaneouspoint-1dinner-event-1)struct0:DINNERdashEVENTdash11->struct2:DINNERdashEVENTdash13struct1(time-pointrecalled-pointpoint-1)struct1:POINTdash12->struct2:POINTdash13struct4(eaterdinner-event-1you-1)struct2:DINNERdashEVENTdash13->struct4:DINNERdashEVENTdash15struct3(quantitysingletonyou-1)struct3:YOUdash14->struct4:YOUdash15struct5(personyouyou-1)struct4:YOUdash15->struct5:YOUdash16struct6(activityeatdinner-event-1)struct4:DINNERdashEVENTdash15->struct6:DINNERdashEVENTdash17

Note: Because we focus on the conjugation of individual verb forms, the grammar fragment only considers intransitive verb meanings. Therefore every verb, including transitive or ditransitive verbs, has only a single role in its meaning representation.

A constituency grammar approach is used to model conjugated verb forms in this grammar fragment. The resulting transient structure after producing the above meaning network looks as follows:

The utterance that can be extracted from this transient structure is ''cen aba s''. Yet, the transient structure contains more units than just the three form bearing morpheme units. Three additional units have been created. The first one collects the stem morpheme and the two suffix morpheme units into a verb unit, containing agreement information and the tense-aspect-mood characterization of the form, together with the meaning of the form and the order of the morphemes (in the form feature). Another unit is made for the unmarked subject, which is a second singular person in this case. Together, the subject and the verb unit are the constituents of the intransitive clause unit.

A transient structure is the result of a repeated application process of constructions. In the case of cenabas six constructions did their work and they can be split up into four different types:

WD-1-1. Lexical constructions

These constructions can be compared to a dictionary look-up for a given verb. They contain features such as lex-class, verb-class, sem-class, lemma and meaning. The lexical construction for cenar 'to dine, to have dinner' is included here below. You can now inspect the full construction by clicking on the ⊕ sign and reading the explanations below.

A lexical construction for a verb lemma consists of two units on the conditional part: the ?cenar-verb unit and the ?cenar-stem unit. The stem unit is going to be a constituent of the verb unit, together with potential suffixes (to be added later). In production, all this lexical construction matches on is the meaning feature in the root unit (accessed by the HASH operator). In comprehension, on the other hand, we expect two units to be there already, linked by a constituency relation, with the stem unit bearing the lemma ``cenar''. The contributing part will merge information into the two units, such as a lex-class, a verb-class (set to 1st verb class, since the lemma is ending on -ar ) and initialises the tense-aspect-mood features with variables. The semantic class is inherited from the meaning feature, and the agent of the activity is linked to the eater role.

WD-1-2. Stem constructions

Lexical constructions do not contain actual word forms that will be encountered in sentences. This is the task of the stem constructions. For a default stem such as cen- , the corresponding stem construction maps the lemma ''cenar'' into a string feature ''cen'':

If you inspect the above construction, you will see that there is a red feature inside the phon-cat feature. This colour highlighting indicates a negation of the stem-realized feature, meaning that the construction can only apply in production if the phon-cat of the stem unit does NOT contain a (stem-realized +) feature. Such a precondition is needed because this is the construction for the base stem of the verb, and thus constitutes a sort of default.

WD-1-3. Suffix constructions

To make the Spanish verb phrase modular, a verb form is analysed into three parts: a verb stem (see above), a tense-aspect-mood suffix and an agreement suffix (e.g. cen -aba -s ). The -abas suffix is split into two parts: a morpheme that indicates the tense/aspect/mood of the verb form and a morpheme for the person/number information. The construction here below indeed matches on a verb unit in the transient structure that has a past tense, imperfective aspect and indicative mood of the first verb class (in production); or an -aba suffix immediately adjacent to the verb stem (in comprehension). The contributing part of the construction creates a suffix unit, which is a constituent of the verb unit, and a sibling of the stem unit.

The person/number information is added by the s-2sg-morph construction included here below. Its preconditions in production are the following: a second singular agreement feature, a verb of any verb class that is not future and not perfective. It is also a requirement that the verb unit already has two constituent units. When these conditions are met, the construction makes the verb stem unit finite, as well as creating a second suffix unit that follows the verb stem unit (but does not have to be directly adjacent to it, hence the use of the precedes feature).

WD-1-4. Grammatical constructions

Two remaining constructions are involved in the processing of a form such as cenabas : the intransitive second singular construction and the past-imperfective construction. The first one matches on a verb unit with a second singular agreement feature in comprehension and merges a corresponding meaning representation into its structure. Also, a new unit, the VP unit is created that has two dependent units: the verb stem unit itself, as well as the unmarked subject unit.

Finally, the past imperfective constructions maps a meaning representation onto a syntactic configuration of the tense, aspect and mood features. No contributing part is required.

Back to menu


WD-2. Verb conjugation.

Let us now turn to the working of these constructions in an actual production setting. First in line are the constructions that carve out a part of the meaning that needs to be expressed: the cenar-lex, the 2sg-covert-subject-cxn and the past-imperfective-indicative-cxn. Then, the morphological constructions can work on this and attach actual forms to these meanings, through the expression of syntactic features.

You can inpect each step in the construction application process by clicking twice on the green nodes. The resulting transient structure is shown at the bottom.


Formulating 

gstruct0(time-pointrecalled-pointpoint-1)struct1(simultaneouspoint-1dinner-event-1)struct0:POINTdash11->struct1:POINTdash12struct2(event-perspectiveunbounddinner-event-1)struct1:DINNERdashEVENTdash12->struct2:DINNERdashEVENTdash13struct3(activityeatdinner-event-1)struct2:DINNERdashEVENTdash13->struct3:DINNERdashEVENTdash14struct4(eaterdinner-event-1you-1)struct3:DINNERdashEVENTdash14->struct4:DINNERdashEVENTdash15struct5(personyouyou-1)struct4:YOUdash15->struct5:YOUdash16struct6(quantitysingletonyou-1)struct5:YOUdash16->struct6:YOUdash17

Applying
in formulation


initial structure
application process
applied constructions
resulting structure

Utterance: "cen aba s"


Let us inspect the comprehension process of a closely related verb form, cen-aba , which can both be used to say 'I had dinner' or 'he/she had dinner'. In comprehension, the order of construction application is different. First the morphological constructions apply, as they match on string features that are present in the root unit. Then, the syntactic features that these constructions have added for tense and agreement for instance, are conditions for the grammatical constructions to apply.

Two solutions are provided when parsing cen-aba . The lack of an overt person/number ending can both be interpreted as a first person singular or a third person singular. There is one construction that covers both cases: no-marker-1/3sg-morph. This construction is part of a special construction set (default), that is consulted only after the regular morphological constructions could not apply. Please click on the construction to inspect its features in more detail.


Comprehending "cen aba" (all solutions)


Computing all solutions for application of
in comprehension


initial structure
application process
solution 1
applied constructions
resulting structure
solution 2
applied constructions
resulting structure

Parsed meanings

meaning 1

gstruct0(personme?agent-48)struct2(eater?ev-25?agent-48)struct0:varAGENTdash481->struct2:varAGENTdash483struct1(activityeat?ev-25)struct1:varEVdash252->struct2:varEVdash253struct3(simultaneous?point-17?ev-25)struct2:varEVdash253->struct3:varEVdash254struct4(event-perspectiveunbound?ev-25)struct3:varEVdash254->struct4:varEVdash255struct5(time-pointrecalled-point?point-17)struct3:varPOINTdash174->struct5:varPOINTdash176

meaning 2

gstruct0(personhe-or-she?agent-48)struct2(eater?ev-25?agent-48)struct0:varAGENTdash481->struct2:varAGENTdash483struct1(activityeat?ev-25)struct1:varEVdash252->struct2:varEVdash253struct3(simultaneous?point-17?ev-25)struct2:varEVdash253->struct3:varEVdash254struct4(event-perspectiveunbound?ev-25)struct3:varEVdash254->struct4:varEVdash255struct5(time-pointrecalled-point?point-17)struct3:varPOINTdash174->struct5:varPOINTdash176

Back to menu


WD-3. Stem changes.

Verb conjugation in Spanish, as other Romance languages, is characterized by a large number of verb paradigms according to which specific infinitives are conjugated. Rather than simply combining the verb stem with one or two suffixes, the actual verb form can often be the result of transformations following from stress patterns or phonetic assimilation processes.

To account for multiple possible realizations of a verb stem (given the same lemma), a range of morphological allostructions needs to be defined, which compete with each other. For instance, for the verb cocer 'to cook', four stem allostructions are implemented:

Please expand these stem constructions now. You will see that in comprehension, the stem form itself will decide which of the four stem constructions will match: coc- , cuec-, cuez- or coz- . The real difficulty lies in production, where the correct form needs to be selected. The three irregular stem constructions is always tried first, before the default cocer-base-stem construction. If an irregular stem construction can match, it will add a (stem-realized +) to the stem-unit, which will then prevent the default construction to match (see negated feature).

Let us look at these irregular stem constructions in action.

WD-3-1. cuez-o 'I cook'

The first verb form that will be produced is cuez-o 'I cook', where the stem receives primary stress and is therefore diphthongized. Stress in Spanish words always falls on the penultimate syllable. Moreover, the final consonant of the original stem -c- turns into a -z- because it is followed by a back vowel (-o).


Formulating 

gstruct0(time-pointpresent-pointpoint-1)struct1(simultaneouspoint-1cook-event-1)struct0:POINTdash11->struct1:POINTdash12struct2(apply-heatcookcook-event-1)struct1:COOKdashEVENTdash12->struct2:COOKdashEVENTdash13struct3(cookcook-event-1me-1)struct2:COOK3->struct3:COOK4struct2:COOKdashEVENTdash13->struct3:COOKdashEVENTdash14struct4(personmeme-1)struct3:MEdash14->struct4:MEdash15

Applying
in formulation


initial structure
application process
applied constructions
resulting structure

Utterance: "cuez o"

The construction that is responsible for the application of the correct stem construction is the stress-cxn-1 construction. If you click on the construction here below, you will see that it is a production-only construction that is looking for a verb unit with a stem and a suffix unit as its constituents, which are directly adjacent to each other. Moreover, the suffix-unit cannot be a tense-aspect-mood morpheme (i.e. not an infix) and cannot carry primary stress. If these conditions are met, the construction will add a (primary-stress +) feature to the stem unit, which is a feature that is part of the production lock of the pensar-piens-stem construction.


WD-3-2. cuec-e-s 'You cook'

Another irregular stem of the verb cocer is cuec-. This stem is used in the production of 'you cook', cuec-e-s:


Formulating 

gstruct0(time-pointpresent-pointpoint-1)struct1(simultaneouspoint-1cook-event-1)struct0:POINTdash11->struct1:POINTdash12struct2(apply-heatcookcook-event-1)struct1:COOKdashEVENTdash12->struct2:COOKdashEVENTdash13struct3(cookcook-event-1you-1)struct2:COOK3->struct3:COOK4struct2:COOKdashEVENTdash13->struct3:COOKdashEVENTdash14struct4(personyouyou-1)struct3:YOUdash14->struct4:YOUdash15struct5(quantitysingletonyou-1)struct4:YOUdash15->struct5:YOUdash16

Applying
in formulation


initial structure
application process
applied constructions
resulting structure

Utterance: "cuec e s"

A second stress construction is used now, which is specialized on verb forms with two suffixes following the stem. You can click on it to inspect its production locks. It will apply when the person-number suffix does not contain a vowel and the tense-aspect-mood suffix does not receive primary stress.


WD-3-3. coz-a-mos 'We cook' (subjunctive)

A less frequently used irregular stem of the verb cocer is coz-. It only occurs in three slots of the verb conjugation paradigm: in the first plural and second plural of the subjunctive present and in the imperative first plural. Let us look at the subjunctive form 'we cook', coz-a-mos:


Formulating 

gstruct0(time-pointpresent-pointpoint-1)struct1(simultaneouspoint-1cook-event-1)struct0:POINTdash11->struct1:POINTdash12struct2(apply-heatcookcook-event-1)struct1:COOKdashEVENTdash12->struct2:COOKdashEVENTdash13struct3(cookcook-event-1you-1)struct2:COOK3->struct3:COOK4struct2:COOKdashEVENTdash13->struct3:COOKdashEVENTdash14struct4(personweyou-1)struct3:YOUdash14->struct4:YOUdash15struct5(event-moodsubjunctivecook-event-1)struct3:COOKdashEVENTdash14->struct5:COOKdashEVENTdash16

Applying
in formulation


initial structure
application process
applied constructions
resulting structure

Utterance: "coz a mos"


Back to menu