Projects

Past Projects

Optimality Theory (OT) is the central paradigm in current theorizing about phonological acquisition. OT is a deductive model: (a priori) linguistic knowledge is represented in the child's linguistic (grammatical) competence. In this project we explore an empiricist, inductive alternative to this approach. An empiricist, inductive model is defined as a model in which the mental lexicon is central in acquisition. Linguistic knowledge is collected and stored in the lexicon. The contrast between...
01/10/2000 - 30/09/2004
In this project we study auditory development and speech and language acquisition in congenitally deaf children who received a cochlear implant (CI) during their second year of life. Our aim is to systematically investigate the effect of the CI on different aspects of language and speech development: the effect of a CI on the auditory level; the effect of a CI on the articulatory level (the speech); and the effect of a CI on language acquisition and communicative development. In essence, we...
01/10/2000 - 31/12/2006
The aim of this project is (i) to automatically find important new topical "concepts" in unrestricted text, and (ii) to track the evolution of the connotation and definition of such concepts through time in newspaper text and on the WWW. From a scientific point of view, this project investigates the usefulness of the combination of statistical and information-theoretic techniques used in information retrieval and statistical natural language processing, and of language engineering components such as...
01/08/2000 - 31/07/2003
This project addresses the issue of abstract representations in the domains of language acquisition and adult language processing. Is it possible to learn a subdomain of language without prior linguistic knowledge in this domain? Can one achieve the final learning stage (adult performance) without developing abstract representations? A new methodology will be used to study these questions. The research will explicitly combine the techniques that are used in three separate disciplines:...
01/07/1998 - 31/12/2003
cgn
01/06/1998 - 30/11/2003
The two national varieties of Dutch, spoken in the Netherlands and Belgium respectively, differ in a considerable number of features. Phonetic differences are among the most stable ones. This project has three distinct aims: (i) the collection of data representative of speech in the two countries; (ii) description of the phonetic variation; (iii) evaluation of the variation. The analysis will be carried out beyond the traditional limitations of the segmental level, and will encompass both auditory and acoustic...
01/10/1997 - 30/09/2001
One common feature of child-directed speech involves the use of diminutive derivations as in Patty, froggy, or bootie. Although the pragmatic and semantic features of diminutives seem to be universal (Jurafsky, 1996), across languages there is much variability in the frequency of diminutive usage. For example, diminutives are pervasive in the child-directed speech registers of Dutch, Russian, and Spanish but are rare in those of English and German. To some extent, these differences can be...
Dynamic interfaces for corporate medicine
The aims of this project are: (i) investigation of the earliest stages of morphophonological acquisition and development; (ii) crosslinguistic studies of early morphophonological development. The following languages are represented in the project: Basque, Korean, Croatian, Dutch, Estonian, French, Georgian, German, Greek, Hebrew, Huichol, Hungarian, Italian, Lithuanian, Maltese, Polish, Russian, Slovenian, Spanish, Swedish, Thai, Tunisian Arabic, Turkish, Ukrainian, Yucatec Maya.
lcg
The focus of the network Learning Computational Grammars (LCG) is the investigation of ways to improve computational grammars by applying machine learning techniques to current best practice in Computational Grammar. LCG seeks improvement through the application of a range of machine learning techniques, including both symbolic and statistical techniques. The scientific goal is to provide a characterization of the algorithms capable of learning (important fragments of) language. This responds...
In this project we investigate the application of natural language processing to the domain of art and graphic design. The MBSP shallow parser is adapted in the context of visual communication and integrated in the NodeBox software application.
METIS aims to develop techniques for bootstrapping Machine Translation on the basis of (linguistically annotated) monolingual corpora. CLiPS is a subcontractor of CCL (Leuven) in this project and provides shallow parsing technology for English and Dutch, in cooperation with the ILK research group of Tilburg University.
The project aims at processing the morphological part of the Goeman-Taeldeman corpus. In June 2005, the first volume of the MAND was published. Topics: pluralisation, diminutivisation and gender.
Based on an analysis of the availability of the various resources needed for Dutch Language and Speech Technology, priority was assigned to the development of those missing resources that are known to be vital for various types of applications and that can be developed with the current state of the art. The priority lists were submitted to the Dutch Language Union Platform for Language and Speech Technology.
This project investigates the acquisition of relative adjectives (e.g. big, old, warm) by Dutch-speaking children. Semanticists have shown that relative adjectives evoke open scales with the pivotal region – norm – in the middle. Due to the relevance of a class-specific norm, we may call a cat big and an elephant small, even though a cat is much smaller than an elephant. Open scale structure explains why relative adjectives cannot be used in combinations like helemaal làng 'completely long'....
Development of a parallel corpus English - Swahili
The aim of this project is to investigate segmental, intrasyllabic and intersyllabic co-occurrence patterns in prelexical babbling, and the acquisition of phonological segments and patterns in the early lexical period. Longitudinal data of deaf children with a cochlear implant (implanted in the first/second year of life) will be compared with those of an age-matched hearing cohort in order to establish whether they develop language in the same sequence and according to the same patterns as hearing...
This project investigates the potential integration of two Artificial Intelligence domains by examining the problematic role of syntax within both lines of research. Syntactic research within the subfield of Memory Based Reasoning is concerned with optimising two classification tasks: classification of segmentation (delimiting constituents) and classification of disambiguation (assigning grammatical labels). The robotic experiments that are being conducted within the Origins of Language...