A computational approach to early language bootstr.. (BOOTPHON)
A computational approach to early language bootstrapping
Start date: Nov 1, 2012,
End date: Oct 31, 2017
"During their first year of life, infants become attuned to the phonemes, words and phonological rules of their language, with little or no adult supervision. After 30 years of accumulated experimental results, we are still lacking an account for the puzzling fact that these 3 interdependent components of language are acquired not sequentially, but in parallel. Drawing tools from Machine Learning and Automatic Speech Recognition, we construct a model of this early process, test it on 2 large spontaneous speech databases (Japanese, French and Dutch) and test its predictions in infants using behavioral, EEGs and fNIRS techniques.1. Coding. We study different ways of defining coding features for speech, from fine-grained to coarse grained, in view of the automatic discovery of a hierarchy of linguistic units. We compare this with a systematic study of the units of speech coding as they unfold in 6, 9 and 12 month old infants..2. Lexicon. Infants recognize some words before they know the phonemes of their language; we modify existing word segmentation algorithms so they can work on raw speech. We test the unique prediction that infants start with a large lexicon that’s quite different from the adult one.3. Rules. Phonemes are produced as overlapping, coarticulated gestures. To untangle these context effects, we use a predictive model of coarticulation in auditory space and invert it. We test when and how infants perform reverse coarticulation.4. Integration. The above subprojects provide only an initial bootstrapping into approximate phonemes, words, and contextual rules. We show how to iteratively integrate these approximate representations to derive better ones. The outcome will be numerically assessed on an adult directed and infant directed speech database, and compared to those of to state-of-the-art supervized phoneme recognizers. The predictions will be tested in infants learning artificial languages and in a longitudinal study."
Get Access to the 1st Network for European Cooperation