Project management

Producing spoken corpora in field conditions


Course description

This course explains how to record, transcribe, align and annotate spoken data.

The course will proceed in two steps:

  • (i) familiarizing students with field recording equipment, and having them produce video as well as audio material in field-like conditions;
  • (ii) importing, aligning, segmenting and annotating the resulting material in ELAN, drawing also on electronic lexical resources (e.g. dictionaries created with FLEx / Toolbox).

This course introduces students to methods for developing spoken corpora in near-field conditions, and for annotating them with the ELAN platform in conjunction with other tools commonly used in field linguistics, such as Praat.

Prerequisite: being enrolled in a Master's program in the human sciences and humanities, with a project that involves collecting spoken fieldwork data.

Students will work with actual field recordings collected in various Indigenous communities in Australia, and will have to reproduce some of the annotation procedures taught in the course (e.g., given an existing transcription of the source text and its translation, they will segment the source text and provide a morphological gloss using a dictionary and morphological tables).

Instructor

Patrick Caudal, CNRS researcher (CR), Laboratoire de linguistique formelle, Université Paris Cité

Course dates

Completed sessions

Session: Producing spoken corpora in field conditions

Location: Campus Rive Gauche, Bâtiment Olympe de Gouges, 3rd floor, room 357

Start: 12/01/2023 09:30

End: 13/01/2023 17:00

Comment:

January 12 and 13, 2023

Session times: 9:30 a.m. to 12:30 p.m. and 2 p.m. to 5 p.m.