![]() The text message was 'born' just over two decades ago, and the general public has been communicating via SMS (Short Message Service) between mobile telephones for almost one and a half decades. Keywords: French SMS corpus design compilation anonymisation transcoding annotation mediated electronic discourse discourse analysis. Selectionandpeer-reviewunderresponsibilityofCILC2013. The corpus will be made available for researchers, students and the general public in 2014. The researchers have started linguistic analyses of language practices and sociolinguistic data contained in the questionnaire. In 2011, 90,000 authentic text messages in French were collected from the general public by academics in the south of France in the context of the sud4science LR project ( and This paper retraces the design of the corpus collation, including linking text-message data with usage, before proceeding with a three-step semiautomatic anonymisation process, transcoding of the SMS data into "standardised" French, and an optional linguistic annotation phase. Praxiling UMR 5267 & Université Paul-Valéry Montpellier 3, Route de Mende 34199 Montpellier cedex 5, France ![]() Procedia - Social and Behavioral Sciences 95 (2013) 96 - 104ĥth International Conference on Corpus Linguistics (CILC2013)Ī Large SMS Corpus in French: from Design and Collation to Anonymisation, Transcoding and Analysis
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |