In: SMS Communication: A linguistic approach
Edited by Louise-Amélie Cougnon and Cédrick Fairon
[Benjamins Current Topics 61] 2014
pp. 11–28
Seek&Hide
Anonymising a French SMS corpus using natural language processing techniques
Published online: 8 July 2014
https://doi.org/10.1075/bct.61.03acc
This article presents Seek&Hide, a text message processing tool developed for the sud4science LR project (http://www.sud4science.org/). The tool anonymises (de-identifies) a corpus; so far it has been used on the sud4science LR corpus of French text messages collected during the project. Anonymisation proceeds in two phases. In the first phase, the system automatically processes over 70% of the corpus. In the second phase, the remainder is processed with the help of an expert annotator, who works through a web interface designed specifically to simplify the task.
