Modeling Noun-Phrases Dynamics in Specialized Text Collections - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Journal of Quantitative Linguistics Année : 2010

Modeling Noun-Phrases Dynamics in Specialized Text Collections

Résumé

The science of biology has entered a new era with new approaches for information processing frameworks and high-throughput experiments. This has led to a high rate of publication production and the emergence of large accessible databases in English, permitting the creation of text collections in any specialized domain. To process such text data, systematic analysis of language properties is helpful and benefits from a distribution description. In this article, firstly, as scientific publications are time-stamped we can analyse distribution profiles of noun-phrases (i.e. “content-words”) over time. Hence, time-dependency analysis of noun-phrases reveals interesting specific behaviour taking into account sequential occurrence of features. Single content-word distributions appear to be linearly shaped. We also observed that the association of content-words is distributed in a different way over time, i.e. as a mixed beta distribution.

Domaines

Sociologie
Fichier non déposé

Dates et versions

hal-02054488 , version 1 (01-03-2019)

Identifiants

Citer

Nicolas Turenne. Modeling Noun-Phrases Dynamics in Specialized Text Collections. Journal of Quantitative Linguistics, 2010, 17 (3), pp.212-228. ⟨10.1080/09296174.2010.485447⟩. ⟨hal-02054488⟩
29 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More