The CFMf Topic-Modeling Method Based on Neural Clustering with Feature Maximization: Comparison with LDA

Lamirel, J.C; Lareau, F et Malaterre, C (2023). « The CFMf Topic-Modeling Method Based on Neural Clustering with Feature Maximization: Comparison with LDA », dans 19th International Conference on Scientometrics & Informetrics (International Society for Scientometrics and Informetrics (I.S.S.I.)., Bloomington, États-Unis, 2023-07) Bloomington, États-Unis, pp. 253-259.

Fichier(s) associé(s) à ce document :
[img]
Prévisualisation
PDF
Télécharger (2MB)

Résumé

Mining the content of scientific publications is increasingly used to investigate the practice of science and the evolution of research domains. Topic-models, among which LDA, have notably been shown to provide rich insights into the thematic content of disciplinary fields, their structure and evolution through time. However, improving topic modeling methods remains a major concern. Here we propose an alternative topic-modeling approach based on neural clustering and feature maximization with F1-measure (in short: CFMf). We compare the performance of this approach to LDA by applying both methods to a reference corpus of full-text philosophy of science articles (N=16,917). The results show significant improvements along key quantitative performance measures such as coherence, independently of the number of topics. Qualitative comparisons also show improvements in the consistency of topics and their interpretability in light of expert knowledge. We discuss these promising results and highlight upcoming research work.

Type: Communication, article de congrès ou colloque
Mots-clés ou Sujets: topic modeling
Unité d'appartenance: Faculté des sciences humaines > Département de philosophie
Déposé par: Christophe Malaterre
Date de dépôt: 17 avr. 2026 08:04
Dernière modification: 17 avr. 2026 08:04
Adresse URL : https://archipel.uqam.ca/secure/id/eprint/19700

Statistiques

Voir les statistiques sur cinq ans...