Exploratory analysis of text collections through visualization and hybrid biclustering
N. Médoc, M. Ghoniem, and M. Nadif
in Machine Learning and Knowledge Discovery in Databases European Conference, ECML PKDD 2016, Riva del Garda, Italy, September 19-23, 2016, Proceedings, Part III, Lecture Notes in Computer Science, vol. 9853, B. Berendt, B. Bringmann, E. Fromont, G. Garriga, P. Miettinen, N. Tatti, and V. Tresp (Eds.), pp. 59-62, 2016
We propose a visual analytics tool to support analytic journalists in the exploration of large text corpora. Our tool combines graph modularity-based diagonal biclustering to extract high-level topics with overlapping bi-clustering to elicit fine-grained topic variants. A hybrid topic treemap visualization gives the analyst an overview of all topics. Coordinated sunburst and heatmap visualizations let the analyst inspect and compare topic variants and access document content on demand.
doi: 10.1007/978-3-319-46131-1_13