Contract type: Internship
Duration: 6 months
Large text collections contain valuable knowledge that is difficult to find without appropriate visual text analytics software. The Papyrus software developed at LIST provides interactive visualizations to help analysts explore large text collections at various levels of details and focus on a manageable set of documents to read in detail.
The internship will focus in particular on enhancing term tree visualizations available in Papyrus by investigating various term alignment strategies, inspired from state-of-the-art sequence alignment algorithms. The intern will also run user experiments to evaluate whether term alignment improves the interpretability of term trees.