Lecture by Keli Du (Trier) in the lecture series “Praxis der Digital Humanities”

Evaluation of LDA Topic Modeling in Digital Humanities - State of Research, Method, Results

Keli Du

Date:

01.12.2021

Place:

16:15 – 17:45 Uhr

digital via zoom

https://hu-berlin.zoom.us/j/98682719801

 

Categories:

Event

Contact:

Keli Du
The talk by Keli Du presents the systematic evaluation of LDA Topic Modeling and takes place in the context of the Digital History Research Colloquium at the Humbold University in Berlin.

The lecture series “Praxis der Digital Humanities” will take place within the framework of the Master 'Digital Humanities'. It is open to all interested parties, also from other courses or from outside the university, and will be designed as a video conference with lecture and discussion possibility.

As a quantitative text analytic method, Latent Dirichlet Allocation (LDA), often referred to as Topic Modeling, has been widely used in Digital Humanities in recent years to explore numerous unstructured textual data. However, Topic Modeling itself has not yet been conclusively explored due to its complexity in terms of, for example, the algorithms or parameter settings used. It is also often observed when using Topic Modeling that this method is sensitive to the setting of the respective parameters. Therefore, LDA Topic Modeling is also often strongly criticized. In this talk, the systematic evaluation of LDA Topic Modeling will be presented. The goal of the evaluation is to understand the influence of the crucial factors on Topic Modeling from two perspectives, namely Topic Modeling-based document classification and Topic Coherence.

A special feature of this year's program is that some of the talks will be inserted into the series program not only digitally, but virtually, so to speak. These talks take place online in other event contexts. Instead of repeating them in our program, we get to be guests at the other lecture series.We thank our colleagues in Berlin (Digital History, Prof. Dr. Torsten Hiltmann) and Dresden (Applied Linguistics, Prof. Dr. Simon Meier-Vieracker) for their cooperation.


Projects: Zeta and Company

Keywords: Dissemination and Community Building in the DH