Invited Talk by Yu Wang on Tuesday, May 05, 15:00
# Title
Human and Model-generated Text Detection: Methods and Linguistic Phenomenon
# Abstract
To enhance the trustworthiness of Large Language Models (LLMs), there are two areas we should continue to work on. The first is the development of robust methods that make users aware of LLM involvement in daily life, such as LLM-based explanations. Human and machine-text detection method, with sufficient ethic consideration, can allow users to determine the degree to which content should be trusted. The second is involving human users in the co-construction of explanations for LLM outputs. In my presentation, I will emphasize the first area and present my previous and current work on revealing the differences between human language and model-generated language. First, I hypothesize that human language is constrained by cognitive resources, which creates distinct patterns that differentiate it from model-generated text. Based on this idea, a method called FourierGPT is developed to detect model-generated text. In follow-up work, I examine how human–human dialogue and human–LLM dialogue differ at the syntactic level, and I use these syntactic features to detect LLM-involved dialogue. Finally, I present an analytical study showing that the periodicity of information in human and LLM-generated text differs significantly.
# Bio
Yu Wang is a final-year PhD student in Computational Linguistics at Bielefeld University. He holds an MSc degree in Artificial Intelligence from KU Leuven and an MA degree in Linguistics from Tohoku University. His general research focus is dialogue modelling, with particular emphasis on human–human and human–agent interaction. He is currently working on two DFG-funded projects within TRR 318: one on monitoring understanding in interaction, and the other on communicative practices of requesting information and explanations from LLM-based agents.
Referent/Referentin
Termin
05. Mai 202615:00 - 16:02
Ort
Raum: 1101.F138Welfengarten 1
30167 Hannover