News & Events

Talk "The VESUM Dictionary as a Source of Ukrainian Morphological Data for NLP, Corpora, and Humans", 21 May 2025, 14:15 (hybrid: in person or via Zoom)

Speaker:

Dr. Vasyl’ Starko (L’viv)

Venue: Slavisches Seminar, Werthmannstr. 14, 1st floor, room 01004, and via Zoom

Workshop “Regex in R for Multimodal Analysis”, 23 May 2025, 9:30 – 16:30, venue: DH Lab, University of Freiburg

Instructor: PD Dr. Christoph Rühlemann

Language of instruction: English

Venue: DH Lab, University of Freiburg

Registration: by 9 May 2025, to chrisruehlemann@gmail.com

Regex in R for Multimodal Analysis

Many researchers in Conversation Analysis and Interactional Linguistics painstakingly produce transcripts and annotate them by hand in ELAN, a software tool for multimodal annotation. Few researchers, however, seem to know how to post-process this immensely valuable data in a way that allows synthesis, aggregation, transformation, large-scale analysis, and visualization. This workshop presents some solutions for these tasks, implemented in R and based on regular expressions (regex), a syntax used to match, select, edit, and extract data and to reshape data frames based on regularities (patterns) in the data.

The full-day workshop will cover these topics:

– Basics of R and regex (Crash Course)

– Convert (multimodal) transcripts to R data frames to make them machine-readable

– Analyze (multimodal) transcripts in R data frames

– Export and structure ELAN annotations in R data frames

– Analyze and visualize ELAN annotations in R data frames

The workshop is open to beginners and more experienced users of R. To give everybody a running start, it begins with a crash course on the key syntactic elements of regex and some key R functions before turning to their use in Multimodal Analysis. The workshop aims for maximum relevance to the participants: before the workshop (no later than 9 May 2025), participants are encouraged to tell the instructor which specific tasks they wish to accomplish with their data. Concrete solutions to these tasks can then be discussed during the workshop.

The maximum number of participants is 25; the registration deadline is 9 May 2025. To register (and, if you want, to share your data), please contact: chrisruehlemann@gmail.com

Workshop “Natural Language Processing (NLP)”, 27 June 2025, 9:00 – 18:00, Haus zur Lieben Hand, main hall (großer Saal)

Instructors:

Daidalos project, HU Berlin: Dr. Andrea Beyer, Konstantin Schulz, Florian Deichsler

University of Freiburg: Prof. Dr. Stefan Tilg, Carolin Giere

Language of instruction: German

Venue: Haus zur Lieben Hand, main hall (großer Saal)

Registration: by 6 June 2025, to carolin.giere@altphil.uni-freiburg.de

Natural Language Processing (NLP)

The workshop is held jointly with the Daidalos group, which is primarily dedicated to developing and applying NLP (Natural Language Processing) methods in Classical Philology (https://www.klassphil.hu-berlin.de/de/forschung-und-projekte/projekte/projekte-fachuebergreifend/dfg-projekt-daidalos). Possible uses of LLMs (Large Language Models) will also be discussed. The workshop aims to enable researchers and students to work independently with the methods presented and to give them starting points for their own research.

There will be two talks, “Digital Classics und KI” [Digital Classics and AI] and “NER: Von lexikonbasiert bis Deep Learning vs. chatbasiert mit einem LLM” [NER: from lexicon-based to deep learning vs. chat-based with an LLM], as well as a hands-on part on concrete NER methods suited to the participants' research interests. Anyone interested, from any field of research, is expressly welcome to attend!

“english-corpora.org – New perspectives on the corpus-based study of linguistic and cultural trends”, Tuesday, 8 July 2025, 10:00 c.t., lecture hall (HS) 1221

Speaker:

Mark Davies (emeritus, BYU, Provo, Utah, USA) created the world's largest and most widely used collection of English-language online corpora (https://www.english-corpora.org/) and has also built large corpora for Spanish and Portuguese (https://www.corpusdelespanol.org/; https://www.corpusdoportugues.org/).

Language of instruction: English

Venue: KG I, lecture hall (HS) 1221

Registration: not required.

“‘Fireside talk’: Corpora and AI/LLMs”, Tuesday, 8 July 2025, 16:00 c.t., DH Lab, HS 1026

Speaker:

Language of instruction: English

Venue: DH Lab, room 1026

Registration: not required.
