This page needs JavaScript! Please enable it to continue.

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Date	Wednesday, 23rd January 2019
Location

veranstalter: David Lukes
ansprechpartner: Christina Meuser, Dennis Dressel
email: contact@hpsl.uni-freiburg.de
web:
institution: HPSL
language: Englisch
location institution: Freiburg
date_raw: 23.-25. Januar 2019
date_sort: 23.01.2019, 00:00:00

While it’s not necessary to know how corpus software works in order to use it, having a high-level idea of the entire process, from raw data to what happens when you type a query into a search interface, can help you become a power user. Providing you with such a general idea is the goal of this workshop. We’ll cover the following topics:

technical background: how text is represented inside a computer (file formats, plain text, character sets and encodings)
adding annotation: metadata (author, year of publication…), morphological tagging
corpus query systems: what’s their purpose (why not directly search the plain text files?), how they work behind the scenes, standard formats

The concepts will be illustrated with practical examples using the corpus query systems Corpus Workbench, (No)SketchEngine and ANNIS, and other related tools. By the end of the workshop, you should have a better intuition for what can and cannot be achieved using corpora, and you should also be better equipped to deal with the technical pitfalls of conducting corpus research.

Congratulations on the Publication of former scholarship holder Joelle Loew’s PhD.: We congratulate Dr. Joelle Loew on the publication of her PhD in the book series Routledge Research...
HPSL Alumna Miriam Neuhausen erhält Auszeichnung des International Council for Canadian Studies: Miriam Neuhausen ...
HPSL PhD Scholarship holder Mizuki Koda successfully defends her doctoral thesis: On the 12th of November 2025, HPSL doctoral candidate and scholarship holder Mizuki Koda successfully...

Der Hermann-Paul Preis 2025 ging an Robert Reinecke. Herzlichen Glückwunsch! /// The Hermann Paul Award 2025 went to Robert Reinecke. Congratulations!

Der Hermann-Paul Preis 2022 ging an Aline Bieri und Florian Dreyer. Herzlichen Glückwunsch! /// The Hermann Paul Award 2022 went to Aline Bieri and Florian Dreyer. Congratulations!

Der Hermann-Paul Preis 2019 ging an Emiel van den Hoven. Herzlichen Glückwunsch! /// The Hermann Paul Award 2019 went to Emiel van den Hoven. Congratulations!

Der Hermann-Paul Preis 2018 ging an Verena Schröter und Hanna Svensson. Herzlichen Glückwunsch! /// The Hermann Paul Award 2018 went to Verena Schröter and Hanna Svensson. Congratulations!

This page needs JavaScript! Please enable it to continue.

Hermann Paul School of Linguistics
Basel - Freiburg (i.Br.)

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Information for applicants

News

Upcoming Events

Search member

PhD Scholarships

Hermann-Paul-Preis für herausragende Dissertationen

Annual Reports & Newsletter

Current Research

This page needs JavaScript! Please enable it to continue.

Hermann Paul School of Linguistics Basel - Freiburg (i.Br.)

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Information for applicants

News

Upcoming Events

Search member

PhD Scholarships

Hermann-Paul-Preis für herausragende Dissertationen

Annual Reports & Newsletter

Current Research

Hermann Paul School of Linguistics
Basel - Freiburg (i.Br.)