publications

My research spans corpus methodology, discourse analysis, and their application across disciplinary boundaries.

Discourse, methodology and meaning

2026

  1. Regular Expressions for Corpus Queries
    Nathan Dykes, and James Tauber
    In Encyclopedia of Language and Linguistics, 2026
    in press

2024

  1. From Linguistic to Discursive Patterns: Introducing Discoursemes as a Basic Unit of Discourse Analysis
    Philipp Heinrich, Andreas Blombach, Nathan Dykes, and 4 more authors
    CADAAD Journal, 2024
  2. Automatic Identification of COVID-19-Related Conspiracy Narratives in German Telegram Channels and Chats
    Philipp Heinrich, Andreas Blombach, Bao Minh Doan Dang, and 5 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024

2020

  1. Reconstructing argumentation patterns in German newspaper articles on multidrug-resistant pathogens: a multi-measure keyword approach
    N. Dykes, and Joachim Peters
    Journal of Corpora and Discourse Studies, 2020

2019

  1. Metaphors in German newspaper articles on multidrug-resistant bacteria in clinical contexts, 1995–2015: A computer-assisted study
    Joachim Peters, N. Dykes, Mechthild Habermann, and 2 more authors
    Metaphor and the Social World, 2019

Argumentation mining

2024

  1. Leveraging High-Precision Corpus Queries for Text Classification via Large Language Models
    Nathan Dykes, Stephanie Evert, Philipp Heinrich, and 2 more authors
    In Proceedings of the First Workshop on Language-driven Deliberation Technology (DELITE) @ LREC-COLING 2024, 2024
  2. Finding Argument Fragments on Social Media with Corpus Queries and LLMs
    Nathan Dykes, Stephanie Evert, Philipp Heinrich, and 2 more authors
    In Conference on Advances in Robust Argumentation Machines, 2024

2022

  1. Retrieving Twitter argumentation with corpus queries and discourse analysis
    Nathan Dykes, Philipp Heinrich, and Stephanie Evert
    In Broadening the Spectrum of Corpus Linguistics, 2022

2021

  1. Argument parsing via corpus queries
    N. Dykes, Stefan Evert, Merlin Göttlinger, and 2 more authors
    it-Information Technology, 2021

2020

  1. Reconstructing arguments from noisy text: Introduction to the RANT project
    N. Dykes, Stefan Evert, Merlin Göttlinger, and 2 more authors
    Datenbank-Spektrum, 2020

Applications in the legal and medical domain

2024

  1. DIREGA–Building Decision Support for German Register Law
    Axel Adrian, Osman Anil Basaran, Nathan Dykes, and 8 more authors
    In Legal Knowledge and Information Systems, 2024

2022

  1. Präsentation von Palliativstationen und SAPV-Teams im Internet–eine korpusbasierte Metaanalyse von Webseiten
    Joachim Peters, N. Dykes, Maria Heckel, and 2 more authors
    Zeitschrift für Palliativmedizin, 2022

2021

  1. Anonymisierung von Gerichtsurteilen–Eine wesentliche Voraussetzung für E-Justice
    Axel Adrian, Stefan Evert, Michael Keuchen, and 2 more authors
    Jusletter IT (May 2021), 2021
  2. Die palliativmedizinische Fachkultur in Geschichte und Gegenwart–sprachwissenschaftliche Perspektiven
    Joachim Peters, and N. Dykes
    Sprache und Wissen, 2021

2020

  1. Kompetenzdarstellung, Patientennähe und Argumentationsstrategien von Internetangeboten deutscher Hospize, Palliativstationen und SAPV-Teams–eine korpusbasierte Meta-Analyse
    J. Peters, N. Dykes, C. Ostgathe, and 2 more authors
    Zeitschrift für Palliativmedizin, 2020

2019

  1. A linguistic model of communication types in palliative medicine: effects of multidrug-resistant organisms colonization or infection and isolation measures in end of life on family caregivers’ knowledge, attitude, and practices
    Joachim Peters, N. Dykes, Maria Heckel, and 1 more author
    Journal of palliative medicine, 2019

Corpora, resources and infrastructure

2023

  1. A pipeline for the creation of multimodal corpora from YouTube videos
    Nathan Dykes, Anna Wilson, and Peter Uhrig
    In Proceedings of the 1st Workshop on Linguistic Insights from and for Multimodal Language Processing, 2023
  2. Studying time conceptualisation via speech, prosody, and hand gesture: interweaving manual and computational methods of analysis
    Peter Uhrig, Elinor Payne, Irina Pavlova, and 7 more authors
    In Gesture and Speech in Interaction Conference (GeSpIn 2023), 2023

2020

  1. EmpiriST corpus 2.0: Adding manual normalization, lemmatization and semantic tagging to a German web and CMC corpus
    Thomas Proisl, N. Dykes, Philipp Heinrich, and 3 more authors
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, 2020
  2. A corpus of German Reddit exchanges (GeRedE)
    Andreas Blombach, N. Dykes, Philipp Heinrich, and 2 more authors
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, 2020

2019

  1. A new German Reddit corpus
    Andreas Blombach, N. Dykes, Stefan Evert, and 3 more authors
    In Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), 2019
  2. The_illiterati: Part-of-speech tagging for Magahi and Bhojpuri without even knowing the alphabet
    Thomas Proisl, Peter Uhrig, Andreas Blombach, and 4 more authors
    In Proceedings of The First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019-Short Papers, 2019

2026

  1. Regular Expressions for Corpus Queries
    Nathan Dykes, and James Tauber
    In Encyclopedia of Language and Linguistics, 2026
    in press

2024

  1. From Linguistic to Discursive Patterns: Introducing Discoursemes as a Basic Unit of Discourse Analysis
    Philipp Heinrich, Andreas Blombach, Nathan Dykes, and 4 more authors
    CADAAD Journal, 2024
  2. Automatic Identification of COVID-19-Related Conspiracy Narratives in German Telegram Channels and Chats
    Philipp Heinrich, Andreas Blombach, Bao Minh Doan Dang, and 5 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024
  3. Leveraging High-Precision Corpus Queries for Text Classification via Large Language Models
    Nathan Dykes, Stephanie Evert, Philipp Heinrich, and 2 more authors
    In Proceedings of the First Workshop on Language-driven Deliberation Technology (DELITE) @ LREC-COLING 2024, 2024
  4. Finding Argument Fragments on Social Media with Corpus Queries and LLMs
    Nathan Dykes, Stephanie Evert, Philipp Heinrich, and 2 more authors
    In Conference on Advances in Robust Argumentation Machines, 2024
  5. DIREGA–Building Decision Support for German Register Law
    Axel Adrian, Osman Anil Basaran, Nathan Dykes, and 8 more authors
    In Legal Knowledge and Information Systems, 2024

2023

  1. A pipeline for the creation of multimodal corpora from YouTube videos
    Nathan Dykes, Anna Wilson, and Peter Uhrig
    In Proceedings of the 1st Workshop on Linguistic Insights from and for Multimodal Language Processing, 2023
  2. Studying time conceptualisation via speech, prosody, and hand gesture: interweaving manual and computational methods of analysis
    Peter Uhrig, Elinor Payne, Irina Pavlova, and 7 more authors
    In Gesture and Speech in Interaction Conference (GeSpIn 2023), 2023

2022

  1. Retrieving Twitter argumentation with corpus queries and discourse analysis
    Nathan Dykes, Philipp Heinrich, and Stephanie Evert
    In Broadening the Spectrum of Corpus Linguistics, 2022
  2. Präsentation von Palliativstationen und SAPV-Teams im Internet–eine korpusbasierte Metaanalyse von Webseiten
    Joachim Peters, N. Dykes, Maria Heckel, and 2 more authors
    Zeitschrift für Palliativmedizin, 2022

2021

  1. Argument parsing via corpus queries
    N. Dykes, Stefan Evert, Merlin Göttlinger, and 2 more authors
    it-Information Technology, 2021
  2. Anonymisierung von Gerichtsurteilen–Eine wesentliche Voraussetzung für E-Justice
    Axel Adrian, Stefan Evert, Michael Keuchen, and 2 more authors
    Jusletter IT (May 2021), 2021
  3. Die palliativmedizinische Fachkultur in Geschichte und Gegenwart–sprachwissenschaftliche Perspektiven
    Joachim Peters, and N. Dykes
    Sprache und Wissen, 2021

2020

  1. Reconstructing argumentation patterns in German newspaper articles on multidrug-resistant pathogens: a multi-measure keyword approach
    N. Dykes, and Joachim Peters
    Journal of Corpora and Discourse Studies, 2020
  2. Reconstructing arguments from noisy text: Introduction to the RANT project
    N. Dykes, Stefan Evert, Merlin Göttlinger, and 2 more authors
    Datenbank-Spektrum, 2020
  3. Kompetenzdarstellung, Patientennähe und Argumentationsstrategien von Internetangeboten deutscher Hospize, Palliativstationen und SAPV-Teams–eine korpusbasierte Meta-Analyse
    J. Peters, N. Dykes, C. Ostgathe, and 2 more authors
    Zeitschrift für Palliativmedizin, 2020
  4. EmpiriST corpus 2.0: Adding manual normalization, lemmatization and semantic tagging to a German web and CMC corpus
    Thomas Proisl, N. Dykes, Philipp Heinrich, and 3 more authors
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, 2020
  5. A corpus of German Reddit exchanges (GeRedE)
    Andreas Blombach, N. Dykes, Philipp Heinrich, and 2 more authors
    In Proceedings of the Twelfth Language Resources and Evaluation Conference, 2020

2019

  1. Metaphors in German newspaper articles on multidrug-resistant bacteria in clinical contexts, 1995–2015: A computer-assisted study
    Joachim Peters, N. Dykes, Mechthild Habermann, and 2 more authors
    Metaphor and the Social World, 2019
  2. A linguistic model of communication types in palliative medicine: effects of multidrug-resistant organisms colonization or infection and isolation measures in end of life on family caregivers’ knowledge, attitude, and practices
    Joachim Peters, N. Dykes, Maria Heckel, and 1 more author
    Journal of palliative medicine, 2019
  3. A new German Reddit corpus
    Andreas Blombach, N. Dykes, Stefan Evert, and 3 more authors
    In Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), 2019
  4. The_illiterati: Part-of-speech tagging for Magahi and Bhojpuri without even knowing the alphabet
    Thomas Proisl, Peter Uhrig, Andreas Blombach, and 4 more authors
    In Proceedings of The First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019-Short Papers, 2019