Prof. Dr. Margot Mieskes

Professor for research and economic data

Member of the dkmi Steering Committee

International Representative

Short profile

Margot Mieskes has been Professor of Information Science at Darmstadt University of Applied Sciences since 2015.

From 2013 to 2015 she worked as a post-doctoral researcher at the Institute for Educational Research and Educational Information (DIPF) in Frankfurt, in cooperation with the Ubiquitous Knowledge Processing (UKP) Lab at TU Darmstadt.

From 2008 to 2013 she worked in industry, including at EML.

She completed her doctorate at EML Research (now the Heidelberg Institute for Theoretical Studies) under the supervision of Michael Strube.

Margot Mieskes studied computational linguistics in Stuttgart, Edinburgh and Cambridge.

About the person

Research focus

Application of natural language processing methods in other domains (e.g. education research, psychotherapy, financial market analysis, teaching support)
Automatic summarization of spoken and written natural language data
Evaluation of automatic summarization methods
Reproducibility, transparency and ethical issues in the NLP/CL field
Information extraction from informal language data (e.g. Web 2.0 content) or transcribed spoken language
Creation of corpora and annotations
Evaluation of annotations and evaluation metrics

Projects

2022 - 2023

ReproHum - Investigating Reproducibility of Human Evaluations in NLP

Joint research project on replication of manual evaluation led by the University of Aberdeen.

2021 - 2022

BigScience research workshop on large multilingual models and datasets

Creation of large multilingual language models, coordinated by Hugging Face; Chair of the Social Impact Meta Group.

2018 - 2019

VALERIE – Evaluation of learning therapy

Funding source: Central Research Funding of the Darmstadt University of Applied Sciences

Funding programme: University-internal funding

2017 – 2018

PARANOIA – Psychotherapy using natural language processing based on computational aid

Co-applicant and coordinator

Funded by: Hessian Ministry of Science and the Arts (HMWK)

Funding programme: Research for Practice

2015 – 2019

AIPHES - Adaptive Preparation of Information from Heterogeneous Sources

DFG Research Training Group, co-applicant and associate professor

Funding source: German Research Foundation (DFG)

2008 - 2012

High Quality Voice User Interaction

Funded by: Klaus Tschira Foundation

2004 - 2007

DIANA-Summ: Dialogue, Anaphors, Summarization

Funding source: DFG individual research grant

Scientific and artistic activities

Invited lectures

2022

Panelist at the Workshop on Insights from Negative Results, held at ACL 2022, on the topic “How Bad are Annotation Disagreements, Really?”

AIT Austrian Institute of Technology, Lecture Series, invited lecture, May 2022, “Ethics in Natural Language Processing”

2018

International Symposium on Language Technology for Individualized Language Learning and
Assessment, Univ. of Duisburg-Essen, 01–02 October 2018 – Invited Talk “Summarization Evaluation meets Short-Answer Grading”

2017

New Frontiers in Social Media Research – International Summer School 2017, Duisburg, 18–22
September 2017 – Invited Lecture “Reliability of Methods in NLP”

SwissText, 9 June 2017 – Keynote Lecture “Computer, Summarize Service Records”

Reviewing activities

2023

Tutorial Chair ACL 2023

2020

- SwissText 2020

- EMNLP 2020 – Main Conference

- EMNLP 2020 – Demo Track

- EMNLP 2020 – Ethics Review Board

- ACL 2020

- IJCAI 2020

- LREC 2020

- REPROLANG 2020

- AAAI 2020

- AACL-IJCNLP

- NUSE 2020

- Presentation of a tutorial on "Reviewing NLP" at the ACL; positive review and approval of the tutorial.

2018

Language Resources and Evaluation Conference (LREC), Miyazaki (Japan), 7–12 May 2018. She also presented her research results at this conference.

Widening NLP – Second WiNLP workshop, New Orleans (USA), 1 June 2018.

Language Resources and Evaluation (Journal)

International Symposium on “Language Technology for Individualized Language Learning and Assessment”, 2 October 2018.

Workshops

Tutorial Chair for workshops at the conference "The 61st Annual Meeting of the Association for Computational Linguistics" in Toronto, Canada, 9–14 July 2023

Event organisation

2023

Co-organiser of the workshop Teaching NLP 2023

Co-organiser of the workshop Teaching German NLP 2023 at KONVENS 2023

2021

Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Prizes and awards

2020

Prof. Dr. Margot Mieskes was named "outstanding reviewer" at the 58th Annual Meeting of the Association for Computational Linguistics in 2020.

Member of the following scientific committees:

NAACL, ACL, SwissText, BEA, AAAI, EMNLP, ACL Professional Conduct Committee

Publications

2023

Manon Reusens, Philipp Borchert, Margot Mieskes, Jochen De Weerdt, Bart Baesens: Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques. Accepted to EMNLP 2023 main conference, https://doi.org/10.48550/arXiv.2310.10310, 16 October 2023

Nadine Probol and Margot Mieskes. 2023. Emotions in Spoken Language - Do we need acoustics? In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 71–84, Toronto, Canada. Association for Computational Linguistics.

Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M Alonso-Moral, Mohammad Arvan, Jackie Cheung, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D Kelleher, Filip Klubicka, Huiyuan Lai, Chris van der Lee, Emiel van Miltenburg, Yiru Li, Saad Mahamood, Margot Mieskes, Malvina Nissim, Natalie Parde, Ondřej Plátek, Verena Rieser, Pablo Mosteiro Romero, Joel Tetreault, Antonio Toral, Xiaojun Wan, Leo Wanner,
Lewis Watson, Diyi Yang: Missing information, unresponsive authors, experimental flaws:
The impossibility of assessing the reproducibility of previous human evaluations in NLP. arXiv preprint arXiv:2305.01633, 02.05.2023.

Margot Mieskes, Jacob Benz: h_da@ReproHum – Reproduction of Human Evaluation and Technical Pipeline. In: Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems, HumEval’23.

2022

Mieskes, Margot (2022). Replicability under near-perfect conditions – a case-study from automatic
summarization. In Proceedings of the Third Workshop on Insights from Negative Results in NLP,
pages 165–171, Dublin, Ireland. Association for Computational Linguistics

2021

Tuggener, D., Mieskes, M., Deriu, J., and Cieliebak, M. (2021). Are we summarizing the
right way? A survey of dialogue summarization data sets. In Proceedings of the Third Workshop on
New Frontiers in Summarization, pages 107–118, Online and in Dominican Republic. Association
for Computational Linguistics

Jurgens, D., Kolhatkar, V., Li, L., Mieskes, M., and Pedersen, T. (2021). Teaching NLP. In
Proceedings of the 2021 Annual Conference of the North American Chapter of the Association for
Computational Linguistics. to appear

Cohen, K., Fort, K., Mieskes, M., Névéol, A., and Gold, A. (2021). Reviewing natural language
processing research. In Proceedings of the 16th Conference of the European Chapter of the
Association for Computational Linguistics: Tutorial Abstracts, Online. Association for Computational
Linguistics

2020

Cohen, K., Fort, K., Mieskes, M., and Névéol, A. (2020). Reviewing natural language processing
research. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics:
Tutorial Abstracts, pages 16–18, Online. Association for Computational Linguistics

Mieskes, M., Loza Mencía, E., and Kronsbein, T. (2020). A data set for the analysis of text
quality dimensions in summarization evaluation. In Proceedings of the 12th Language Resources
and Evaluation Conference, pages 6690–6699, Marseille, France. European Language Resources
Association

Tauchmann, C. and Mieskes, M. (2020). Language agnostic automatic summarization evaluation.
In Proceedings of the 12th Language Resources and Evaluation Conference, pages 6656–6662,
Marseille, France. European Language Resources Association

Tauchmann, C., Daxenberger, J., and Mieskes, M. (2020). The influence of input data complexity
on crowdsourcing quality. In Proceedings of the 25th International Conference on Intelligent User
Interfaces Companion, IUI ’20, pages 71–72, New York, NY, USA. Association for Computing
Machinery

2019

Mieskes, M. and Padó, U. (2019). Summarization Evaluation meets Short-Answer Grading.
In Proceedings of the 8th Workshop for Natural Language Processing for Computer-Assisted
Language Learning (NLP4CALL), Turku, Finland, 30 September 2019

Blazevic, M., Börner, I., Komander, M., and Mieskes, M. (2019). 2019 germeval shared task on
offensive tweet detection h_da submission. In Proceedings of the GermEval 2019 Shared Task on
the Identification of Offensive Language (GermEval 2019), Erlangen, Germany, 8 October 2019

Mieskes, M., Fort, K., Névéol, A., Grouin, C., and Cohen, K. (2019). Community perspective on
replicability in natural language processing. In Proceedings of the Conference on Recent Advances
in Natural Language Processing (RANLP 2019), Varna, Bulgaria, 2 – 4 September 2019

Mieskes, M. and Schmunk, S. (2019). OCR Quality and NLP Preprocessing. In Third Workshop
on Widening NLP (WiNLP 2019), Florence, Italy, 28 July 2019. non-archival publication

Preisler, B., Mieskes, M., and Becker, C. (2019). Bitcoin value and sentiment expressed in tweets.
In Proceedings of the Fourth Swiss Text Analytics Conference, Winterthur, Switzerland, 18 – 19
June 2019

2018

Mieskes, M. and Padó, U. (2018). Work smart - reducing effort in short-answer grading. In
Proceedings of the 7th Workshop for Natural Language Processing for Computer-Assisted Language
Learning (NLP4CALL), Stockholm, Sweden, 07 November 2018

Mieskes, M. and Shutyi, S. (2018). Emotionality in Patients and Therapists speaking German. In
Second Workshop on Widening NLP (WiNLP 2018), New Orleans, USA, 01 June 2018. Unarchived
Paper

Tauchmann, C., Arnold, T., Hanselowski, A., Meyer, C. M., and Mieskes, M. (2018). Beyond
Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous
Data. In Proceedings of the Eleventh International Conference on Language Resources and
Evaluation (LREC 2018), Miyazaki, Japan, 7 – 12 May 2018

Mieskes, M. and Stiegelmayr, A. (2018). Preparing Data from Psychotherapy for Natural Language
Processing. In Proceedings of the Eleventh International Conference on Language Resources and
Evaluation (LREC 2018), Miyazaki, Japan, 7 – 12 May 2018

Siegel, M. and Mieskes, M. (2018). Information science education in darmstadt. In Proceedings of
the Future of Education in Information Science – International EINFOSE Symposium, Pisa, Italy

2017

Stiegelmayr, A. and Mieskes, M. (2017). Using argumentative structure to grade persuasive
essays. In GSCL 2017 – Language Technologies for the Challenges of the Digital Age

Mieskes, M. (2017a). A Quantitative Study of Data in the NLP community. In Proceedings of
the First ACL Workshop on Ethics in Natural Language Processing, pages 23–29, Valencia, Spain.
Association for Computational Linguistics

Schulz, K., Mieskes, M., and Becker, C. (2017). h-da Participation at Germeval Subtask B:
Document-level Polarity. In Proceedings of the GermEval 2017: Shared Task on Aspect-based
Sentiment in Social Media Customer Feedback, Berlin, Germany

Mieskes, M. (2017b). How Machines understand Speech. Babel – The Language Magazine, (20).
Pull-Out Poster

2016

Benikova, D., Mieskes, M., Meyer, C. M., and Gurevych, I. (2016). Bridging the gap between
extractive and abstractive summaries: Creation and evaluation of coherent extracts from
heterogeneous sources. In Proceedings of the 26th International Conference on Computational
Linguistics (COLING), Osaka, Japan, December 13–16 2016, pages 1039–1050

Remus, S., Hintz, G., Benikova, D., Arnold, T., Eckle-Kohler, J., Meyer, C. M., Mieskes, M., and
Biemann, C. (2016). EmpiriST: AIPHES Robust Tokenization and POS-Tagging for Different
Genres. In Proceedings of the 10th Web as Corpus Workshop (WAC-X), pages 106–114

Meyer, C. M., Benikova, D., Mieskes, M., and Gurevych, I. (2016). MDSWriter: Annotation
tool for creating high-quality multi-document summarization corpora. In Proceedings of the 54th
Annual Meeting of the Association for Computational Linguistics (ACL 2016): System Demonstrations,
Berlin, Germany, 7–12 August 2016, pages 97–102. Association for Computational Linguistics

2015

Henß, S., Mieskes, M., and Gurevych, I. (2015). A reinforcement learning approach for adaptive
single- and multi-document summarization. In International Conference of the German Society for
Computational Linguistics and Language Technology (GSCL-2015), Duisburg-Essen, Germany,
30 September – 2 October 2015, pages 3–12

Further publications under:

ACL Anthology

Further websites

Prof. Dr. Margot Mieskes - Google Scholar

dkmi member

Prof. Dr. Margot Mieskes

Communication
Max-Planck-Straße 2
64807 Dieburg
Office: F14, 39F

+49.6151.533-69418
margot.mieskes@h-da.de
