Steffen Remus
Name: | Dr. Steffen Remus |
Position: | Postdoc |
Email: | steffen.remus (at) uni−hamburg․de |
Phone: | +49 40 42883 2369 |
Fax: | +49 40 42883 2345 |
Office: | F-413 |
Address: |
Universität Hamburg |
![]() |
Greetings, I am a postdoctoral researcher at the language technology group under the guidance of Prof. Dr. Chris Biemann. My main interests are unsupervised methods, applications in compuational linguistics, distributional methods, semantics, (focused) web crawling, information extraction, knowledge induction, etc. I earned my Ph.D. at the Universität Hamburg under the supervision of Prof. Dr. Chris Biemann.
In the past I was a scholarship holder for the KDSL (Knowledge Discovery in Scientific Literature) program and an associate researcher in the AIPHES program, I worked in the BIMDANUBE project and JOIN-T a project that combines ontologies with semantically induced information from text.
Teaching
Thesis Supervision
- Fabian Rausch (2022, MA)
- Frederik Wille (2022, MA)
- Mengy Li (2022, MA)
- Jan Stenzel (2022, MA)
- Teresa Lübeck (2022, BA)
- Maximilian Fischer (2021, BA)
- Gopalakrishnan Venkatesh (2021, MA)
- Tim Fischer (2021, MA)
- Hans Ole Hatz (2020, MA)
- Tim Dobert (2019, MA)
- Rami Aly (2018, BA)
- Tim Fischer (2018, BA)
- Alvin Rindra Fazrie (2018, MA)
- Kai Brusch (2018, MA)
- Joël Harms (2018, BA)
- Ahmed Elshinawi (2018, Independent Study)
- Dominik Sobania (2015, MA)
- Dennis Werner (2015, BA)
Courses @ Universität Hamburg
- Software Engineering 1 - Tutoring Practice Class - Winter 2023/24
- Deep Learning for Natural Language Processing Seminar (DL4NLP) - Winter 2023/24
- Web interfaces for language technology systems (WILPS) - MA practice course + seminar - Summer 2023
- Statistical Methods of Language Technology (SMoLT) - Tutoring Practice Class - Summer 2023
- Applications with Aspects of Language Technology - BA block practice course - Winter 2022/23
- Deep Learning for Unstructured Data - Seminar - Winter 2022/23
- Software Engineering 1 - Tutoring Practice Class - Winter 2022/23
- Machine Learning - Tutoring Practice Class - Summer 2022
- Web interfaces for language technology systems (WILPS) - MA practice course + seminar - Summer 2022
- Bachelorpraktikum 2017: Language Technology and Web Services - Winter 2016/17
- Softwareentwicklung 1 - Tutoring Pratice Class - Winter 2016/17
Courses @ Technische Universität Darmstadt
- Question Answering Technologies Behind IBM Watson - Summer Term 2016
- Algorithms of Language Technology - Practice Class - Summer Term 2016
- Question Answering Technologies Behind IBM Watson - Summer Term 2015
- Workshop on IBM Watson - One day workshop, February 2015
- Algorithms of Language Technology - Practice Class - Summer Term 2015 (Best Mentoring)
- Algorithms of Language Technology - Practice Class - Summer Term 2014
Professional Activities
Organizational / Editorial Activities:
- Shared Task on Hierarchical Classification of Blurbs - GermEval Task 1 (2019)
- 2nd Workshop on Biomedical Information Management: Data-Driven Innovations (2018)
- 1st Workshop on Biomedical Information Management: Challenges and Open Problems (2018)
- Workshop on IBM Watson - One day workshop, February 2015
Programme Committee Memberships / Reviewer Activities:
- AAAI 2022
- NAACL 2021
- LDK 2021
- EACL 2021
- AAAI 2021
- ACL 2020
- AACL-IJCNLP 2020
- KNLP workshop 2020
- TACL 2020
- LREC 2020
- EMNLP 2020
- WAC-XII workshop 2020
- Coling 2020
- CONLL 2020
- AAAI 2020
- TextGraphs workshop 2020
- DGfS 2020
- NLE (Journal) 2019
- NAACL 2019
- KONVENS 2019
- EMNLP 2019
- ECIR 2019
- CONLL 2019
- ACL 2019
- IWCS 2019
- LDK 2019
- CONLL 2018
- EMNLP 2018
- ACL 2018
- *SEM 2018
- ESWC 2018
- TextGraphs workshop 2018
- ISWC 2017
- RANLP 2017
- GSCL 2017
- EMNLP 2017
- *SEM 2017
- EACL 2017
- TextGraphs workshop 2018
- ESWC 2016
- SemEval 2016
- EMNLP 2016
- WAC-X workshop 2016
Publications
- Sevgili, Ö., Remus, S., Jana, A., Panchenko, A., Biemann, C. (2023): Unsupervised Ultra-Fine Entity Typing with Distributionally Induced Word Senses. In Proceedings of the 11th International Conference on Analysis of Images, Social Networks and Texts (AIST), pp. 1--15, Yerevan, Armenia (pdf)
- Geislinger, R., Pourasad, A. E., Gül, D., Djahangir, D., Yimam, S. M., Remus, S., Biemann, C. (2023): Multi-Modal Learning Application - Support Language Learners with NLP Techniques and Eye-Tracking. In Proceedings of the Linguistic Insights from and for Multimodal Language Processing (LIMO) Workshop co-located with the 19th Conference on Natural Language Processing (KONVENS 2023), pp. 1--6, Ingolstadt, Germany (pdf)
- Remus, S. (2023): Domain Defining Context: On Domain-Dependent Corpus Expansion and Contextualized Semantic Structuring. Doctoral Dissertation, U Hamburg (pdf, bibliographic metadata)
- Fischer T., Remus S., Biemann C. (2022): Measuring Faithfulness of Abstractive Summaries. In Proceedings of the 18th Conference on Natural Language Processing (KONVENS), pages 63 – 73, Potsdam, Germany (pdf)
- Remus S., Wiedemann G., Anwar S., Petersen-Frey F., Yimam S. M., Biemann C. (2022): More Like This: Semantic Retrieval with Linguistic Information, In Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), pages 156–166, Potsdam, Germany (pdf).
- Dirk Johannßen, Chris Biemann, Steffen Remus, Timo Baumann, and David Scheffer (2020): GermEval 2020 Task 1 on the Classification and Regression of Cognitive and Motivational style from Text. In Proceedings of the GermEval 2020 Task 1 Workshop in conjunction with the 5th SwissText ¨a; 16th KONVENS Joint Conference 2020, 1–10. Zurich, Switzerland. (pdf,web)
- Jingyuan Feng, Özge Sevgili, Steffen Remus, Eugen Ruppert, and Chris Biemann (2020): Supervised Pun Detection and Location with Feature Engineering and Logistic Regression. In Proceedings of the 5th SwissText & 16th KONVENS Joint Conference 2020, 3:1–6. Zurich, Switzerland. (pdf)
- Varvara Logacheva, Denis Teslenko, Artem Shelmanov, Steffen Remus, Dmitry Ustalov, Andrey Kutuzov, Ekaterina Artemova, Chris Biemann, and Alexander Panchenko. 2020. Word sense disambiguation for 158 languages using word embeddings only. In Proceedings of The 12th Language Resources and Evaluation Conference, 5943–5952. Marseille, France. (pdf,web)
- Steffen Remus, Rami Aly and Chris Biemann (2019): GermEval 2019 Task 1: Hierarchical Classification of Blurbs. In Proceedings of the GermEval 2019 Workshop in Conjunction with the 15th Conference on Natural Language Processing (KONVENS 2019), Erlangen, Germany (pdf,web)
- Gregor Wiedemann, Steffen Remus, Avi Chawla and Chris Biemann (2019): Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings. In Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), Erlangen, Germany (pdf)
- Markus J. Hofmann, Steffen Remus, Chris Biemann, Ralph Radach (2019). Language models can outperform empirical predictability in predicting eye movement data. In Proceedings of the 20th European Conference on Eye Movements (ECEM) 2019, Alicante, Spain (poster-pdf)
- Rami Aly, Steffen Remus and Chris Biemann (2019): Hierarchical Multi-label Classification of Text with Capsule Networks (2019). In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, Florence, Italy (pdf, bib)
- Tim Fischer, Steffen Remus and Chris Biemann (2019): LT Expertfinder: An Evaluation Framework for Expert Finding Methods. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations), Minneapolis, MN, USA (pdf, bib, demo, code)
- Steffen Remus, Hanna Hedeland, Anne Ferger, Kristin Bührig and Chris Biemann (2019): WebAnno-MM: EXMARaLDA meets WebAnno. In Selected papers from the CLARIN Annual Conference 2018. Pisa, Italy (pdf)
- Hanna Hedeland, Steffen Remus, Anne Ferger, Kristin Bührig and Chris Biemann (2019): Annotation gesprochener Daten mit WebAnno-MM. In Die 6. Jahrestagung des DHd e. V. 2019. Frankfurt & Mainz, Deutschland (poster-pdf)
- Steffen Remus, Hanna Hedeland, Anne Ferger, Kristin Bührig and Chris Biemann (2018): EXMARaLDA meets WebAnno. In CLARIN Annual Conference 2018. Pisa, Italy (pdf)
-
Steffen Remus and Chris Biemann (2018): Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities. In Proceedings Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan (pdf,poster-pdf)
- Steffen Remus, Manuel Kaufmann, Kathrin Ballweg, Tatiana von Landesberger, and Chris Biemann. 2017. Storyfinder: Personalized Knowledge Base Construction and Management by Browsing the Web. In Proceedings of the 26th ACM International Conference on Information and Knowledge Management. Singapore, Singapore. pp. 2519-252. (preprint-pdf, poster-pdf, website)
- Markus J. Hofmann, Chris Biemann and Steffen Remus (2017): Benchmarking n-grams, Topic Models and Recurrent Neural Networks by Cloze Completions, EEGs and Eye Movements. In (Eds: B. Sharp, F. Sèdes and W. Lubaszewski): Cognitive Approach to Natural Language Processing, pages 197-215. ISTE Press, Elsevier (link)
- Seid Muhie Yimam, Steffen Remus, Alexander Panchenko, Andreas Holzinger , Chris Biemann. 2017. Entity-Centric Information Access with the Human-in-the-Loop for the Biomedical Domains. In Proceddings of the Biomedical NLP Workshop associated with RANLP 2017. Varna, Bulgaria (pdf)
- Steffen Remus, Gerold Hintz, Darina Benikova, Thomas Arnold, Judith Eckle-Kohler, Christian M. Meyer, Margot Mieskes and Chris Biemann (2016): EmpiriST: AIPHES Robust Tokenization and POS-Tagging for Different Genres. In Proceedings of the 10th Web as Corpus Workshop (WAC-X), Berlin, Germany (pdf)
- Steffen Remus and Chris Biemann (2016): Domain-Specific Corpus Expansion with Focused Webcrawling. In Proceedings Tenth International Conference on Language Resources and Evaluation (LREC 2016), Protorož, Slovenia (pdf)
- Alexander Panchenko, Stefano Faralli, Eugen Ruppert, Steffen Remus, Hubert Naets, Cédrick Fairon, Simone Paolo Ponzetto and Chris Biemann (2016): TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling. In Proceedings of the 10th International Workshop on Semantic Evaluation, San Diego, CA, USA (pdf)
- Omer Levy, Steffen Remus, Chris Biemann and Ido Dagan (2015): Do Supervised Distributional Methods Really Learn Lexical Inference Relations? In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Denver, CO, USA (pdf, bib)
- Chris Biemann, Steffen Remus and Markus J. Hofmann (2015): Predicting word ’predictability’ in cloze completion, electroencephalographic and eye movement data. In Proceedings of the 12th International Workshop on Natural Language Processing and Cognitive Science. Krakow, Poland (pdf)
- Jinseok Nam, Christian Kirschner, Zheng Ma, Nicolai Erbs, Susanne Neumann, Daniela Oelke, Steffen Remus, Chris Biemann, Judith Eckle-Kohler, Johannes Fürnkranz, Iryna Gurevych, Marc Rittberger and Karsten Weihe (2014): Knowledge Discovery in Scientific Literature. In Proceedings of the 12th Konferenz zur Verarbeitung natürlicher Sprache (KONVENS 2014). Hildesheim, Germany (pdf)
- Dirk Goldhahn, Steffen Remus, Uwe Quasthoff and Chris Biemann (2014): Top-Level Domain Crawling for Producing Comprehensive Monolingual Corpora from the Web. In Proceedings of the LREC-14 workshop on Challenges in the management of large corpora. Reykjavik, Iceland (pdf)
- Steffen Remus (2014): Unsupervised Relation Extraction of In-Domain Data from Focused Crawls. In Proceedings of the Student Research Workshop at the 14th Conference of the European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden (pdf, bib)
- Steffen Remus and Chris Biemann (2013): Three Knowledge-Free Methods for Automatic Lexical Chain Extraction. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Atlanta, GA, USA (pdf, bib)