James Lee

M. Dubremetz

Computational Linguist, PhD


Contact Me

About Me

I am a computational linguist based in Uppsala. This is a professional website. I have neither WhatsApp nor Facebook; the best way to contact me is by email. If you want to learn about my personal interests, you can always look at the Hobby section. Wondering what I am up to? Have a look at my now page.
Last Update: 2021-01-15.

Projects List

Mycroft voice assistant

Mycroft is an open source platform for voice-controlled home assistance. It is like Alexa or Cortana, except that it is made by the open source community and respects privacy. I created my own Mycroft skill to turn a projector on and off via SSH and a Raspberry Pi. While installing Mycroft and developing the skill, I also contributed bug fixes and corrections to the public documentation.

2021-04-22

View on github


Lunch.uppsala.ai

This personal project uses computational linguistics, web scraping, and image recognition to present each day's lunch menus in Uppsala. It is 100% open source. Want to know what's for lunch today but too lazy to search all the restaurants? Go to https://lunch.uppsala.ai.

2021-04-13

Link to website
View on gitlab


VTML4Sublime

VTML is a language that helps tune synthetic voices for more natural reading. Writing a VTML file can be repetitive, so I created VTML tag shortcuts for the Sublime Text 3 editor.

2021-01-05

View on github


ACL anthology

In this professional project, carried out for the Association for Computational Linguistics, I helped archive old conference papers. This project represents: 200+ inconsistent webpages from the 2000s converted into CSV files; 3,000+ PDFs inserted into the new scientific database; and 10,000+ names of scientists, article titles, and PDF links scraped and normalised.

2021-01-06

View on github


Mustache website

This website has been completely rewritten to use Mustache templating. This language lets you separate HTML from content, and thus edit your website with a simple YAML file.
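
The separation of HTML from content can be illustrated with a toy example. The sketch below is not the code behind this website and not a real Mustache implementation: it only imitates the basic `{{key}}` variable tag with a regular expression, and a plain dictionary stands in for the content that would be loaded from a YAML file. The template text, the keys `name` and `tagline`, and the function name `render` are all made up for illustration.

```python
import html
import re

# Hypothetical template: the HTML lives here, with no content in it.
TEMPLATE = "<h1>{{name}}</h1>\n<p>{{tagline}}</p>"

# Hypothetical content: stands in for what would be read from a YAML file.
CONTENT = {"name": "Chiasmus detector", "tagline": "Rhetoric & code"}


def render(template, content):
    """Replace each {{key}} tag with the HTML-escaped value from `content`.

    Missing keys become empty strings. Sections, partials and the other
    Mustache tag types are not handled by this toy version.
    """
    def substitute(match):
        return html.escape(str(content.get(match.group(1), "")))

    return re.sub(r"\{\{\s*(\w+)\s*\}\}", substitute, template)


page = render(TEMPLATE, CONTENT)
print(page)
```

Editing the site then means changing only the values in `CONTENT`; the template stays untouched. A real setup would use an existing Mustache library rather than this regular expression.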

2021-01-02

View on gitlab


Chiasmus detector

I designed a tool for literary analysis that detects the figure of speech called chiasmus.
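
To give an idea of what a chiasmus of words looks like to a program, here is a minimal sketch that lists every A ... B ... B ... A word pattern in a text. It is a naive candidate extractor written for illustration, not the tool itself: the function name, the tiny stopword list, and the default window size are all assumptions. Most such criss-cross repetitions in real text are accidental, so a usable detector still has to decide which candidates are genuine figures.

```python
import re

# Hypothetical, deliberately tiny stopword list; a real one would be longer.
STOPWORDS = {"a", "an", "and", "for", "in", "of", "the", "to"}


def chiasmus_candidates(text, window=30, stopwords=STOPWORDS):
    """Return (a, b, positions) for every A ... B ... B ... A word pattern.

    The two occurrences of A must lie fewer than `window` tokens apart;
    the default value is an arbitrary assumption.
    """
    tokens = re.findall(r"\w+", text.lower())
    found = []
    for a1, a in enumerate(tokens):
        if a in stopwords:
            continue
        # The closing A needs room for at least two tokens (B, B) before it.
        for a2 in range(a1 + 3, min(len(tokens), a1 + window)):
            if tokens[a2] != a:
                continue
            # Look for an inner pair B ... B strictly between the two As.
            for b1 in range(a1 + 1, a2 - 1):
                b = tokens[b1]
                if b == a or b in stopwords:
                    continue
                for b2 in range(b1 + 1, a2):
                    if tokens[b2] == b:
                        found.append((a, b, (a1, b1, b2, a2)))
    return found


print(chiasmus_candidates("One for all, all for one."))
```

With the stopword "for" filtered out, the only candidate in this sentence is the pair "one" / "all" at token positions 0, 2, 3 and 5.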

2018-01-20

View on github


(Ep)anaphora/Epiphora detector

Martin Luther King used one in his famous speech in Memphis. Epanaphora is the figure that consists in starting successive sentences with the same words. Epiphora is the same, but at the end of the sentences. I made a detector for both.
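
The two definitions can be turned into a small sketch: compare adjacent sentences and count how many words they share at the start (epanaphora) or at the end (epiphora). This is an illustration under simple assumptions, not the detector described in the thesis: the sentence splitter is naive, only neighbouring sentences are compared, and the function names and the `min_words` threshold are made up for the example.

```python
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def words(sentence):
    """Lowercased word tokens with punctuation removed."""
    return re.findall(r"\w+", sentence.lower())


def shared_prefix_len(a, b):
    """Number of leading items that two lists have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def find_repetitions(text, min_words=2, at_end=False):
    """Return (repeated words, sentence, next sentence) for adjacent sentences
    sharing at least `min_words` words at the start, or at the end when
    `at_end` is True."""
    sentences = split_sentences(text)
    hits = []
    for first, second in zip(sentences, sentences[1:]):
        a, b = words(first), words(second)
        if at_end:
            # Reversing both lists turns a shared ending into a shared prefix.
            a, b = a[::-1], b[::-1]
        n = shared_prefix_len(a, b)
        if n >= min_words:
            shared = a[:n][::-1] if at_end else a[:n]
            hits.append((" ".join(shared), first, second))
    return hits


print(find_repetitions("We will read the text. We will count the words. Then we stop."))
print(find_repetitions("She asked for the truth. He waited for the truth.", at_end=True))
```

The first call reports the shared opening "we will" for the first two sentences; the second reports the shared ending "for the truth".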

2018-01-20

Link to thesis


CoreNLP French Lemmatizer

I made an (unofficial) script to add lemmatization to the XML output of Stanford CoreNLP.

2018-02-28

View on github



Publications

Journal Article

Dubremetz, Marie and Nivre, Joakim (2018) Rhetorical Figure Detection: Chiasmus, Epanaphora, Epiphora. Frontiers in Digital Humanities. 5:10. doi: 10.3389/fdigh.2018.00010

PDF
BIB
Journal

Long Article

Dubremetz, Marie and Nivre, Joakim (2016). Syntax Matters for Rhetorical Figure Detection: the Case of Chiasmus. In Computational Linguistics for Literature (CLFL 2016). San Diego, United States.

PDF
BIB
Video

Talk

Litteræ et Linguæ. Rhetoric Workshop, Uppsala, Sweden (February 2016).

Slides

Long Article

Dubremetz, Marie and Nivre, Joakim (2015). Rhetorical Figure Detection: the Case of Chiasmus. In Computational Linguistics for Literature (CLFL 2015). Denver, United States.

PDF
BIB
Slides

Long Article

Dubremetz, Marie and Nivre, Joakim (2014). Extraction of nominal multiword expressions in French. In Proceedings of the 10th Workshop on Multiword Expressions (MWE). Gothenburg, Sweden.

PDF
BIB

Long Article

Dubremetz, Marie (2013). Vers une identification automatique du chiasme de mots. In Actes de la 15e Rencontres des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL’2013), (pages 150–163). Les Sables d’Olonne, France.

PDF
BIB
Slides

Teaching

Grundläggande textanalys 2016

Basic text analysis. Lectures given in English. I taught computational linguistics techniques such as lemmatisation, tokenization, finite-state transducers, HMMs, and more. I was the main person responsible for this course, with duties such as managing the schedule, designing and grading assignments, giving individual support to students, and preparing labs. The audience was a heterogeneous group of first-year bachelor's students. I developed pedagogical tools such as team-based MCQs and flash cards.

Recordings

Below you can see the recording of the second talk I gave on chiasmus detection.