VU University Amsterdam

Department Member, Faculty of Arts

research assistant, computational linguist

About

Research interests

Lexicology, Terminology, Semantics, Cognitive linguistics, Corpus Linguistics and Computational Linguistics.
Research

Project DutchSemCor

Most NLP applications that depend on Word Sense Disambiguation (WSD), such as machine translation, question answering, summarization and terminology extraction, require large sense-tagged corpora along with lexical databases to reach satisfactory results. The number of English language resources has increased in recent years, but data for other languages remains scarce. Dutch is no exception: there is little semantically annotated corpus material with which to train WSD systems. To overcome this data bottleneck, the DutchSemCor project aims to deliver a one-million-word Dutch corpus that is fully tagged with senses and domain tags from the Cornetto lexical database. 250K examples in this corpus are being tagged manually. The remainder will be tagged automatically using three different WSD systems and validated by human annotators. The corpus data is based on existing material collected in the projects CGN, D-CoI and SoNaR. These corpora were parsed and tagged in previous projects and will be extended where necessary to find sufficient examples of less frequent word senses that do not appear in them. At the time of writing, the project is in a preliminary phase: we have begun annotating our corpus material for Dutch nouns.

Project Dutch Terminology Service Centre

The Dutch Terminology Service Centre (Steunpunt Nederlandstalige Terminologie) was founded in 2007 by the Dutch Language Union (Nederlandse Taalunie), a Dutch-Flemish government institution. It functions as a non-commercial information centre for all aspects of terminology and serves the entire Dutch-speaking community. We give advice on terminology research to anyone involved in terminology-related work (companies, organisations, translators, terminologists, teachers, scientists, etc.). Our organisation also maintains the website NedTerm (www.nedterm.org). NedTerm provides information on terminology activities and includes a bibliography of terminological works, an overview of terminology training courses in the Netherlands and Flanders, and information on standardisation issues. We are particularly keen to provide practical information on, and links to, on-line resources (terminological databases, information platforms) and to promote language tools, especially open source ones, that can be used for terminology work. We also organise terminology training sessions and workshops for translators, as well as master classes for scientists and language experts. Apart from organising in-depth study days, we offer practical solutions to our target group. One example is the development of web courses for terminology management. The courses on ad hoc terminology and systematic terminology management are both aimed at translators, translation agencies, terminologists, language professionals and master's students, and can be downloaded free of charge from our website or consulted on-line.

International activities

Invited speaker on “Is there anything more we can do? A collaboration plan for Terminology Service Centres (TSCs) in Europe and beyond” at the 5th Terminology Summit, 12 November 2010, Budapest, Hungary.

Presentation on “DutchSemCor: from SoNaR to WSD” at the Stevin-day, 14 October 2010, Antwerp, Belgium.

Poster presentation on “Introducing the Dutch Terminology Service Centre: a centre of expertise on practical terminology work” at the 14th Euralex International Congress, 6-10 July 2010, Leeuwarden, the Netherlands.

Invited speaker on “Tools and Methods for Terminology Research” at the Symposium and Workshop “Terminology for Europeans and beyond”, 5-6 July, 2010, Euralex 2010, Leeuwarden, the Netherlands.

Poster presentation on “Computer assisted semantic annotation in the DutchSemCor project” at LREC, 19-21 May 2010, Valletta, Malta.

Invited speaker on “Web course Ad-hoc Terminology” at the 2nd Workshop Terminology for Translators, 3 April 2009, Antwerp, Belgium.

Poster presentation on “Description of Terms from Cognitive Psychology: Work Method” at the Conference Terminology, Computing and Translation, March 2004, Swansea, UK.
Other academic activities
Co-organizer of the 3rd TiNT day (Terminology in the Netherlands and Flanders), 9 December 2011, Amsterdam, the Netherlands.

Co-organizer of the 2nd TiNT day (Terminology in the Netherlands and Flanders), 5 December 2010, Amsterdam, the Netherlands.

Co-organizer of the Symposium and Workshop “Terminology for Europeans and beyond”, 5-6 July, 2010, Euralex 2010, Leeuwarden, the Netherlands.

Co-organizer of the 1st TiNT day (Terminology in the Netherlands and Flanders), 11 December 2009, Antwerp, Belgium.

Co-organizer of the 2nd Workshop “Terminology for Translators”, 3 April 2009, Antwerp, Belgium.

Co-organizer of the 1st Workshop “Terminology for Translators”, 12 December 2008, Amsterdam, the Netherlands.

Publications

Vossen, P., Görög, A., Laan, F., Van Gompel, M. (2012). Optimizing WSD through Active Learning. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey.

Vossen, P., Görög, A., Laan, F., Van Gompel, M. (2011). DutchSemCor: building a semantically annotated corpus for Dutch. In: Proceedings of Electronic Lexicography in the 21st century: New Applications for new users (eLEX2011), Bled, Slovenia, November 10-12, 2011.

Görög, A. and Van der Vliet, H. (2010). Introducing the Dutch Terminology Service Centre: a centre of expertise on practical terminology work. In: Proceedings of the 14th Euralex International Congress, Leeuwarden, the Netherlands.

Görög, A. and Vossen, P. (2010). Computer Assisted Semantic Annotation in the DutchSemCor Project. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta.

Görög, A. (2006). Description of Terms from Cognitive Psychology: Work Method. In: Terminology, Computing and Translation, Tübingen: Narr.

Contact Information

Telephone:

+31 20 598 9782
