The World Wide Web (WWW) has been the main source of an important shift in the way people get information and order services. However, the current Web is aimed at people only. The Semantic Web, which is the successor of the WWW, is defined and linked in a way that it can be used by machines not just for display purposes, but also for automation, integration and reuse of data across various applications. Facilities and technologies to put machine understandable data on the Web are rapidly becoming a high priority for many communities. E.g., whenever a computer program scans a text containing the word "chair", it does not know that this is an instance of the concept "office equipment", since "chair" is just a sequence of characters. However, by using an annotator, we can mark this word chair with a tag reflecting the concept "office equipment", so this instance can be recognized by a computer application. In fact, the Semantic Web augments the current WWW with formalized knowledge in order to make its information machine-processable.
The aim of this project is the development of an ontology-based annotator in order to enrich resources containing semi- and/or non-structured data with meta-data tags based on concepts contained within DOGMA's Ontology Base. Examples of less-structured resources are html-pages, text-files, multimedia, etc... In order to demonstrate the usability of the annotator, a case study will be worked out. The domain and/or type of case study can be proposed by the student.
Java, Eclipse platform
You have programming skills in Java.