New diploma topic: "Classifying e-Tourism data sets"

The goal is to apply machine learning to classify e-tourism data sets. These data sets consist of hotel descriptions. Each description comprises a set of given attributes (usually Boolean attributes following a well-known, given data standard) and possibly a free-text description. In a first step, these descriptions need to be analyzed and harmonized (on a semantic/conceptual level as well as on the schema level). In a next step, the tourism objects need to be classified into 7 distinct classes using machine learning, based on both the set of attributes and the textual descriptions. Web Services should be developed as a proof of concept. The evaluation is based on real-world data.
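To illustrate the two steps described above, the following is a minimal sketch of how harmonized Boolean attributes and free text might be combined into a single feature vector before classification. It is not part of the topic definition: the class name HotelFeatures, the synonym table, the attribute names and the "attr:" / "word:" prefixes are all assumptions made for illustration, and the actual data standard and classifier are left open.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

public class HotelFeatures {

    // Hypothetical harmonization table: maps provider-specific attribute
    // names to one shared vocabulary. A real mapping would be derived from
    // the data standard used by the hotel descriptions.
    static final Map<String, String> SYNONYMS = Map.of(
            "wifi", "internet_access",
            "wlan", "internet_access",
            "pool", "swimming_pool");

    // Normalizes an attribute name and maps it to the shared vocabulary.
    static String harmonize(String rawName) {
        String key = rawName.trim().toLowerCase(Locale.ROOT);
        return SYNONYMS.getOrDefault(key, key);
    }

    // Builds one sparse feature vector from Boolean attributes and free text.
    // Attributes that are true become binary features; words become counts.
    static Map<String, Double> toFeatures(Map<String, Boolean> attributes, String freeText) {
        Map<String, Double> features = new TreeMap<>();
        for (Map.Entry<String, Boolean> e : attributes.entrySet()) {
            if (Boolean.TRUE.equals(e.getValue())) {
                features.put("attr:" + harmonize(e.getKey()), 1.0);
            }
        }
        if (freeText != null) {
            // Split on anything that is not a letter or digit (simple bag of words).
            for (String token : freeText.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
                if (!token.isEmpty()) {
                    features.merge("word:" + token, 1.0, Double::sum);
                }
            }
        }
        return features;
    }

    public static void main(String[] args) {
        Map<String, Double> f = toFeatures(
                Map.of("WLAN", true, "Pool", false),
                "Quiet family hotel, family friendly.");
        System.out.println(f);
    }
}
```

A vector of this kind could then be passed to any standard classifier that assigns one of the 7 classes. Which learning method works best on the real data is part of the thesis work.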
Solid skills in machine learning, Java and Web Services are required.
If you are interested, please contact the secretary of the EC-group.