Department of Informatics – DDIS

 

Bachelor Theses

This pages lists the BSc. Theses topics currently available in our group. Don't hesitate to contact the respective person if you are interested in one of the topics. If you would like to write a thesis about your own idea you can propose it to the person most related to what you plan to do or you can contact Prof. Bernstein directly.

Behavior-Based Quality Assurance in Crowdsourcing Markets

A big problem in crowdsourcing markets is the design of robust quality assurance mechanisms. Today, the lack of methods is compensated using massive redundancy in task allocation and / or the use of voting mechanisms. Recenty research showed that the use of behavioral data (e.g., working time per task, mouse movements) can be a sophisticated indicator for the resulting quality.

In this bachelor / master thesis, you will design several worker behavior-based quality predictors and evaluate them in various domains.

Contact: Patrick Minder

Linked Raster Data

In the Geo-Sptial community, Linked Data is gaining increasing importance. Recent examples are Linked Geo-Data and, most popular, Geonames aka the Linked Data version of Open Street Map. Such initiatives are not only supported by academia but indeed backed by prominent partners from industry (e.g. ESRI, Oracle) and the public services (UK Ordnance Survey, Swiss Federal Office for the Environment, US National Geographic Survey). Most approaches, however, aim towards the processing of vector data where entities (points, lines, polygons) are defined ex-ante. In collaboration with the Department of Geography (UZH) we recently approached towards linking raster data such as fields containing information about slope, height, population density, etc. to the Linked Open Data cloud. In particular, it is features deduced from raster data that can potentially be enriched with semantics.

Your task will be to integrate operators from a Geographic Information System such as ArcGIS or GRASS into a SPARQL engine such as Jena or Sesame. Based on a faceted browser you will integrate raster-data into the results of faceted search and/or faceted browsing. In the context of a Bachelor's thesis you will also show how operators in SPARQL translate into operators in the GIS. If performed as a Master Project, the work will more focus on the integration and optiomization aspect of SPARQL to GIS and vice versa.

Contact: Thomas Scharrenbach

Adapting Non-Distributed Algorithms for Distributed Constraint Optimizations

One of the methods of obtaining iterative approximate algorithms for Distributed Constraint Optimization Problems is extending centralized ones. Some possible starting point would be to come up with distributed versions of: Threshold accepting, Tabu search, Great deluge algorithm, Record-to-record travel (http://www.tcs.hut.fi/Publications/A27.shtml). One should then compare to similar, existent algorithms. Benefit: Coming up with new algorithms and studying their performance might enable finding cases where they perform better than others and might also enable the creation of hybrid algorithms (http://eprints.soton.ac.uk/265207/1/aamas08OptMAS.pdf)

Challenges: Design, implement and evaluate the distributed versions of the algorithms in Signal/Collect

Contact: Mihaela Verman

High performance data (RDF) processing

As published RDF data continues to grow (the LOD reported 31 billion triples as of September 2011), the need for high-performance data processing algorithms is paramount. This series of bachelore-thesis projects is intended to attract students that have good knowledge of algorithmic desing and/or are interested in learning more. Possible topics include but are not limited to: efficient merge algoritms (in-place), efficient external sorting (standard and/or map-reduce based), load balancing of resources (CPU / DISK) in parallel processing systems (fuzzy logic based controllers, evolutionary algorithms, other). The students should also have knowledge of C and python.

Contact: Cosmin Basca