Navigation

You are here: Home / Research

Research Projects

by year:  all years   2013   2012   2011   2010   2009   2008   2007   2006   2005   2004   2003   2002   2001   2000   1999   1998   1997   prior projects
by contact:  all contacts
Show Contacts
Maurer Michael Saffari Amir Schulter Samuel Seichter Hartmut Zeisl Bernhard Lex Alexander Arth Clemens Barakonyi István Bauer Joachim Beichel Reinhard Bischof Horst Bornik Alexander Reitinger Bernhard Bauer Christian Gruber Lukas Kainz Bernhard Pirchheim Christian Wagner Daniel Kalkofen Denis Donoser Michael Elbischger Pierre Ferstl David Fraundorfer Friedrich Reitmayr Gerhard Godec Martin Graber Gottfried Grabner Markus Grubert Jens Hartl Andreas Hauswiesner Stefan Riemenschneider Hayko Grabner Helmut Hirzer Martin Hofer Manuel Hoppe Christof Irschara Arnold Newman Joseph Junghanns Sebastian Khan Inayatullah Kalkusch Michael Karner Konrad Khlebnikov Rostislav Klaus Andreas Klopschitz Manfred Kluckner Stefan Köstinger Martin Kontschieder Peter Pirker Katrin Kruijff Ernst Langlotz Tobias Langs Georg Leberl Franz Lee Felix Leistner Christian Leitner Raimund Lenz Martin Mauthner Thomas Meixner Philipp Mendez Erick Grabner Michael Heber Markus Mühl Judith Mulloni Alessandro Ober Sandra Pacher Georg Partl Christian Pflugfelder Roman Pinz Axel Roth Peter M. Pock Thomas Puff Werner Pan Qi Ram Surinder Grasset Raphael Recky Michal Regenbrecht Holger Reinbacher Christian Rüther Matthias Rumpler Markus Santner Jakob Sareika Markus Schall Gerhard Schmalstieg Dieter Schulz Hans-Jörg Sormann Mario Steinberger Markus Sternig Sabine Storer Markus Straka Matthias Streit Marc Tatzgern Markus Nguyen Thanh Nguyen Thuy Trobin Werner Unger Markus Uray Martina Urschler Martin Veas Eduardo Waldner Manuela Wendel Andreas Werlberger Manuel Winter Martin Wohlhart Paul Zach Christopher Zebedin Lukas Zollmann Stefanie
by keyword:  all keywords
Show Keywords
3D Computer Vision 3D reconstruction Aerial Vision Augmented Reality Augmented Video Best Paper Award Biometrics Caleydo Computer Graphics Computer Vision Convex Optimization Coordinate transformations detection face Fingerprint Georeferencing GPU GUI HOG Human Computer Interaction Image Labelling Industrial Applications Information Visualization integral imaging Interaction Interaction Design Machine Learning Medical computer vision Medical Visualization Mixed Reality Mobile computing Mobile phone Model Multi-Display Environments Multiple Perspectives Object detection Object recognition Object reconstruction Object Tracking On-Line Learning Robotics Segmentation Shape analysis shape from focus SLAM Software Projects Structure from Motion Surveillance SVM Symmetry Tracking Fusion Tracking, Action Recognition User Interfaces Variational Methods Virtual reality and augmented reality Visual Tracking Visualization
per page:    all   50   20
  Title     Abstract     Start     End  
AUGUR: portable AR visualization of structure within structure using high precission detection
(details)

This project aims to develop portable measurement tools with in-situ visualization for the construction industry. A future measurement tool will provide a direct augmented reality view of measured properties over the real environment together with instructions as to where and how a certain task can be completed. For example, a metal detection tool should be able to provide direct visual feedback on the location of hidden metallic structures over a live video view of the inspected wall area. Furthermore it can guide a construction engineer to the optimal position for drilling a hole, avoiding any damage to existing structures.

Thus the tools should combine information from several sources to provide interactive and contextaware

guidance: Measurements from built-in sensors; location-aware through online tracking and registration; spatial, semantic information retrieved from a building information system (BIM). At the same time, future tools need to be simple to be used by non-expert users; therefore the system needs to be intuitive and guide users in the correct operation to fulfil their tasks. To accomplish this goal, The project addresses the following challenges:

  • Tracking for mobile devices in changing and unknown environments for correct visual overlays. We will investigate the combination of visual online reconstruction methods with range finders and coarse models for absolute registration.
  • X-Ray visualization of hidden and abstract information in unknown environments. Here we will investigate automatic approaches that take the environment’s appearance and the virtual information into account to select the best visualization method.
  • User guidance based on measured and plan information requires automatic analysis of the spatial arrangements and automatic visualization.
2012 2013
Mobi-Trick
(details)

The focus of the project is outdoor mobile computer vision with all of its challenges. Mobile systems need to be compact and energy efficient and are frequently changing locations. Therefore they must be autonomous and perform processing locally. A number of challenges arise from these requirements for which the project aims to provide solutions: Being compact, there is not much space for a large number of sensors such as laser scanners, radar antennas and the like. The work in this project will focus on stereo vision but with two different types of cameras. Often a second camera is already available and stereo information increases detection accuracies. Each time the system moves it needs to adapt to the changing situation. This requires adaptive calibration and online learning. Mobile systems often work from batteries. In addition, there is not much space to include intricate cooling systems. Thus, the system must be designed to be very energy efficient. New approaches for dynamic power management will be explored in the project. To put the work into context, several applications from the area of traffic surveillance/toll enforcement will be implemented and tested in an application oriented setting.

Current traffic enforcement solutions are either very large and costly (section control, toll enforcement) or do not offer much in terms of image processing (radar speed control). The technological output of Mobi Trick makes it possible to design mobile solutions for traffic monitoring, vehicle identification and classification, intelligent incident detection and observation of driver behavior. Mobile devices are also more efficient in enforcement. Their transient nature makes them less predictable. Mobile systems can also react more flexibly to changing road situations such as construction sites.

2010 2013
HD-VIP: High Definition Video Processing
(details)

The growth of information is nowadays enormous and at a level which had never been reached before. We currently produce almost more data in one year than was produced in the entire history of mankind so far. In particular the trend to a full digitization of audiovisual content is contributing to this explosion of available material. The exponential growth of online video, most notably YouTube among the many prominent video portals is just one example for that. Even if international studies are not arriving at exactly the same results, the figures are impressive: digital production in 2006 was approximately 160 Exabyte, and is predicted to rise to 990 Exabyte in 2010.

Any video processing /editing software has to keep pace with these extraordinary data rates which requires special efforts from the hardware and the software. Fortunately we see also an extraordinary increase in processing power, especially when looking at recent developments of graphics cards (GPUs). These cards offer massive parallelism (ideally suited for video processing) at a rather modest price. All these facts make this hardware an ideal candidate for video processing. But in order to make full use of the hardware the algorithms have to be highly parallel. Typical tasks encountered in video processing (which will also be tackled by the proposed project are):

Superresolution: With the advent of HDTVs in many homes there is an increasing need to produce also HDTV content. In order to make use of existing (low-resolution) material one can use so called superresolution algorithms. These methods generate from a sequence of low resolution frames a high resolution image by exploiting the high interframe redundancy.

Denoising: There are many sources of noise in a video, either the material is historic or during production/compression etc. noise is added to the video. A basic task is to remove the noise but still preserve all fine scale details.

Interactive video editing: For post production purposes one wants to mark objects in a video (of course the object should only be marked in a single frame and then segmented automatically in all subsequent frames) and either remove them (which requires inpainting methods to fill the holes with meaningful content), place them somewhere else in the video or replace them with different objects. Since these tasks are done interactively this requires interactive framerates.

Fortunately all of these tasks can be addressed by so called variational methods. The basic idea is to formulate the task as a minimization problem of a suitable energy functional. Besides other desirable properties these methods can be implemented in a highly parallel fashion which makes them ideal candidates for implementation on modern GPUs.

2010 2012
Highly accurate range computation in driver assistence systems
(details)

In this project we study variational methods for computing highly accurate range data in driver assistance systems.

2010 2011
VM-GPU: Variational Methods on the GPU for Industrial Problems
(details)

The project VM-GPU fits exactly to the FIT-IT Visual Computing call. It is a combination of computer vision and graphics methods to offer solutions to a problem of great relevance for industry. In particular,

  1. VM-GPU will address modern variational methods for computer vision. These methods are mathematically well understood and provide novel means for such diverse tasks as denoising, segmentation, 3D matching registration etc. One short coming of these methods is that due to their iterative nature they are usually slow to implement, therefore these methods have not been used in an industrial settings. 2. Modern graphics processing units (GPUs) offer a tremendous processing speed (the new Nvidia series is supposed to offer 500GFlops) and the increase in processing speed is much faster than for standard CPUs. Recent features of GPUs (e.g. floating point, highly parallel architecture etc.) make them attractive for general purpose calculations and in particular for computer vision tasks. 3. Machine Vision and industrial image processing is a fast growing market with a lot of challenging tasks to be solved. In order to keep pace with the production process the methods need to be fast.

The goal of VM-GPU is to make variational methods available for industrial problems by using modern graphics hardware. If successful this project will have a large impact on the machine vision industry, it will allow for the first time to use variational methods in an industrial setting, in addition having graphics cards available as computing platforms will offer completely new ways of addressing industrial vision problems (e.g., it is very easy to scale up by just using a second graphics card).

2007 2009
3D Reconstruction of Electrical Impulse Discharges
(details)

Electrical impulse discharges in nature are visible as lightning. Their impact point can be electro magnetically located up to a precision of several hundred meters. In some restricted areas such as industrial plants, airports etc. it desirable to know the the impact region and path of the lightning up to a precision of a few meters. If visibility is not too restricted by weather conditions, a multi-camera setup would be a viable option to locate path and impact area of the discharge.

In this project impulse discharges of a few meters are synthetically generated under laboratory conditions and reconstructed using a multi camera setup.

2007 2008
A Low-Cost System for Automatic People Tracking in a Labyrinth
(details)

After medical treatment of visually handicapped people it is desirable to evaluate the benefit of the treatment for the patient. Especially the capability of the patient to orient himself in a three-dimensional environment, to navigate and recognize obstacles is of interest.

For a clinical evaluation under controlled circumstances a labyrinth has been built through which the patient ha to navigate. Obstacles may be randomly placed in the labyrinth. A multi-camera system keeps track of the patients movements and extracts parameters such as position, speed, head rotation etc.

2006 2007
Measuring Material Displacement on Steel Specimens During Strain Tests
(details)

Deformation analysis of material surfaces is a crucial part of material testing and quality control. In this project a stereoscopic surface measurement system has been developed which allows to measure surface deformation during stress/strain tests over a wide range of fields of view. Stereo image pairs are acquired at specified time instances which makes it possible to synchronize the acquisition with the amount of stress applied.

2005 2006
Vision Based Kinematic Calibration and Error Compensation of Articulated Robot Arms
(details)

Development of a vision system for accurate calibration of the kinematic chain of an articulated robot arm.

The absolute positioning error of articulated robot arms is typically by an order of ten higher than their repeatability error. Inaccurate blueprint kinematic models typically account for 90% of this discrepancy. In this work a calibration procedure is developed which calibrates the kinematic model of a robot arm using fixed stereo rig and a calibration target mounted on the robot hand. In a single calibration framework the following parameters are automatically determined:

  • DH-parameters of the robot arm
  • Relative pose of calibration target and tooltip
  • Relative pose of stereo rig and robot coordinate frame
  • Interior camera parameters
  • Lens distortion parameters of both cameras
  • Relative pose of the stereo cameras
  • Inaccuracies/deformation of the calibration target

The procedure is fully automatic and does not require expensive, precalibrated equipment.

2005 2006
Vision Based Detection of Wrong-Way Drivers
(details)

Vehicles driving the wrong way down a motorway represent a serious source of danger. In the year 2005 in Austria, 521 vehicles were counted driving in the wrong direction on motorways. 8 people died due to wrong-way driver accidents.

Immediate detection of a vehicle driving in the wrong direction could help preventing serious accidents by warning the oncoming vehicles (via traffic telematic systems or radio announcements) and by alarming the police. To guarantee immediate detection, one would have to observe every access ramp and every place where a car could turn. Thus a low-cost monitoring system capable of detecting wrong-way drivers and forwarding an alarm to a central traffic control station is needed. Image based detection suffers from one main problem: In case a large vehicle (truck, bus etc.) throws a shadow to the opposite lane of a motorway, that shadow moves against the direction of traffic and may cause a false detection. With a monocular camera system (one single camera), this problem can only be eliminated by large computational costs.

2005 2007
TRICam - Robust Object Detection and Tracking on an Embedded System
(details)

Development of Computer Vision Algorithms for Robust Object Detection and Tracking on an Embedded DSP Platform. FREQUENTIS was working on an embedded platform called "TRICam". The platform is capable of acting as a high quality Motion-JPEG or MPEG4 Encoder or Decoder; furthermore, it can perform a set of selected image processing algorithms for object detection and tracking in real-time. The focus was mainly put on traffic monitoring, vehicle detection and tracking. The goal was to also deal with various algorithms for video surveillance in public places and public transport.

2005 2007
Dust Filter Surface Analysis
(details)

When a dust laden gas is sucked through a filter, the dust remains on its surface and forms a compact dust layer called filter cake. Periodically the filter cake is at least partially removed by inverse high pressure air pulses to allow continuous operation of the filter. Knowledge of the distribution of the filter cake on the filter surface at different stages of operation is decisive for filter operation. It is shown that the principle of calibrated shape from stereo with pattern projection for generating texture on the surface gives a robust 3D reconstruction of the filter surface when accessing through a glass window. Rigid registration of surface patches using landmark points, combined with an Iterative Closest Point Algorithm as a refinement procedure gives a continuous 3D model of the entire visible filter surface. Cake thickness is calculated by taking the height difference of two surface models, acquired before and after dust deposition. The challenging problem is to account for the non-rigid deformation of the filter cloth. A global deformation model is estimated using Thin Plate Spline Interpolation based on landmark points on the filter surface.

2004 2005
Visual Pose Determination by a Robot in six degrees of freedom
(details)

We propose to develop a robotic system for the exact pose estimation of rigid objects in six degrees of freedom (DoF). Our approach is based on the following scenario: Given be a set of unordered objects in arbitrary position which come from a small variety (<10) of different types as may happen on a conveyor belt during an assembly process. We assume the objects to be mostly un-occluded and their CAD-model given. Our goal is to pick with a robot a specific object in such a way that it has a defined position. The robot is equipped with multiple black and white cameras (e.g., 2-3). In order to achieve maximum generality we do not intend to use range information nor calculate a 3D-representation from stereo images. This task requires a solution for the following sub-problems: Recognition and initial pose estimation, which will be approached using robust appearance-based recognition methods; Pose refinement will be handled by model-based methods; Movement planning to enhance the accuracy of the pose estimation.

Partnerlist:

M&R Holding GmbH

2001 2002
Sorting and packaging of apples by means of digital image processing
(details)

In the process of packaging high quality apples the proper placement of the fruit in the tray is critical (i.e. the most appealing side should face towards the customer, all stalks should face into the same direction). Currently this sorting is done manually, at an additional cost of about 36 Euro (500 Schillings) per ton of apples. The automation of this process should result in a significant cost reduction. The first step of the packaging process rotates the apple so that its symmetry axis is vertical with either stalk or bloom facing upwards. This is done mechanically. The next step determines the position of the most appealing side of the apple (i.e. the horizontal angle) and decides whether stalk or bloom is facing upwards (the vertical angle). This is done by digital image processing. The apple is rotated by 360 degrees. A digital camera captures a sequence of images at different orientations and the image processing software measures fruit quality and vertical orientation in real-time. In the last step, a robot brings the apple to its ideal position and puts it into the tray.

Partnerlist:

Schuster Messtechnik

2001 2002

[Powered by Plone]