Research Projects
- Show Keywords
- 3D Computer Vision 3D reconstruction Aerial Vision Augmented Reality Augmented Video Best Paper Award Biometrics Caleydo Computational Photography Computer Graphics Computer Vision Convex Optimization Coordinate transformations detection face Fingerprint Georeferencing GPU GUI HOG Human Computer Interaction Image Labelling Industrial Applications Information Visualization integral imaging Interaction Interaction Design Machine Learning Medical computer vision Medical Visualization Mixed Reality Mobile computing Mobile phone Model Multi-Display Environments Multiple Perspectives Object detection Object recognition Object reconstruction Object Tracking On-Line Learning Robotics Segmentation Shape analysis shape from focus SLAM Software Projects Structure from Motion Surveillance SVM Symmetry Tracking Fusion Tracking, Action Recognition User Interfaces Variational Methods Virtual reality and augmented reality Visual Tracking Visualization
| Title | Abstract |
Start
|
End |
|---|---|---|---|
|
Person Re-Identification
(details) |
The goal of this project is to develop an interactive visual search method that finds a given pedestrian in a large archive of other camera views efficiently. A user-selected pedestrian image or sequence is used to obtain initial discriminative features and an initial ranked list of hypothetical matches. A discriminative pedestrian recognition model is learned in an on-line manner by user interaction assigning positive and negative labels to the initially retrieved results and on-line boosting for feature selection. This enables that the best discriminative features for the current query are selected. |
2008 | 2010 |
|
Semi-Supervised Learning for the Analysis of Unstructured Documents
(details) |
The goal of this project is to develop and analyze methods for analyzing textual information. This should be realized by using semi-supervised learning methods, which use labeled as well as unlabeled data. In particular, existing methods which are already applied for pattern recognition should be adapted such that those can also be applied for textual data. For a practical analysis comparisons to SVM and k-NN classifier using a boosting algorithm should be performed, the influence of the amount of labeled/unlabeled data and the convergence should be analyzed. Moreover, a fair comparative study between batch and on-line methods is performed. |
2008 | 2011 |
|
MARCUS - Mobile Augmented Reality and Context in Urban Scenarios
(details) |
MARCUS is an exchange program with the Human Interface Technologies Laboratory (Christchurch, NZ) and the University of Otago (Otago, NZ). Its aim is to extend the scope of the research work performed in the EU Integrated Project "IPCity" with researchers in New Zealand. The focus of research will be on how mobile devices can create new types of interactive urban experiences. For example, location specific information overlaid on the real world can be used to aid navigation through cities, in outdoor game play, or for providing user supplied comments at certain sites. |
2008 | 2010 |
|
Ludwig Boltzmann Institut für Klinisch-Forensische Bildgebung
(details) |
Die klinische Rechtsmedizin gewann in den letzten Jahren aufgrund einer Sensibilisierung der Öffentlichkeit gegenüber häuslicher und sexueller Gewalt, Gewalt gegenüber Kindern und Verdachtsfällen von medizinischen Behandlungsfehlern stark an Bedeutung. Die forensische Untersuchung von Lebenden ist bis heute jedoch auf eine äussere Besichtigung des Körpers beschränkt. Das neue Ludwig-Boltzmann-Institut (LBI) für klinisch-forensische Bildgebung hat zum Ziel, Verfahren zur Erfassung von inneren Verletzungsbefunden als Grundlage für forensische Gutachten zu entwickeln. Mittels Computertomographie (CT) und Magnetresonanztomographie (MRT), welche in der Klinik etabliert sind, können zusätzliche, objektiv nachweisbare innere Verletzungsbefunde erhoben werden, die eine verbesserte Einschätzung der ausgeübten Gewalt gegen die untersuchte Person ermöglichen. Die Methoden sind jedoch auf klinische Diagnostik ausgerichtet, während forensisch wichtige Befunde nicht oder nicht optimal dargestellt werden. Das Institut fuer Maschinelles Sehen und Darstellen kooperiert mit dem LBI zur Entwicklung neuer Methoden der Bildverarbeitung und Computergrafik zum Zwecke der Bildgebung. |
2008 | 2015 |
|
IMPPACT - Image-based Multi-scale Physiological Planning for Ablation Cancer Treatment
(details) |
IMPPACT is a European research project, which develops an intervention planning system for Radiofrequency Ablation of malignant liver tumours. TU Graz is dealing with medical visualization and augmented reality in the project. Problem or Context Radiofrequency Ablation (RFA) is a minimally invasive form to treat cancer without open surgery, by placing a needle inside the malignancy and destroying it through intensive heating. Though the advantages of this approach are obvious, the intervention is currently hard to plan, almost impossible to monitor or assess, and therefore is not the first choice for treatment. Project IMPPACT will develop a physiological model of the liver and simulate the RFA intervention result, accounting for patient specific physiological factors.
Mathematical modelling together with experimental validation lead to a patient specific intervention planning system. read more Expected Results & Impacts IMPPACT will be modelling a physiological organ including the metabolism and patient specific tissue properties. This alone is a huge step forward as compared to the state-of-the-art intervention planning systems that do not address this issue. The IPS will allow prediction of treatment results on a patient specific base. It will therefore bring down the risk of local recurrences and eliminate the nowadays so common repeated treatments of the same tumour, making RFA an as effective treatment as resection. |
2008 | 2011 |
|
HydroSys - Advanced spatial analysis tools for on-site environmental monitoring and management
(details) |
The research aim of the project is to provide a system infrastructure to support teams of users in the on-site monitoring of events and analysis of natural resources. The project will introduce the innovative concept of event-driven campaigns using handheld devices, potentially supported by an unmanned aerial vehicle (UAV). Event-driven campaigns provide users the capacity to analyse and predict environmental changes on-site, supporting the process of taking appropriate countermeasures to avoid environmental degradation. During these campaigns, users will be able to setup and retrieve data from mobile sensorstations, the UAV and external sources (such as permanent sensor networks) in order to generate dense information on a small area. The whole sensor network system will gather and store sensor data, and process simulations based on physical process models. Hence, a shared information system fusing heterogeneous data sources will be provided that supports teams of stakeholders to monitor environmental processes on-site, complementing remote monitoring and management. To enrich the data sets from a specific location, additional remotely controlled cameras will be deployed, mounted on sensorstations and below the UAV. Users will be able to analyse the environment using mobile phones and handheld computers, supported by advanced user interface techniques. The project will improve monitoring and management for environmental scientists, institutions, service providers, engineering companies and municipalities through its strong integration of handhelds and sensor networks. The project will progress well beyond the current state in the art, by dealing with short-term events and detailed analysis of small sites. The analysis of such events is hardly supported by current methods, but has a large impact on environmental degradation. Furthermore, information is made available to citizens by providing mechanisms to access top-level environmental data. Within the project, cutting edge inter-disciplinary research will be performed to develop user-centered solutions. When the data is integrated with analytical tools in a shared information space it will also aid a wide range of managers and planners pursuing more environmentally sensitive solutions to engineering problems. To aid the process, the research is steered by considerable end-user involvement in all its phases. |
2008 | 2011 |
|
Christian Doppler Laboratory for Handheld Augmented Reality
(details) |
Augmented Reality (AR) combines real and virtual in a single view, putting information right were it belongs - into the real world. AR is still a young research field and hence strongly driven by basic research and experimental methods, while only few successful commercial applications have been deployed. One of the reasons is that past hardware (such as head-mounted displays and Tablet PCs) have not been sufficiently inexpensive and ergonomically satisfactory. Therefore, recent AR research shows a trends towards deploying AR on advanced mobile phones, using the phone camera as video see-through interface for a “magic lens” style of AR. Recent research in the proposer’s group has first the first time established a baseline technology for achieving real-time performance AR on mobile phones, and this development has been meet with great interest from industry. This proposal the logical consequence of this development. It is concerned with extending this research in several directions, in particular making techniques more scalable (sometimes several orders of magnitude), so that realistic real world scenarios interesting for commercial applications can be attacked by industry. Firstly, we want to expand our real-time computer-vision based pose tracking and object recognition techniques. Secondly, we propose to develop realistic AR image synthesis and visualization methods. Thirdly, we suggest an investigation into efficient 3D interaction techniques with and for AR phones. Finally, we suggest the creation of a distributed infrastructure based on Web 2.0 technology for scalable content creation and deployment of geo-referenced AR applications on phones. |
2008 | 2015 |
|
CityFit: High-Quality Urban Reconstructions by Fitting Shape Grammars to Images and derived Textured Point Clouds
(details) |
The generation of realistic 3D models of whole cities has become a vibrant and highly competitive market through the recent activities of, most notably, Goggle Earth and Microsoft Virtual Earth. While the first generation of these systems only delivered high-quality zoomable images of the ground, the current trend is heavily geared towards 3D – that is, users can access three-dimensional height- fields of the terrain, and even 3D models of individual buildings. Simple building models, basically extruded polygons with different types of roofs, can be generated today from aerial images completely automatically. This is a solved problem. Far from solved, however, is the problem of generating automatically detailed buildings with façades. Input data for this problem are registered range maps obtained by stereo matching and sequences of highly overlapping thus redundant images (taken from a car driving in the road) where each pixel has not only a color but also a depth, a z-value. Although range maps can be directly rendered in principle, the data size is huge and, more importantly, the pixels have no semantics: A priori there is no difference between a pixel on the floor, on the wall, or on a door. But these shape semantics are required by all downstream applications using the city model. Shape grammars, on the other hand, have recently become (again) a popular method in research for representing 3D buildings. Their great advantage is that they allow to parameterize buildings, which can be used for populating virtual cities with believable architectural buildings, e.g., for 3D games. The goal of the CITYFIT project is, given highly redundant input imagery and range maps from an arbitrary building in Graz, to synthesize a shape grammar that, when evaluated, creates a clean, CAD- quality reconstruction of that building that fits the original data very closely and makes the semantics of all major architectural features explicit. These shape semantics can even be transferred back to inform the original data, so each of these “semantically enriched” data points can tell whether it belongs to ground, wall, or door. |
2008 | 2010 |
|
vdQA: Video Quality Analysis
(details) |
Automatic and efficient quality analysis of audiovisual content has become a crucial step before storing the material for later use. While most approaches in this area are only dealing with low level signal analysis, the goal of this project is to go far beyond state-of-the-art procedures. On the basis of novel as well as proven computer vision methods, we will attempt to incorporate high level knowledge in the analysis step, thus achieving significant better and faster results than current methods, comparable in their reliability with a human operator. In particular the vdQA project will carry out research in the following areas: • Improvement of optical flow field methodologies to deal with multi-frame information • Application of novel segmentation methods in order to enable semantic quality analysis. • Knowledge assisted artefact assessment and classification. • Novel methods for fast and robust detection of difficult impairments like unsteadiness, flicker, freeze frames, test patterns and lost frames. • Research into methodologies that are particularly well suited for implementations taking advantage of GPU hardware. The grand challenge in the end is the combination of robustness, speed and integration of human knowledge. The research and industrial partners have dedicated roles in the work programme to achieve those goals. The industrial partners have excellent knowledge of the market and will provide user requirements as well as extensive test material. The academic partners will do research in their respective fields, namely development of basic algorithms for optical flow, tracking, segmentation, classification and usage of GPUs as well as algorithms for content based quality analysis and semantic technologies to represent knowledge. Towards the project end the industrial partners will evaluate and test the developments together with pilot end users. Project Partners: Joanneum Research, Institute of Information Systems & Information Management |
2008 | 2010 |
|
CranUS - Cranial Ultrasound Simulation
(details) |
The use of augmented reality in medicine is an important field, especially in teaching and training of sensitive tasks. To support teaching and training of neonatal cranial sonography, an augmented reality simulator was developed. Physical models of a newborn and an ultrasound probe were tracked and their movements displayed in their virtual representation. The head of the newborn model was augmented with a 3D volume, reconstructed from ultrasound images of a real patient. Reconstructing a 3D volume from irregular source data takes a special focus on positioning the images and the subsequent interpolation. Moving the physical model towards each other, the according slices are generated in realtime. |
2007 | 2008 |
|
Doctoral Program for the Confluence of Graphics and Vision
(details) |
Computer vision and computer graphics constitute two closely related areas of research: Though both fields rely on the same physical and mathematical principles and on a common set of representations, they mainly differ in how these representations are built. Traditionally these two fields have been treated as separate academic discipline. Exploiting the commonalities between vision and graphics turns out to be a scientifically profitable endeavour. There are many examples of fruitfull combination of graphics and vision, but there is no systematic education of students (especially in Austria). Therefore, the goal of this doctoral program Confluence of Vision and Graphics is to educate highly talented PhD students in this interdisciplinary field and to teach them a common view of this challenging topic from the start. All proposed topics require a significant amount of vision and graphics. The students will be co-supervised jointly by one professor with vision and one professor with graphics expertise. The proposed educational program will ensure that the students will be trained to become future leading scientists, which will face the challenges of research excellence in the interdisciplinary area of graphics and vision, academic leadership, and social competence as a member of a particular research group as well as being a part of the global research network. |
2007 | 2019 |
|
VIPEM - Visual Analytics for Personalized Medicine
(details) |
VIPEM ist ein System zur hypothesengesteuerten Analyse multidimensionaler Datenräume im Gebiet der personalisierten Medizin. Ein multidimensionaler Datenraum, bestehend aus molekularen und klinischen Daten, wird unter gleichzeitiger Anwendung algorithmischer Verfahren und direkter Benutzerinteraktion gefiltert und hierarchisch strukturiert. Ein zentrales Forschungsproblem der personalisierten Medizin ist die Frage, wie die Verknüpfungen zwischen genetischen Variationen und Krankheiten, bzw. dem Ansprechen auf bestimmte Medikamente, gefunden werden können. Dazu gilt es, z.B. Gendaten mit klinischen Daten zu verknüpfen und in Folge spezifische Patientengruppen zu identifizieren. Die großen Datenmengen der molekularen Analyseverfahren (genetische Polymorphismen, Genexpressionsdaten, Proteomics) können nur mehr mit Methoden der Bioinformatik und Statistik bewältigt werden. Aber auch Standardmethoden der Statistik und der Bioinformatik versagen, wenn die Daten sehr inhomogen strukturiert sind dies ist bei den klinischen Daten der Fall und wenn Strukturen in den Daten durch Rauschen bzw. dominante Muster verdeckt werden. VIPEM soll mit Hilfe von Visualierungsmethoden die Struktur in den Datenräumen sichtbar machen und eine interaktive Navigation und Strukturierung sowohl der molekularen, als auch der klinischen Daten erlauben. VIPEM baut auf Grundlagenergebnissen in den Bereichen Informations-Visualisierung und multimodale Benutzerschittstellen auf. Durch eine enge Verknüpfung mehrerer gleichzeitig wirksamer Eingabekanäle und die sofortige Sichtbarkeit der Analyseschritte in der Visualisierung steht dem Experten ein Werkzeug zu interaktiven Erkundung von komplexen Datenräumen zur Verfügung. Als Eingabeparameter für Analysealgorithmen nutzt VIPEM hierbei die menschliche Fähigkeit, komplexe Muster und Zusammenhänge visuell bereits in Ansätzen zu erfassen, und erlaubt dadurch das Freilegen sonst verdeckter Strukturen. VIPEM fokussiert auf die hohe Nachfrage nach visualisierter Analytik im Bereich der Bioinformatik. Der innovative Zugang von VIPEM versteht sich als einmaliges Verkaufsargument, zumal sich mit VIPEM ein viel versprechendes Produkt abzeichnet, welches sicher innerhalb der nächsten zwei bis drei Jahre seinen Stellenwert als verwertbares Produkt am Markt behaupten könnte. Diese Forschungsarbeit wird als Teil des Projekts Caleydo durchgeführt. |
2007 | 2009 |
|
POMAR 3D - Position and Orientation Measurement in 3D for Augmented Reality
(details) |
Positionierungs- und Orientierungsmodul für einen Mobilen Augmented Reality- Client zur 3D-Echtzeitvisualisierung unterirdischer Ver- und Entsorgungsinfrastruktur |
2007 | 2008 |
|
Genoptikum - Interactive Biomedical Information Visualization
(details) |
Genoptikum is an interactive data exploration system for the visualization of and navigation in molecular and clinical data in the field of personalized medicine. Genoptikum addresses the essential but to date unsolved problem of how to identify connections between genetic variants and their corresponding diseases or the response to certain drugs and treatments, respectively. It is, therefore, necessary to connect gene data and clinical data in order to categorise specific subgroups of patients with certain disease features. The huge amount of data provided by molecular analytical methods (genetic polymorphisms, gene expression data, proteomics) can only be analysed by applying statistical methods and bioinformatics. However, even standard methods of statistics and bioinformatics fail when the data are inhomogeneous as is the case with clinical data and when data structures are obscured by noise and dominant patterns. Genoptikum should make the structure of the data spaces visible by using innovative methods of visualisation based on multiple high resolution displays in combination with data projection technologies. Genoptikum is bases on fundamental results in the fields of visualisation of information and multimodal user interfaces which enable an interactive navigation and structuring of both molecular and clinical data. Through a close link between several input channels, which are simultaneously active, and by immediate visualisation of the steps of the analysis, the expert is provides with a tool for the interactive exploration of complex data spaces. As input parameter for analysis algorithms Genoptikum makes use of the human visual capacity to grasp complex patterns to reveal hidden structures and correlations in large data spaces. This research is part of the project Caleydo. |
2007 | 2009 |
|
ICAO Face Normalization and Analysis
(details) |
The goal of this project is the research and development of state of the art computer vision and object recognition algorithms to analyze face portrait images according to the ICAO (International Civil Aviation Organization) standards and specifications. Therefore a close cooperation with Siemens IT Solutions and Services Biometric Center in Graz exists, where the Biometry group is developing a software solution for this purpose. Current passports issued in the European Union contain biometric data like e.g. digital photographs and fingerprints in order to uniquely identify its owner. To be able to read passports all over the world, the ICAO has specified a number of guidelines and requirements for the structure of these biometric features. In case of face portrait images, examples for these requirements are neutral appearance, eyes opened, mouth closed, frontal pose, straight-looking eyes, properly-sitting eye-glasses, or uncovered faces. Since these analysis steps have to be performed in an automatic fashion, each of these requirements imposes certain computer vision research challenges which are tackled in this research project. Examples for the topics involved in these analysis steps are model-based segmentation using active shape and active appearance models, fast and robust AdaBoost based machine learning algorithms for face and face component detection, or classification of facial expressions using multi-classifier fusion approaches. |
2007 | 2009 |
|
Deskotheque - Collaborative Interaction in Multi Display Environments
(details) |
Office space usually consists of private single-user workstations. Team work takes place on separate locations, usually supported by analogue media like printed paper. Digital data exchanges is accomplished through designated channels like e-mail or instant messengers. Deskotheque is an ongoing project aiming to extend personal workspaces to enhance team work. It represents a flexible, interactive environment for team work, conference and meeting rooms. Unused surfaces in the room, such as empty wall space and table surfaces, can be turned into interactive, digital displays to be used for multi-user co-located teamwork. |
2007 | 2011 |
|
EVis: Autonomous Traffic Monitoring by Embedded Vision
(details) |
The world will witness a tremendous increase in the number of vehicles in the near future. Future traffic monitoring systems will therefore play an important role to improve the throughput and safety of roads. Current monitoring systems capture (usually vision-based) traffic data from a large sensory network; however, they require continuous human supervision which is extremely expensive. In the proposed EVis research project we investigate the scientific and technological foundations for future autonomous traffic monitoring systems. Autonomy is achieved by a novel combination of three approaches: First, vision-based detection and classification methods are augmented by self-learning and scene adaptation mechanisms which will significantly reduce the effort of manual configuration. Second, visual data is fused with data from other sensors such as radar, infrared or inductive loop sensors. Sensor fusion helps to improve the robustness and confidence, to extend the spatial and temporal coverage as well as to reduce the ambiguity and uncertainty of the processed sensor data. Finally, the developed vision and fusion methods are implemented on a distributed embedded platform which makes them wider applicable and supports real-time operation. Our autonomous traffic monitoring system will be evaluated using real world traffic data. The evaluation will be conducted in three different case studies: offline testing using recorded data, online testing on a traffic test site, and on a test installation on a public road. |
2007 | 2010 |
|
APAFA: Automated Photogrammetric Aerial Feature Analysis
(details) |
The systematic creation of models of the real world to support the locational awareness on the Internet can be achieved if previously required massive manual labor gets replaced by automated procedures. A particular challenge exists in the automation of the extraction of the 4 classical map features buildings, circulation spaces (e.g. road networks), vegetation and water bodies, as well as their interaction. Decennia of research have been unable to automate the extraction of these features from classical aerial photography towards an economically viable result. However, we believe that we can succeed in the proposed project to develop automated procedures to create feature data for three reasons. First is the recent advent of digital aerial sensors producing highly redundant digital large format aerial photography. Redundancy will be obtained by using high forward and side overlaps, say at 80% and 60%, so that every point in the terrain is imaged at least 10 times, and any algorithm can rely on multiple analysis results that then can either reinforce or cancel one another. Second, the geometric redundancy gets augmented by a radiometric redundancy using 4 spectral bands, adding an infrared band to the classical red, green and blue color channels. Third, we will combine the classical "object reconstruction" approach available from stereo procedures, by new recognition methods. While classically a "car" on a street may have been seen via a "point cloud" and would have to get recognized simply by a representation of local height anomaly on an otherwise flat reference surface, recognition includes the use of stored images of cars in a data base to actually recognize a car as a human would do when inspecting an aerial image. The project is split up into five work packages which will focus on how reconstruction and recognition techniques can help each other and how additional information either from a previous mission or GIS can be integrated in the 3D modeling framework. One work package will address the assessment of the obtained quality, another will address project management and dissemination activities. Within the project we will develop an extensive library of combined recognition/reconstruction methods, and apply them to a range of test data sets. Test data will vary in geometric resolution (pixel size), overlaps, and types of terrain scenarios. |
2007 | 2010 |
|
VM-GPU: Variational Methods on the GPU for Industrial Problems
(details) |
The project VM-GPU fits exactly to the FIT-IT Visual Computing call. It is a combination of computer vision and graphics methods to offer solutions to a problem of great relevance for industry. In particular,
The goal of VM-GPU is to make variational methods available for industrial problems by using modern graphics hardware. If successful this project will have a large impact on the machine vision industry, it will allow for the first time to use variational methods in an industrial setting, in addition having graphics cards available as computing platforms will offer completely new ways of addressing industrial vision problems (e.g., it is very easy to scale up by just using a second graphics card). |
2007 | 2009 |
|
AUTOVISTA: Advanced Unsupervised Monitoring and Visualization of Complex Scenarios
(details) |
The trend in video surveillance is an ever increasing number of (digital) cameras for surveying complex scenarios (e.g. crowds). Currently available video surveillance systems cannot cope with this increased complexity, the detection rates are too low and the systems are not reliable enough. This hinders the broad use of automatic surveillance systems. AUTOVISTA proposes to use modern visual computing technologies to advance the state-of-the-art of video surveillance considerably. In order to cope with the increasing number of cameras, AUTOVISTA will (1) use novel on-line learning techniques to increase the detection rate and decrease the false alarm rate, while the camera adapts in an unsupervised manner to the surveyed scene. Besides an increased performance, this has the additional advantage that the installation and maintenance effort will be substantially decreased; (2) exploit novel visualization and interaction techniques to support the human operator. Furthermore two complementary visualization modes are proposed, blending smoothly between these allows the operator to maintain coherence. These techniques will enable a single operator to cope simultaneously with a large amount of cameras. AUTOVISTA will tackle the problem of increased people densities and highly cluttered scenes in a novel manner. Instead of relying on single person detection and tracking (which is not feasible for high people density scenarios), methods will be investigated to handle the crowd as a whole. AUTOVISTA will derive spatio-temporal crowd statistics, describe normal crowd behavior and use this for unusual event detection. |
2007 | 2009 |
