

2013


Branch&Rank for Efficient Object Detection

Lehmann, A., Gehler, P., Van Gool, L.

International Journal of Computer Vision, Springer, December 2013 (article)

Abstract
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propose a branch&rank scheme that detects objects with often less than 100 ranking operations. This efficiency enables the use of strong and also costly classifiers like non-linear SVMs with RBF-χ² kernels. We thereby relieve an inherent limitation of branch&bound methods as bounds are often not tight enough to be effective in practice. Our approach features three key components: a ranking function that operates on sets of hypotheses and a grouping of these into different tasks. Detection efficiency results from adaptively sub-dividing the object search space into decreasingly smaller sets. This is inherited from branch&bound, while the ranking function supersedes a tight bound which is often unavailable (except for rather limited function classes). The grouping makes the system effective: it separates image classification from object recognition, yet combines them in a single formulation, phrased as a structured SVM problem. A novel aspect of branch&rank is that a better ranking function is expected to decrease the number of classifier calls during detection. We use the VOC’07 dataset to demonstrate the algorithmic properties of branch&rank.

ps

pdf link (url) [BibTex]



Extracting Postural Synergies for Robotic Grasping

Romero, J., Feix, T., Ek, C., Kjellstrom, H., Kragic, D.

Robotics, IEEE Transactions on, 29(6):1342-1352, December 2013 (article)

ps

[BibTex]



Markov Random Field Modeling, Inference & Learning in Computer Vision & Image Understanding: A Survey

Wang, C., Komodakis, N., Paragios, N.

Computer Vision and Image Understanding (CVIU), 117(11):1610-1627, November 2013 (article)

Abstract
In this paper, we present a comprehensive survey of Markov Random Fields (MRFs) in computer vision and image understanding, with respect to the modeling, the inference and the learning. While MRFs were introduced into the computer vision field about two decades ago, they started to become a ubiquitous tool for solving visual perception problems around the turn of the millennium following the emergence of efficient inference methods. During the past decade, a variety of MRF models as well as inference and learning methods have been developed for addressing numerous low, mid and high-level vision problems. While most of the literature concerns pairwise MRFs, in recent years we have also witnessed significant progress in higher-order MRFs, which substantially enhances the expressiveness of graph-based models and expands the domain of solvable problems. This survey provides a compact and informative summary of the major literature in this research topic.

ps

Publishers site pdf [BibTex]



Vision meets Robotics: The KITTI Dataset

Geiger, A., Lenz, P., Stiller, C., Urtasun, R.

International Journal of Robotics Research, 32(11):1231-1237, Sage Publishing, September 2013 (article)

Abstract
We present a novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research. In total, we recorded 6 hours of traffic scenarios at 10-100 Hz using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras, a Velodyne 3D laser scanner and a high-precision GPS/IMU inertial navigation system. The scenarios are diverse, capturing real-world traffic situations and range from freeways over rural areas to inner-city scenes with many static and dynamic objects. Our data is calibrated, synchronized and timestamped, and we provide the rectified and raw image sequences. Our dataset also contains object labels in the form of 3D tracklets and we provide online benchmarks for stereo, optical flow, object detection and other tasks. This paper describes our recording platform, the data format and the utilities that we provide.

avg ps

pdf DOI [BibTex]



Human Pose Calculation from Optical Flow Data

Black, M., Loper, M., Romero, J., Zuffi, S.

European Patent Application EP 2843621, August 2013 (patent)

ps

Google Patents [BibTex]



Unscented Kalman Filtering on Riemannian Manifolds

Soren Hauberg, Francois Lauze, Kim S. Pedersen

Journal of Mathematical Imaging and Vision, 46(1):103-120, Springer Netherlands, May 2013 (article)

ps

Publishers site PDF [BibTex]



Quasi-Newton Methods: A New Direction

Hennig, P., Kiefel, M.

Journal of Machine Learning Research, 14(1):843-865, March 2013 (article)

Abstract
Four decades after their invention, quasi-Newton methods are still state of the art in unconstrained numerical optimization. Although not usually interpreted thus, these are learning algorithms that fit a local quadratic approximation to the objective function. We show that many, including the most popular, quasi-Newton methods can be interpreted as approximations of Bayesian linear regression under varying prior assumptions. This new notion elucidates some shortcomings of classical algorithms, and lights the way to a novel nonparametric quasi-Newton method, which is able to make more efficient use of available information at computational cost similar to its predecessors.

ei ps pn

website+code pdf link (url) [BibTex]



Hybrid nanocolloids with programmed three-dimensional shape and material composition

Mark, A. G., Gibbs, J. G., Lee, T., Fischer, P.

NATURE MATERIALS, 12(9):802-807, 2013, Max Planck Press Release. (article)

Abstract
Tuning the optical(1,2), electromagnetic(3,4) and mechanical properties of a material requires simultaneous control over its composition and shape(5). This is particularly challenging for complex structures at the nanoscale because surface-energy minimization generally causes small structures to be highly symmetric(5). Here we combine low-temperature shadow deposition with nanoscale patterning to realize nanocolloids with anisotropic three-dimensional shapes, feature sizes down to 20 nm and a wide choice of materials. We demonstrate the versatility of the fabrication scheme by growing three-dimensional hybrid nanostructures that contain several functional materials with the lowest possible symmetry, and by fabricating hundreds of billions of plasmonic nanohelices, which we use as chiral metafluids with record circular dichroism and tunable chiroptical properties.


pf

Video - Fabrication of Designer Nanostructures DOI [BibTex]


Chiral Colloidal Molecules And Observation of The Propeller Effect

Schamel, D., Pfeifer, M., Gibbs, J. G., Miksch, B., Mark, A. G., Fischer, P.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 135(33):12353-12359, 2013 (article)

Abstract
Chiral molecules play an important role in biological and chemical processes, but physical effects due to their symmetry-breaking are generally weak. Several physical chiral separation schemes which could potentially be useful, including the propeller effect, have therefore not yet been demonstrated at the molecular scale. However, it has been proposed that complex nonspherical colloidal particles could act as “colloidal molecules” in mesoscopic model systems to permit the visualization of molecular phenomena that are otherwise difficult to observe. Unfortunately, it is difficult to synthesize such colloids because surface minimization generally favors the growth of symmetric particles. Here we demonstrate the production of large numbers of complex colloids with glancing angle physical vapor deposition. We use chiral colloids to demonstrate the Baranova and Zel'dovich (Baranova, N. B.; Zel'dovich, B. Y. Chem. Phys. Lett. 1978, 57, 435) propeller effect: the separation of a racemic mixture by application of a rotating field that couples to the dipole moment of the enantiomers and screw propels them in opposite directions. The handedness of the colloidal suspensions is monitored with circular differential light scattering. An exact solution for the colloid's propulsion is derived, and comparisons between the colloidal system and the corresponding effect at the molecular scale are made.

pf

Video - Nanopropellers DOI [BibTex]



Indirect absorption spectroscopy using quantum cascade lasers: mid-infrared refractometry and photothermal spectroscopy

Pfeifer, M., Ruf, A., Fischer, P.

OPTICS EXPRESS, 21(22):25643-25654, 2013 (article)

Abstract
We record vibrational spectra with two indirect schemes that depend on the real part of the index of refraction: mid-infrared refractometry and photothermal spectroscopy. In the former, a quantum cascade laser (QCL) spot is imaged to determine the angles of total internal reflection, which yields the absorption line via a beam profile analysis. In the photothermal measurements, a tunable QCL excites vibrational resonances of a molecular monolayer, which heats the surrounding medium and changes its refractive index. This is observed with a probe laser in the visible. Sub-monolayer sensitivities are demonstrated. (C) 2013 Optical Society of America

pf

DOI [BibTex]


Plasmonic nanohelix metamaterials with tailorable giant circular dichroism

Gibbs, J. G., Mark, A. G., Eslami, S., Fischer, P.

APPLIED PHYSICS LETTERS, 103(21), 2013, Featured cover article. (article)

Abstract
Plasmonic nanohelix arrays are shown to interact with electromagnetic fields in ways not typically seen with ordinary matter. Chiral metamaterials (CMMs) with feature sizes small with respect to the wavelength of visible light are a promising route to experimentally achieve such phenomena as negative refraction without the need for simultaneously negative ε and μ. Here we not only show that giant circular dichroism in the visible is achievable with hexagonally arranged plasmonic nanohelix arrays, but that we can precisely tune the optical activity via morphology and lattice spacing. The discrete dipole approximation is implemented to support experimental data. (C) 2013 AIP Publishing LLC.


pf

DOI [BibTex]



Information Driven Self-Organization of Complex Robotic Behaviors

Martius, G., Der, R., Ay, N.

PLoS ONE, 8(5):e63400, Public Library of Science, 2013 (article)

al

link (url) DOI [BibTex]



Random Forests for Real Time 3D Face Analysis

Fanelli, G., Dantone, M., Gall, J., Fossati, A., van Gool, L.

International Journal of Computer Vision, 101(3):437-458, Springer, 2013 (article)

Abstract
We present a random forest-based framework for real time head pose estimation from depth images and extend it to localize a set of facial features in 3D. Our algorithm takes a voting approach, where each patch extracted from the depth image can directly cast a vote for the head pose or each of the facial features. Our system proves capable of handling large rotations, partial occlusions, and the noisy depth data acquired using commercial sensors. Moreover, the algorithm works on each frame independently and achieves real time performance without resorting to parallel computations on a GPU. We present extensive experiments on publicly available, challenging datasets and present a new annotated head pose database recorded using a Microsoft Kinect.

ps

data and code publisher's site pdf DOI Project Page [BibTex]



Markerless Motion Capture of Multiple Characters Using Multi-view Image Segmentation

Liu, Y., Gall, J., Stoll, C., Dai, Q., Seidel, H., Theobalt, C.

Transactions on Pattern Analysis and Machine Intelligence, 35(11):2720-2735, 2013 (article)

Abstract
Capturing the skeleton motion and detailed time-varying surface geometry of multiple, closely interacting people is a very challenging task, even in a multicamera setup, due to frequent occlusions and ambiguities in feature-to-person assignments. To address this task, we propose a framework that exploits multiview image segmentation. To this end, a probabilistic shape and appearance model is employed to segment the input images and to assign each pixel uniquely to one person. Given the articulated template models of each person and the labeled pixels, a combined optimization scheme, which splits the skeleton pose optimization problem into a local one and a lower dimensional global one, is applied one by one to each individual, followed by surface estimation to capture detailed nonrigid deformations. We show on various sequences that our approach can capture the 3D motion of humans accurately even if they move rapidly, if they wear wide apparel, and if they are engaged in challenging multiperson motions, including dancing, wrestling, and hugging.

ps

data and video pdf DOI Project Page [BibTex]



Viewpoint and pose in body-form adaptation

Sekunova, A., Black, M., Parkinson, L., Barton, J. J. S.

Perception, 42(2):176-186, 2013 (article)

Abstract
Faces and bodies are complex structures, perception of which can play important roles in person identification and inference of emotional state. Face representations have been explored using behavioural adaptation: in particular, studies have shown that face aftereffects show relatively broad tuning for viewpoint, consistent with origin in a high-level structural descriptor far removed from the retinal image. Our goals were to determine first, if body aftereffects also showed a degree of viewpoint invariance, and second if they also showed pose invariance, given that changes in pose create even more dramatic changes in the 2-D retinal image. We used a 3-D model of the human body to generate headless body images, whose parameters could be varied to generate different body forms, viewpoints, and poses. In the first experiment, subjects adapted to varying viewpoints of either slim or heavy bodies in a neutral stance, followed by test stimuli that were all front-facing. In the second experiment, we used the same front-facing bodies in neutral stance as test stimuli, but compared adaptation from bodies in the same neutral stance to adaptation with the same bodies in different poses. We found that body aftereffects were obtained over substantial viewpoint changes, with no significant decline in aftereffect magnitude with increasing viewpoint difference between adapting and test images. Aftereffects also showed transfer across one change in pose but not across another. We conclude that body representations may have more viewpoint invariance than faces, and demonstrate at least some transfer across pose, consistent with a high-level structural description. Keywords: aftereffect, shape, face, representation

ps

pdf from publisher abstract pdf link (url) Project Page [BibTex]



Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

Zahedi, K., Martius, G., Ay, N.

Frontiers in Psychology, 4(801), 2013 (article)

Abstract
One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviours. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary step in which we investigate the predictive information (the mutual information of the past and future of the sensor stream) as an intrinsic drive, ideally supporting any kind of task acquisition. Previous experiments have shown that the predictive information (PI) is a good candidate to support autonomous, open-ended learning of complex behaviours, because a maximisation of the PI corresponds to an exploration of morphology- and environment-dependent behavioural regularities. The idea is that these regularities can then be exploited in order to solve any given task. Three different experiments are presented and their results lead to the conclusion that the linear combination of the one-step PI with an external reward function is not generally recommended in an episodic policy gradient setting. Only for hard tasks can a great speed-up be achieved, at the cost of a loss in asymptotic performance.

al

link (url) DOI [BibTex]


Robustness of guided self-organization against sensorimotor disruptions

Martius, G.

Advances in Complex Systems, 16(02n03):1350001, 2013 (article)

Abstract
Self-organizing processes are crucial for the development of living beings. Practical applications in robots may benefit from the self-organization of behavior, e.g. to increase fault tolerance and enhance flexibility, provided that external goals can also be achieved. We present results on the guidance of self-organizing control by visual target stimuli and show a remarkable robustness to sensorimotor disruptions. In a proof of concept study an autonomous wheeled robot is learning an object finding and ball-pushing task from scratch within a few minutes in continuous domains. The robustness is demonstrated by the rapid recovery of the performance after severe changes of the sensor configuration.

al

DOI [BibTex]



Non-parametric hand pose estimation with object context

Romero, J., Kjellström, H., Ek, C. H., Kragic, D.

Image and Vision Computing, 31(8):555-564, 2013 (article)

Abstract
In the spirit of recent work on contextual recognition and estimation, we present a method for estimating the pose of human hands, employing information about the shape of the object in the hand. Despite the fact that most applications of human hand tracking involve grasping and manipulation of objects, the majority of methods in the literature assume a free hand, isolated from the surrounding environment. Occlusion of the hand by grasped objects does in fact often pose a severe challenge to the estimation of hand pose. In the presented method, object occlusion is not only compensated for, it contributes to the pose estimation in a contextual fashion; this without an explicit model of object shape. Our hand tracking method is non-parametric, performing a nearest neighbor search in a large database (.. entries) of hand poses with and without grasped objects. The system operates in real time, is robust to self-occlusions, object occlusions and segmentation errors, and provides full hand pose reconstruction from monocular video. Temporal consistency in hand pose is taken into account, without explicitly tracking the hand in the high-dimensional pose space. Experiments show the non-parametric method to outperform other state-of-the-art regression methods, while operating at a significantly lower computational cost than comparable model-based hand tracking methods.

ps

Publisher site pdf link (url) [BibTex]



Nonlinear optical spectroscopy of chiral molecules

Fischer, P., Hache, F.

CHIRALITY, 17(8):421-437, 2005 (article)

Abstract
We review nonlinear optical processes that are specific to chiral molecules in solution and on surfaces. In contrast to conventional natural optical activity phenomena, which depend linearly on the electric field strength of the optical field, we discuss how optical processes that are nonlinear (quadratic, cubic, and quartic) functions of the electromagnetic field strength may probe optically active centers and chiral vibrations. We show that nonlinear techniques open entirely new ways of exploring chirality in chemical and biological systems: The cubic processes give rise to nonlinear circular dichroism and nonlinear optical rotation and make it possible to observe dynamic chiral processes at ultrafast time scales. The quadratic second-harmonic and sum-frequency-generation phenomena and the quartic processes may arise entirely in the electric-dipole approximation and do not require the use of circularly polarized light to detect chirality: They provide surface selectivity and their observables can be relatively much larger than in linear optical activity. These processes also give rise to the generation of light at a new color, and in liquids this frequency conversion only occurs if the solution is optically active. We survey recent chiral nonlinear optical experiments and give examples of their application to problems of biophysical interest. (C) 2005 Wiley-Liss, Inc.

pf

DOI [BibTex]



Negative refraction at optical frequencies in nonmagnetic two-component molecular media

Chen, Y., Fischer, P., Wise, F.

PHYSICAL REVIEW LETTERS, 95(6), 2005 (article)

Abstract
There is significant motivation to develop media with negative refractive indices at optical frequencies, but efforts in this direction are hampered by the weakness of the magnetic response at such frequencies. We show theoretically that a nonmagnetic medium with two atomic or molecular constituents can exhibit a negative refractive index. A negative index is possible even when the real parts of both the permittivity and permeability are positive. This surprising result provides a route to isotropic negative-index media at optical frequencies.

pf

DOI [BibTex]
