Header logo is



{MoSh}: Motion and Shape Capture from Sparse Markers
MoSh: Motion and Shape Capture from Sparse Markers

Loper, M. M., Mahmood, N., Black, M. J.

ACM Transactions on Graphics, (Proc. SIGGRAPH Asia), 33(6):220:1-220:13, ACM, New York, NY, USA, November 2014 (article)

Abstract
Marker-based motion capture (mocap) is widely criticized as producing lifeless animations. We argue that important information about body surface motion is present in standard marker sets but is lost in extracting a skeleton. We demonstrate a new approach called MoSh (Motion and Shape capture), that automatically extracts this detail from mocap data. MoSh estimates body shape and pose together using sparse marker data by exploiting a parametric model of the human body. In contrast to previous work, MoSh solves for the marker locations relative to the body and estimates accurate body shape directly from the markers without the use of 3D scans; this effectively turns a mocap system into an approximate body scanner. MoSh is able to capture soft tissue motions directly from markers by allowing body shape to vary over time. We evaluate the effect of different marker sets on pose and shape accuracy and propose a new sparse marker set for capturing soft-tissue motion. We illustrate MoSh by recovering body shape, pose, and soft-tissue motion from archival mocap data and using this to produce animations with subtlety and realism. We also show soft-tissue motion retargeting to new characters and show how to magnify the 3D deformations of soft tissue to create animations with appealing exaggerations.

ps

pdf video data pdf from publisher link (url) DOI Project Page Project Page Project Page [BibTex]

pdf video data pdf from publisher link (url) DOI Project Page Project Page Project Page [BibTex]


Can I recognize my body’s weight? The influence of shape and texture on the perception of self
Can I recognize my body’s weight? The influence of shape and texture on the perception of self

Piryankova, I., Stefanucci, J., Romero, J., de la Rosa, S., Black, M., Mohler, B.

ACM Transactions on Applied Perception for the Symposium on Applied Perception, 11(3):13:1-13:18, September 2014 (article)

Abstract
The goal of this research was to investigate women’s sensitivity to changes in their perceived weight by altering the body mass index (BMI) of the participants’ personalized avatars displayed on a large-screen immersive display. We created the personalized avatars with a full-body 3D scanner that records both the participants’ body geometry and texture. We altered the weight of the personalized avatars to produce changes in BMI while keeping height, arm length and inseam fixed and exploited the correlation between body geometry and anthropometric measurements encapsulated in a statistical body shape model created from thousands of body scans. In a 2x2 psychophysical experiment, we investigated the relative importance of visual cues, namely shape (own shape vs. an average female body shape with equivalent height and BMI to the participant) and texture (own photo-realistic texture or checkerboard pattern texture) on the ability to accurately perceive own current body weight (by asking them ‘Is the avatar the same weight as you?’). Our results indicate that shape (where height and BMI are fixed) had little effect on the perception of body weight. Interestingly, the participants perceived their body weight veridically when they saw their own photo-realistic texture and significantly underestimated their body weight when the avatar had a checkerboard patterned texture. The range that the participants accepted as their own current weight was approximately a 0.83 to −6.05 BMI% change tolerance range around their perceived weight. Both the shape and the texture had an effect on the reported similarity of the body parts and the whole avatar to the participant’s body. This work has implications for new measures for patients with body image disorders, as well as researchers interested in creating personalized avatars for games, training applications or virtual reality.

ps

pdf DOI Project Page Project Page [BibTex]

pdf DOI Project Page Project Page [BibTex]


Breathing Life into Shape: Capturing, Modeling and Animating {3D} Human Breathing
Breathing Life into Shape: Capturing, Modeling and Animating 3D Human Breathing

Tsoli, A., Mahmood, N., Black, M. J.

ACM Transactions on Graphics, (Proc. SIGGRAPH), 33(4):52:1-52:11, ACM, New York, NY, July 2014 (article)

Abstract
Modeling how the human body deforms during breathing is important for the realistic animation of lifelike 3D avatars. We learn a model of body shape deformations due to breathing for different breathing types and provide simple animation controls to render lifelike breathing regardless of body shape. We capture and align high-resolution 3D scans of 58 human subjects. We compute deviations from each subject’s mean shape during breathing, and study the statistics of such shape changes for different genders, body shapes, and breathing types. We use the volume of the registered scans as a proxy for lung volume and learn a novel non-linear model relating volume and breathing type to 3D shape deformations and pose changes. We then augment a SCAPE body model so that body shape is determined by identity, pose, and the parameters of the breathing model. These parameters provide an intuitive interface with which animators can synthesize 3D human avatars with realistic breathing motions. We also develop a novel interface for animating breathing using a spirometer, which measures the changes in breathing volume of a “breath actor.”

ps

pdf video link (url) DOI Project Page Project Page Project Page [BibTex]


Nanopropellers and Their Actuation in Complex Viscoelastic Media
Nanopropellers and Their Actuation in Complex Viscoelastic Media

Schamel, D., Mark, A. G., Gibbs, J. G., Miksch, C., Morozov, K. I., Leshansky, A. M., Fischer, P.

ACS Nano, 8(9):8794-8801, June 2014, Featured cover article. (article)

Abstract
Tissue and biological fluids are complex viscoelastic media with a nanoporous macromolecular structure. Here, we demonstrate that helical nanopropellers can be controllably steered through such a biological gel. The screw-propellers have a filament diameter of about 70 nm and are smaller than previously reported nanopropellers as well as any swimming microorganism. We show that the nanoscrews will move through high-viscosity solutions with comparable velocities to that of larger micropropellers, even though they are so small that Brownian forces suppress their actuation in pure water. When actuated in viscoelastic hyaluronan gels, the nanopropellers appear to have a significant advantage, as they are of the same size range as the gel’s mesh size. Whereas larger helices will show very low or negligible propulsion in hyaluronan solutions, the nanoscrews actually display significantly enhanced propulsion velocities that exceed the highest measured speeds in Newtonian fluids. The nanopropellers are not only promising for applications in the extracellular environment but small enough to be taken up by cells.

Featured cover article.

pf

Video - Helical Micro and Nanopropellers for Applications in Biological Fluidic Environments link (url) DOI [BibTex]


Convertor
Convertor

Fischer, P., Mark, A.

May 2014 (patent)

pf

[BibTex]

[BibTex]


3D Traffic Scene Understanding from Movable Platforms
3D Traffic Scene Understanding from Movable Platforms

Geiger, A., Lauer, M., Wojek, C., Stiller, C., Urtasun, R.

IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 36(5):1012-1025, published, IEEE, Los Alamitos, CA, May 2014 (article)

Abstract
In this paper, we present a novel probabilistic generative model for multi-object traffic scene understanding from movable platforms which reasons jointly about the 3D scene layout as well as the location and orientation of objects in the scene. In particular, the scene topology, geometry and traffic activities are inferred from short video sequences. Inspired by the impressive driving capabilities of humans, our model does not rely on GPS, lidar or map knowledge. Instead, it takes advantage of a diverse set of visual cues in the form of vehicle tracklets, vanishing points, semantic scene labels, scene flow and occupancy grids. For each of these cues we propose likelihood functions that are integrated into a probabilistic generative model. We learn all model parameters from training data using contrastive divergence. Experiments conducted on videos of 113 representative intersections show that our approach successfully infers the correct layout in a variety of very challenging scenarios. To evaluate the importance of each feature cue, experiments using different feature combinations are conducted. Furthermore, we show how by employing context derived from the proposed method we are able to improve over the state-of-the-art in terms of object detection and object orientation estimation in challenging and cluttered urban environments.

avg ps

pdf link (url) [BibTex]

pdf link (url) [BibTex]


Adaptive Offset Correction for Intracortical Brain Computer Interfaces
Adaptive Offset Correction for Intracortical Brain Computer Interfaces

Homer, M. L., Perge, J. A., Black, M. J., Harrison, M. T., Cash, S. S., Hochberg, L. R.

IEEE Transactions on Neural Systems and Rehabilitation Engineering, 22(2):239-248, March 2014 (article)

Abstract
Intracortical brain computer interfaces (iBCIs) decode intended movement from neural activity for the control of external devices such as a robotic arm. Standard approaches include a calibration phase to estimate decoding parameters. During iBCI operation, the statistical properties of the neural activity can depart from those observed during calibration, sometimes hindering a user’s ability to control the iBCI. To address this problem, we adaptively correct the offset terms within a Kalman filter decoder via penalized maximum likelihood estimation. The approach can handle rapid shifts in neural signal behavior (on the order of seconds) and requires no knowledge of the intended movement. The algorithm, called MOCA, was tested using simulated neural activity and evaluated retrospectively using data collected from two people with tetraplegia operating an iBCI. In 19 clinical research test cases, where a nonadaptive Kalman filter yielded relatively high decoding errors, MOCA significantly reduced these errors (10.6 ± 10.1\%; p < 0.05, pairwise t-test). MOCA did not significantly change the error in the remaining 23 cases where a nonadaptive Kalman filter already performed well. These results suggest that MOCA provides more robust decoding than the standard Kalman filter for iBCIs.

ps

pdf DOI Project Page [BibTex]

pdf DOI Project Page [BibTex]


Circular polarization interferometry: circularly polarized modes of cholesteric liquid crystals
Circular polarization interferometry: circularly polarized modes of cholesteric liquid crystals

Sanchez-Castillo, A., Eslami, S., Giesselmann, F., Fischer, P.

OPTICS EXPRESS, 22(25):31227-31236, 2014 (article)

Abstract
We describe a novel polarization interferometer which permits the determination of the refractive indices for circularly-polarized light. It is based on a Jamin-Lebedeff interferometer, modified with waveplates, and permits us to experimentally determine the refractive indices n(L) and n(R) of the respectively left- and right-circularly polarized modes in a cholesteric liquid crystal. Whereas optical rotation measurements only determine the circular birefringence, i.e. the difference (n(L) - n(R)), the interferometer also permits the determination of their absolute values. We report refractive indices of a cholesteric liquid crystal in the region of selective (Bragg) reflection as a function of temperature. (C) 2014 Optical Society of America

pf

DOI [BibTex]

DOI [BibTex]


Self-Propelling Nanomotors in the Presence of Strong Brownian Forces
Self-Propelling Nanomotors in the Presence of Strong Brownian Forces

Lee, T., Alarcon-Correa, M., Miksch, C., Hahn, K., Gibbs, J. G., Fischer, P.

NANO LETTERS, 14(5):2407-2412, 2014 (article)

Abstract
Motility in living systems is due to an array of complex molecular nanomotors that are essential for the function and survival of cells. These protein nanomotors operate not only despite of but also because of stochastic forces. Artificial means of realizing motility rely on local concentration or temperature gradients that are established across a particle, resulting in slip velocities at the particle surface and thus motion of the particle relative to the fluid. However, it remains unclear if these artificial motors can function at the smallest of scales, where Brownian motion dominates and no actively propelled living organisms can be found. Recently, the first reports have appeared suggesting that the swimming mechanisms of artificial structures may also apply to enzymes that are catalytically active. Here we report a scheme to realize artificial Janus nanoparticles (JNPs) with an overall size that is comparable to that of some enzymes similar to 30 nm. Our JNPs can catalyze the decomposition of hydrogen peroxide to water and oxygen and thus actively move by self-electrophoresis. Geometric anisotropy of the Pt-Au Janus nanoparticles permits the simultaneous observation of their translational and rotational motion by dynamic light scattering. While their dynamics is strongly influenced by Brownian rotation, the artificial Janus nanomotors show bursts of linear ballistic motion resulting in enhanced diffusion.

pf

DOI [BibTex]


Shape control in wafer-based aperiodic 3D nanostructures
Shape control in wafer-based aperiodic 3D nanostructures

Hyeon-Ho, J., Mark, A. G., Gibbs, J. G., Reindl, T., Waizmann, U., Weis, J., Fischer, P.

NANOTECHNOLOGY, 25(23), 2014, Cover article. (article)

Abstract
Controlled local fabrication of three-dimensional (3D) nanostructures is important to explore and enhance the function of single nanodevices, but is experimentally challenging. We present a scheme based on e-beam lithography (EBL) written seeds, and glancing angle deposition (GLAD) grown structures to create nanoscale objects with defined shapes but in aperiodic arrangements. By using a continuous sacrificial corral surrounding the features of interest we grow isolated 3D nanostructures that have complex cross-sections and sidewall morphology that are surrounded by zones of clean substrate.

Cover article.

pf

DOI [BibTex]

DOI [BibTex]


A freely-moving monkey treadmill model
A freely-moving monkey treadmill model

Foster, J., Nuyujukian, P., Freifeld, O., Gao, H., Walker, R., Ryu, S., Meng, T., Murmann, B., Black, M., Shenoy, K.

J. of Neural Engineering, 11(4):046020, 2014 (article)

Abstract
Objective: Motor neuroscience and brain-machine interface (BMI) design is based on examining how the brain controls voluntary movement, typically by recording neural activity and behavior from animal models. Recording technologies used with these animal models have traditionally limited the range of behaviors that can be studied, and thus the generality of science and engineering research. We aim to design a freely-moving animal model using neural and behavioral recording technologies that do not constrain movement. Approach: We have established a freely-moving rhesus monkey model employing technology that transmits neural activity from an intracortical array using a head-mounted device and records behavior through computer vision using markerless motion capture. We demonstrate the excitability and utility of this new monkey model, including the fi rst recordings from motor cortex while rhesus monkeys walk quadrupedally on a treadmill. Main results: Using this monkey model, we show that multi-unit threshold-crossing neural activity encodes the phase of walking and that the average ring rate of the threshold crossings covaries with the speed of individual steps. On a population level, we find that neural state-space trajectories of walking at diff erent speeds have similar rotational dynamics in some dimensions that evolve at the step rate of walking, yet robustly separate by speed in other state-space dimensions. Significance: Freely-moving animal models may allow neuroscientists to examine a wider range of behaviors and can provide a flexible experimental paradigm for examining the neural mechanisms that underlie movement generation across behaviors and environments. For BMIs, freely-moving animal models have the potential to aid prosthetic design by examining how neural encoding changes with posture, environment, and other real-world context changes. Understanding this new realm of behavior in more naturalistic settings is essential for overall progress of basic motor neuroscience and for the successful translation of BMIs to people with paralysis.

ps

pdf Supplementary DOI Project Page [BibTex]

pdf Supplementary DOI Project Page [BibTex]


Swimming by reciprocal motion at low Reynolds number
Swimming by reciprocal motion at low Reynolds number

Qiu, T., Lee, T., Mark, A. G., Morozov, K. I., Muenster, R., Mierka, O., Turek, S., Leshansky, A. M., Fischer, P.

NATURE COMMUNICATIONS, 5, 2014, Max Planck Press Release. (article)

Abstract
Biological microorganisms swim with flagella and cilia that execute nonreciprocal motions for low Reynolds number (Re) propulsion in viscous fluids. This symmetry requirement is a consequence of Purcell's scallop theorem, which complicates the actuation scheme needed by microswimmers. However, most biomedically important fluids are non-Newtonian where the scallop theorem no longer holds. It should therefore be possible to realize a microswimmer that moves with reciprocal periodic body-shape changes in non-Newtonian fluids. Here we report a symmetric `micro-scallop', a single-hinge microswimmer that can propel in shear thickening and shear thinning (non-Newtonian) fluids by reciprocal motion at low Re. Excellent agreement between our measurements and both numerical and analytical theoretical predictions indicates that the net propulsion is caused by modulation of the fluid viscosity upon varying the shear rate. This reciprocal swimming mechanism opens new possibilities in designing biomedical microdevices that can propel by a simple actuation scheme in non-Newtonian biological fluids.

Max Planck Press Release.

pf

Video - A Swimming Micro-Scallop Video - Winner of the Micro-robotic Design Challenge in Hamlyn Symposium on Medical Robotics DOI [BibTex]

Video - A Swimming Micro-Scallop Video - Winner of the Micro-robotic Design Challenge in Hamlyn Symposium on Medical Robotics DOI [BibTex]


Nanohelices by shadow growth
Nanohelices by shadow growth

Gibbs, J. G., Mark, A. G., Lee, T., Eslami, S., Schamel, D., Fischer, P.

NANOSCALE, 6(16):9457-9466, 2014 (article)

Abstract
The helix has remarkable qualities and is prevalent in many fields including mathematics, physics, chemistry, and biology. This shape, which is chiral by nature, is ubiquitous in biology with perhaps the most famous example being DNA. Other naturally occurring helices are common at the nanoscale in the form of protein secondary structures and in various macromolecules. Nanoscale helices exhibit a wide range of interesting mechanical, optical, and electrical properties which can be intentionally engineered into the structure by choosing the correct morphology and material. As technology advances, these fabrication parameters can be fine-tuned and matched to the application of interest. Herein, we focus on the fabrication and properties of nanohelices grown by a dynamic shadowing growth method combined with fast wafer-scale substrate patterning which has a number of distinct advantages. We review the fabrication methodology and provide several examples that illustrate the generality and utility of nanohelices shadow-grown on nanopatterns.

pf

Video - Fabrication of Designer Nanostructures DOI [BibTex]


Chiral Nanomagnets
Chiral Nanomagnets

Eslami, S., Gibbs, J. G., Rechkemmer, Y., van Slageren, J., Alarcon-Correa, M., Lee, T., Mark, A. G., Rikken, G. L. J. A., Fischer, P.

ACS PHOTONICS, 1(11):1231-1236, 2014 (article)

Abstract
We report on the enhanced optical properties of chiral magnetic nanohelices with critical dimensions comparable to the ferromagnetic domain size. They are shown to be ferromagnetic at room temperature, have defined chirality, and exhibit large optical activity in the visible as verified by electron microscopy, superconducting quantum interference device (SQUID) magnetometry, natural circular dichroism (NCD), and magnetic circular dichroism (MCD) measurements. The structures exhibit magneto-chiral dichroism (MChD), which directly demonstrates coupling between their structural chirality and magnetism. A chiral nickel (Ni) film consisting of an array of nanohelices similar to 100 nm in length exhibits an MChD anisotropy factor g(MChD) approximate to 10(-4) T-1 at room temperature in a saturation field of similar to 0.2 T, permitting polarization-independent control of the film's absorption properties through magnetic field modulation. This is also the first report of MChD in a material with structural chirality on the order of the wavelength of light, and therefore the Ni nanohelix array is a metamaterial with magnetochiral properties that can be tailored through a dynamic deposition process.

pf

DOI [BibTex]

DOI [BibTex]


Wireless powering of e-swimmers
Wireless powering of e-swimmers

Roche, J., Carrara, S., Sanchez, J., Lannelongue, J., Loget, G., Bouffier, L., Fischer, P., Kuhn, A.

SCIENTIFIC REPORTS, 4, 2014 (article)

Abstract
Miniaturized structures that can move in a controlled way in solution and integrate various functionalities are attracting considerable attention due to the potential applications in fields ranging from autonomous micromotors to roving sensors. Here we introduce a concept which allows, depending on their specific design, the controlled directional motion of objects in water, combined with electronic functionalities such as the emission of light, sensing, signal conversion, treatment and transmission. The approach is based on electric field-induced polarization, which triggers different chemical reactions at the surface of the object and thereby its propulsion. This results in a localized electric current that can power in a wireless way electronic devices in water, leading to a new class of electronic swimmers (e-swimmers).

pf

DOI [BibTex]

DOI [BibTex]


Swelling and shrinking behaviour of photoresponsive phosphonium-based ionogel microstructures
Swelling and shrinking behaviour of photoresponsive phosphonium-based ionogel microstructures

Czugala, M., O’Connell, C., Blin, C., Fischer, P., Fraser, K. J., Benito-Lopez, F., Diamond, D.

SENSORS AND ACTUATORS B-CHEMICAL, 194, pages: 105-113, 2014 (article)

Abstract
Photoresponsive N-isopropylacrylamide ionogel microstructures are presented in this study. These ionogels are synthesised using phosphonium based room temperature ionic liquids, together with the photochromic compound benzospiropyran. The microstructures can be actuated using light irradiation, facilitating non-contact and non-invasive operation. For the first time, the characterisation of the swelling and shrinking behaviour of several photopatterned ionogel microstructures is presented and the influence of surface-area-to-volume ratio on the swelling kinetics is evaluated. It was found that the swelling and shrinking behaviour of the ionogels is strongly dependent on the nature of the ionic liquid. In particular, the {[}P-6,P-6,P-6,P-14]{[}NTf2] ionogel exhibits the greatest degree of swelling, reaching up to 180\% of its initial size, and the fastest shrinkage rate (k(sh) = 29 +/- 4 x 10(-2) s(-1)). (C) 2014 Elsevier B. V. All rights reserved.

pf

DOI [BibTex]

DOI [BibTex]


A Quantitative Analysis of Current Practices in Optical Flow Estimation and the Principles behind Them
A Quantitative Analysis of Current Practices in Optical Flow Estimation and the Principles behind Them

Sun, D., Roth, S., Black, M. J.

International Journal of Computer Vision (IJCV), 106(2):115-137, 2014 (article)

Abstract
The accuracy of optical flow estimation algorithms has been improving steadily as evidenced by results on the Middlebury optical flow benchmark. The typical formulation, however, has changed little since the work of Horn and Schunck. We attempt to uncover what has made recent advances possible through a thorough analysis of how the objective function, the optimization method, and modern implementation practices influence accuracy. We discover that "classical'' flow formulations perform surprisingly well when combined with modern optimization and implementation techniques. One key implementation detail is the median filtering of intermediate flow fields during optimization. While this improves the robustness of classical methods it actually leads to higher energy solutions, meaning that these methods are not optimizing the original objective function. To understand the principles behind this phenomenon, we derive a new objective function that formalizes the median filtering heuristic. This objective function includes a non-local smoothness term that robustly integrates flow estimates over large spatial neighborhoods. By modifying this new term to include information about flow and image boundaries we develop a method that can better preserve motion details. To take advantage of the trend towards video in wide-screen format, we further introduce an asymmetric pyramid downsampling scheme that enables the estimation of longer range horizontal motions. The methods are evaluated on Middlebury, MPI Sintel, and KITTI datasets using the same parameter settings.

ps

pdf full text code [BibTex]

pdf full text code [BibTex]

2013


Branch\&Rank for Efficient Object Detection
Branch&Rank for Efficient Object Detection

Lehmann, A., Gehler, P., VanGool, L.

International Journal of Computer Vision, Springer, December 2013 (article)

Abstract
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propose a branch&rank scheme that detects objects with often less than 100 ranking operations. This efficiency enables the use of strong and also costly classifiers like non-linear SVMs with RBF-TeX kernels. We thereby relieve an inherent limitation of branch&bound methods as bounds are often not tight enough to be effective in practice. Our approach features three key components: a ranking function that operates on sets of hypotheses and a grouping of these into different tasks. Detection efficiency results from adaptively sub-dividing the object search space into decreasingly smaller sets. This is inherited from branch&bound, while the ranking function supersedes a tight bound which is often unavailable (except for rather limited function classes). The grouping makes the system effective: it separates image classification from object recognition, yet combines them in a single formulation, phrased as a structured SVM problem. A novel aspect of branch&rank is that a better ranking function is expected to decrease the number of classifier calls during detection. We use the VOC’07 dataset to demonstrate the algorithmic properties of branch&rank.

ps

pdf link (url) [BibTex]

2013


pdf link (url) [BibTex]


Extracting Postural Synergies for Robotic Grasping
Extracting Postural Synergies for Robotic Grasping

Romero, J., Feix, T., Ek, C., Kjellstrom, H., Kragic, D.

Robotics, IEEE Transactions on, 29(6):1342-1352, December 2013 (article)

ps

[BibTex]

[BibTex]


Markov Random Field Modeling, Inference & Learning in Computer Vision & Image Understanding: A Survey
Markov Random Field Modeling, Inference & Learning in Computer Vision & Image Understanding: A Survey

Wang, C., Komodakis, N., Paragios, N.

Computer Vision and Image Understanding (CVIU), 117(11):1610-1627, November 2013 (article)

Abstract
In this paper, we present a comprehensive survey of Markov Random Fields (MRFs) in computer vision and image understanding, with respect to the modeling, the inference and the learning. While MRFs were introduced into the computer vision field about two decades ago, they started to become a ubiquitous tool for solving visual perception problems around the turn of the millennium following the emergence of efficient inference methods. During the past decade, a variety of MRF models as well as inference and learning methods have been developed for addressing numerous low, mid and high-level vision problems. While most of the literature concerns pairwise MRFs, in recent years we have also witnessed significant progress in higher-order MRFs, which substantially enhances the expressiveness of graph-based models and expands the domain of solvable problems. This survey provides a compact and informative summary of the major literature in this research topic.

ps

Publishers site pdf [BibTex]

Publishers site pdf [BibTex]


Vision meets Robotics: The {KITTI} Dataset
Vision meets Robotics: The KITTI Dataset

Geiger, A., Lenz, P., Stiller, C., Urtasun, R.

International Journal of Robotics Research, 32(11):1231 - 1237 , Sage Publishing, September 2013 (article)

Abstract
We present a novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research. In total, we recorded 6 hours of traffic scenarios at 10-100 Hz using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras, a Velodyne 3D laser scanner and a high-precision GPS/IMU inertial navigation system. The scenarios are diverse, capturing real-world traffic situations and range from freeways over rural areas to inner-city scenes with many static and dynamic objects. Our data is calibrated, synchronized and timestamped, and we provide the rectified and raw image sequences. Our dataset also contains object labels in the form of 3D tracklets and we provide online benchmarks for stereo, optical flow, object detection and other tasks. This paper describes our recording platform, the data format and the utilities that we provide.

avg ps

pdf DOI [BibTex]

pdf DOI [BibTex]


Human Pose Calculation from Optical Flow Data
Human Pose Calculation from Optical Flow Data

Black, M., Loper, M., Romero, J., Zuffi, S.

European Patent Application EP 2843621 , August 2013 (patent)

ps

Google Patents [BibTex]

Google Patents [BibTex]


Unscented Kalman Filtering on Riemannian Manifolds
Unscented Kalman Filtering on Riemannian Manifolds

Soren Hauberg, Francois Lauze, Kim S. Pedersen

Journal of Mathematical Imaging and Vision, 46(1):103-120, Springer Netherlands, May 2013 (article)

ps

Publishers site PDF [BibTex]

Publishers site PDF [BibTex]


Quasi-Newton Methods: A New Direction
Quasi-Newton Methods: A New Direction

Hennig, P., Kiefel, M.

Journal of Machine Learning Research, 14(1):843-865, March 2013 (article)

Abstract
Four decades after their invention, quasi-Newton methods are still state of the art in unconstrained numerical optimization. Although not usually interpreted thus, these are learning algorithms that fit a local quadratic approximation to the objective function. We show that many, including the most popular, quasi-Newton methods can be interpreted as approximations of Bayesian linear regression under varying prior assumptions. This new notion elucidates some shortcomings of classical algorithms, and lights the way to a novel nonparametric quasi-Newton method, which is able to make more efficient use of available information at computational cost similar to its predecessors.

ei ps pn

website+code pdf link (url) [BibTex]

website+code pdf link (url) [BibTex]


Hybrid nanocolloids with programmed three-dimensional shape and material composition
Hybrid nanocolloids with programmed three-dimensional shape and material composition

Mark, A. G., Gibbs, J. G., Lee, T., Fischer, P.

NATURE MATERIALS, 12(9):802-807, 2013, Max Planck Press Release. (article)

Abstract
Tuning the optical(1,2), electromagnetic(3,4) and mechanical properties of a material requires simultaneous control over its composition and shape(5). This is particularly challenging for complex structures at the nanoscale because surface-energy minimization generally causes small structures to be highly symmetric(5). Here we combine low-temperature shadow deposition with nanoscale patterning to realize nanocolloids with anisotropic three-dimensional shapes, feature sizes down to 20 nm and a wide choice of materials. We demonstrate the versatility of the fabrication scheme by growing three-dimensional hybrid nanostructures that contain several functional materials with the lowest possible symmetry, and by fabricating hundreds of billions of plasmonic nanohelices, which we use as chiral metafluids with record circular dichroism and tunable chiroptical properties.

Max Planck Press Release.

pf

Video - Fabrication of Designer Nanostructures DOI [BibTex]


Chiral Colloidal Molecules And Observation of The Propeller Effect
Chiral Colloidal Molecules And Observation of The Propeller Effect

Schamel, D., Pfeifer, M., Gibbs, J. G., Miksch, B., Mark, A. G., Fischer, P.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 135(33):12353-12359, 2013 (article)

Abstract
Chiral molecules play an important role in biological and chemical processes, but physical effects due to their symmetry-breaking are generally weak. Several physical chiral separation schemes which could potentially be useful, including the propeller effect, have therefore not yet been demonstrated at the molecular scale. However, it has been proposed that complex nonspherical colloidal particles could act as ``colloidal molecules{''} in mesoscopic model systems to permit the visualization of molecular phenomena that are otherwise difficult to observe. Unfortunately, it is difficult to synthesize such colloids because surface minimization generally favors the growth of symmetric particles. Here we demonstrate the production of large numbers of complex colloids with glancing angle physical vapor deposition. We use chiral colloids to demonstrate the Baranova and Zel'dovich (Baranova, N. B.; Zel'dovich, B. Y. Chem. Phys. Lett. 1978, 57, 435) propeller effect: the separation of a racemic mixture by application of a rotating field that couples to the dipole moment of the enantiomers and screw propels them in opposite directions. The handedness of the colloidal suspensions is monitored with circular differential light scattering. An exact solution for the colloid's propulsion is derived, and comparisons between the colloidal system and the corresponding effect at the molecular scale are made.

pf

Video - Nanospropellers DOI [BibTex]

Video - Nanospropellers DOI [BibTex]


Indirect absorption spectroscopy using quantum cascade lasers: mid-infrared refractometry and photothermal spectroscopy
Indirect absorption spectroscopy using quantum cascade lasers: mid-infrared refractometry and photothermal spectroscopy

Pfeifer, M., Ruf, A., Fischer, P.

OPTICS EXPRESS, 21(22):25643-25654, 2013 (article)

Abstract
We record vibrational spectra with two indirect schemes that depend on the real part of the index of refraction: mid-infrared refractometry and photothermal spectroscopy. In the former, a quantum cascade laser (QCL) spot is imaged to determine the angles of total internal reflection, which yields the absorption line via a beam profile analysis. In the photothermal measurements, a tunable QCL excites vibrational resonances of a molecular monolayer, which heats the surrounding medium and changes its refractive index. This is observed with a probe laser in the visible. Sub-monolayer sensitivities are demonstrated. (C) 2013 Optical Society of America

pf

DOI [BibTex]


Plasmonic nanohelix metamaterials with tailorable giant circular dichroism
Plasmonic nanohelix metamaterials with tailorable giant circular dichroism

Gibbs, J. G., Mark, A. G., Eslami, S., Fischer, P.

APPLIED PHYSICS LETTERS, 103(21), 2013, Featured cover article. (article)

Abstract
Plasmonic nanohelix arrays are shown to interact with electromagnetic fields in ways not typically seen with ordinary matter. Chiral metamaterials (CMMs) with feature sizes small with respect to the wavelength of visible light are a promising route to experimentally achieve such phenomena as negative refraction without the need for simultaneously negative e and mu. Here we not only show that giant circular dichroism in the visible is achievable with hexagonally arranged plasmonic nanohelix arrays, but that we can precisely tune the optical activity via morphology and lattice spacing. The discrete dipole approximation is implemented to support experimental data. (C) 2013 AIP Publishing LLC.

Featured cover article.

pf

DOI [BibTex]

DOI [BibTex]


no image
Information Driven Self-Organization of Complex Robotic Behaviors

Martius, G., Der, R., Ay, N.

PLoS ONE, 8(5):e63400, Public Library of Science, 2013 (article)

al

link (url) DOI [BibTex]

link (url) DOI [BibTex]


Random Forests for Real Time {3D} Face Analysis
Random Forests for Real Time 3D Face Analysis

Fanelli, G., Dantone, M., Gall, J., Fossati, A., van Gool, L.

International Journal of Computer Vision, 101(3):437-458, Springer, 2013 (article)

Abstract
We present a random forest-based framework for real time head pose estimation from depth images and extend it to localize a set of facial features in 3D. Our algorithm takes a voting approach, where each patch extracted from the depth image can directly cast a vote for the head pose or each of the facial features. Our system proves capable of handling large rotations, partial occlusions, and the noisy depth data acquired using commercial sensors. Moreover, the algorithm works on each frame independently and achieves real time performance without resorting to parallel computations on a GPU. We present extensive experiments on publicly available, challenging datasets and present a new annotated head pose database recorded using a Microsoft Kinect.

ps

data and code publisher's site pdf DOI Project Page [BibTex]

data and code publisher's site pdf DOI Project Page [BibTex]


Markerless Motion Capture of Multiple Characters Using Multi-view Image Segmentation
Markerless Motion Capture of Multiple Characters Using Multi-view Image Segmentation

Liu, Y., Gall, J., Stoll, C., Dai, Q., Seidel, H., Theobalt, C.

Transactions on Pattern Analysis and Machine Intelligence, 35(11):2720-2735, 2013 (article)

Abstract
Capturing the skeleton motion and detailed time-varying surface geometry of multiple, closely interacting peoples is a very challenging task, even in a multicamera setup, due to frequent occlusions and ambiguities in feature-to-person assignments. To address this task, we propose a framework that exploits multiview image segmentation. To this end, a probabilistic shape and appearance model is employed to segment the input images and to assign each pixel uniquely to one person. Given the articulated template models of each person and the labeled pixels, a combined optimization scheme, which splits the skeleton pose optimization problem into a local one and a lower dimensional global one, is applied one by one to each individual, followed with surface estimation to capture detailed nonrigid deformations. We show on various sequences that our approach can capture the 3D motion of humans accurately even if they move rapidly, if they wear wide apparel, and if they are engaged in challenging multiperson motions, including dancing, wrestling, and hugging.

ps

data and video pdf DOI Project Page [BibTex]

data and video pdf DOI Project Page [BibTex]


Viewpoint and pose in body-form adaptation
Viewpoint and pose in body-form adaptation

Sekunova, A., Black, M., Parkinson, L., Barton, J. J. S.

Perception, 42(2):176-186, 2013 (article)

Abstract
Faces and bodies are complex structures, perception of which can play important roles in person identification and inference of emotional state. Face representations have been explored using behavioural adaptation: in particular, studies have shown that face aftereffects show relatively broad tuning for viewpoint, consistent with origin in a high-level structural descriptor far removed from the retinal image. Our goals were to determine first, if body aftereffects also showed a degree of viewpoint invariance, and second if they also showed pose invariance, given that changes in pose create even more dramatic changes in the 2-D retinal image. We used a 3-D model of the human body to generate headless body images, whose parameters could be varied to generate different body forms, viewpoints, and poses. In the first experiment, subjects adapted to varying viewpoints of either slim or heavy bodies in a neutral stance, followed by test stimuli that were all front-facing. In the second experiment, we used the same front-facing bodies in neutral stance as test stimuli, but compared adaptation from bodies in the same neutral stance to adaptation with the same bodies in different poses. We found that body aftereffects were obtained over substantial viewpoint changes, with no significant decline in aftereffect magnitude with increasing viewpoint difference between adapting and test images. Aftereffects also showed transfer across one change in pose but not across another. We conclude that body representations may have more viewpoint invariance than faces, and demonstrate at least some transfer across pose, consistent with a high-level structural description. Keywords: aftereffect, shape, face, representation

ps

pdf from publisher abstract pdf link (url) Project Page [BibTex]

pdf from publisher abstract pdf link (url) Project Page [BibTex]


no image
Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

Zahedi, K., Martius, G., Ay, N.

Frontiers in Psychology, 4(801), 2013 (article)

Abstract
One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviours. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary step in which we investigate the predictive information (the mutual information of the past and future of the sensor stream) as an intrinsic drive, ideally supporting any kind of task acquisition. Previous experiments have shown that the predictive information (PI) is a good candidate to support autonomous, open-ended learning of complex behaviours, because a maximisation of the PI corresponds to an exploration of morphology- and environment-dependent behavioural regularities. The idea is that these regularities can then be exploited in order to solve any given task. Three different experiments are presented and their results lead to the conclusion that the linear combination of the one-step PI with an external reward function is not generally recommended in an episodic policy gradient setting. Only for hard tasks a great speed-up can be achieved at the cost of an asymptotic performance lost.

al

link (url) DOI [BibTex]


no image
Robustness of guided self-organization against sensorimotor disruptions

Martius, G.

Advances in Complex Systems, 16(02n03):1350001, 2013 (article)

Abstract
Self-organizing processes are crucial for the development of living beings. Practical applications in robots may benefit from the self-organization of behavior, e.g.~to increase fault tolerance and enhance flexibility, provided that external goals can also be achieved. We present results on the guidance of self-organizing control by visual target stimuli and show a remarkable robustness to sensorimotor disruptions. In a proof of concept study an autonomous wheeled robot is learning an object finding and ball-pushing task from scratch within a few minutes in continuous domains. The robustness is demonstrated by the rapid recovery of the performance after severe changes of the sensor configuration.

al

DOI [BibTex]

DOI [BibTex]


Non-parametric hand pose estimation with object context
Non-parametric hand pose estimation with object context

Romero, J., Kjellström, H., Ek, C. H., Kragic, D.

Image and Vision Computing , 31(8):555 - 564, 2013 (article)

Abstract
In the spirit of recent work on contextual recognition and estimation, we present a method for estimating the pose of human hands, employing information about the shape of the object in the hand. Despite the fact that most applications of human hand tracking involve grasping and manipulation of objects, the majority of methods in the literature assume a free hand, isolated from the surrounding environment. Occlusion of the hand from grasped objects does in fact often pose a severe challenge to the estimation of hand pose. In the presented method, object occlusion is not only compensated for, it contributes to the pose estimation in a contextual fashion; this without an explicit model of object shape. Our hand tracking method is non-parametric, performing a nearest neighbor search in a large database (.. entries) of hand poses with and without grasped objects. The system that operates in real time, is robust to self occlusions, object occlusions and segmentation errors, and provides full hand pose reconstruction from monocular video. Temporal consistency in hand pose is taken into account, without explicitly tracking the hand in the high-dim pose space. Experiments show the non-parametric method to outperform other state of the art regression methods, while operating at a significantly lower computational cost than comparable model-based hand tracking methods.

ps

Publisher site pdf link (url) [BibTex]

Publisher site pdf link (url) [BibTex]

2008


Voltage-Controllable Magnetic Composite Based on Multifunctional Polyethylene Microparticles
Voltage-Controllable Magnetic Composite Based on Multifunctional Polyethylene Microparticles

Ghosh, A., Sheridon, N. K., Fischer, P.

SMALL, 4(11):1956-1958, 2008 (article)

pf

DOI [BibTex]

2008



Chiral molecules split light: Reflection and refraction in a chiral liquid
Chiral molecules split light: Reflection and refraction in a chiral liquid

Ghosh, A., Fischer, P.

PHYSICAL REVIEW LETTERS, 97(17), 2006, Featured highlight ‘Fundamental optical physics: Refraction’ Nature Photonics, Nov. 2006. (article)

Abstract
A light beam changes direction as it enters a liquid at an angle from another medium, such as air. Should the liquid contain molecules that lack mirror symmetry, then it has been predicted by Fresnel that the light beam will not only change direction, but will actually split into two separate beams with a small difference in the respective angles of refraction. Here we report the observation of this phenomenon. We also demonstrate that the angle of reflection does not equal the angle of incidence in a chiral medium. Unlike conventional optical rotation, which depends on the path-length through the sample, the reported reflection and refraction phenomena arise within a few wavelengths at the interface and thereby suggest a new approach to polarimetry that can be used in microfluidic volumes.

Featured highlight ‘Fundamental optical physics: Refraction’ Nature Photonics, Nov. 2006.

pf

DOI [BibTex]

DOI [BibTex]


Direct chiral discrimination in NMR spectroscopy
Direct chiral discrimination in NMR spectroscopy

Buckingham, A., Fischer, P.

CHEMICAL PHYSICS, 324(1):111-116, 2006 (article)

Abstract
Conventional nuclear magnetic resonance spectroscopy is unable to distinguish between the two mirror-image forms (enantiomers) of a chiral molecule. This is because the NMR spectrum is determined by the chemical shifts and spin-spin coupling constants which - in the absence of a chiral solvent - are identical for the two enantiomers. We discuss how chirality may nevertheless be directly detected in liquid-state NMR spectroscopy: In a chiral molecule, the rotating nuclear magnetic moment induces an electric dipole moment in the direction perpendicular to itself and to the permanent magnetic field of the spectrometer. We present computations of the precessing electric polarization following a pi/2 pulse. Our estimates indicate that the electric polarization should be detectable in favourable cases. We also predict that application of an electrostatic field induces a chirally sensitive magnetization oscillating in the direction of the permanent magnetic field. We show that the electric-field-perturbed chemical shift tensor, the nuclear magnetic shielding polarizability, underlies these chiral NMR effects. (c) 2005 Elsevier B.V. All rights reserved.

pf

DOI [BibTex]

DOI [BibTex]


Ring-resonator-based frequency-domain optical activity measurements of a chiral liquid
Ring-resonator-based frequency-domain optical activity measurements of a chiral liquid

Vollmer, F., Fischer, P.

OPTICS LETTERS, 31(4):453-455, 2006 (article)

Abstract
Chiral liquids rotate the plane of polarization of linearly polarized light and are therefore optically active. Here we show that optical rotation can be observed in the frequency domain. A chiral liquid introduced in a fiber-loop ring resonator that supports left and right circularly polarized modes gives rise to relative frequency shifts that are a direct measure of the liquid's circular birefringence and hence of its optical activity. The effect is in principle not diminished if the circumference of the ring is reduced. The technique is similarly applicable to refractive index and linear birefringence measurements. (c) 2006 Optical Society of America.

pf

DOI [BibTex]


Sign of the refractive index in a gain medium with negative permittivity and permeability
Sign of the refractive index in a gain medium with negative permittivity and permeability

Chen, Y., Fischer, P., Wise, F.

JOURNAL OF THE OPTICAL SOCIETY OF AMERICA B-OPTICAL PHYSICS, 23(1):45-50, 2006 (article)

Abstract
We show how the sign of the refractive index in any medium may be derived using a rigorous analysis based on Einstein causality. In particular, we consider left-handed materials, i.e., media that have negative permittivities and permeabilities at the frequency of interest. We find that the consideration of gain in such media can give rise to a positive refractive index. (c) 2006 Optical Society of America.

pf

DOI [BibTex]

DOI [BibTex]


no image
Rocking Stamper and Jumping Snake from a Dynamical System Approach to Artificial Life

Der, R., Hesse, F., Martius, G.

Adaptive Behavior, 14(2):105-115, 2006 (article)

Abstract
Dynamical systems offer intriguing possibilities as a substrate for the generation of behavior because of their rich behavioral complexity. However this complexity together with the largely covert relation between the parameters and the behavior of the agent is also the main hindrance in the goal-oriented design of a behavior system. This paper presents a general approach to the self-regulation of dynamical systems so that the design problem is circumvented. We consider the controller (a neural net work) as the mediator for changes in the sensor values over time and define a dynamics for the parameters of the controller by maximizing the dynamical complexity of the sensorimotor loop under the condition that the consequences of the actions taken are still predictable. This very general principle is given a concrete mathematical formulation and is implemented in an extremely robust and versatile algorithm for the parameter dynamics of the controller. We consider two different applications, a mechanical device called the rocking stamper and the ODE simulations of a "snake" with five degrees of freedom. In these and many other examples studied we observed various behavior modes of high dynamical complexity.

al

DOI [BibTex]

DOI [BibTex]


Nonlinear optical spectroscopy of chiral molecules
Nonlinear optical spectroscopy of chiral molecules

Fischer, P., Hache, F.

CHIRALITY, 17(8):421-437, 2005 (article)

Abstract
We review nonlinear optical processes that are specific to chiral molecules in solution and on surfaces. In contrast to conventional natural optical activity phenomena, which depend linearly on the electric field strength of the optical field, we discuss how optical processes that are nonlinear (quadratic, cubic, and quartic) functions of the electromagnetic field strength may probe optically active centers and chiral vibrations. We show that nonlinear techniques open entirely new ways of exploring chirality in chemical and biological systems: The cubic processes give rise to nonlinear circular dichroism and nonlinear optical rotation and make it possible to observe dynamic chiral processes at ultrafast time scales. The quadratic second-harmonic and sum-frequency-generation phenomena and the quartic processes may arise entirely in the electric-dipole approximation and do not require the use of circularly polarized light to detect chirality: They provide surface selectivity and their observables can be relatively much larger than in linear optical activity. These processes also give rise to the generation of light at a new color, and in liquids this frequency conversion only occurs if the solution is optically active. We survey recent chiral nonlinear optical experiments and give examples of their application to problems of biophysical interest. (C) 2005 Wiley-Liss, Inc.

pf

DOI [BibTex]

DOI [BibTex]


Negative refraction at optical frequencies in nonmagnetic two-component molecular media
Negative refraction at optical frequencies in nonmagnetic two-component molecular media

Chen, Y., Fischer, P., Wise, F.

PHYSICAL REVIEW LETTERS, 95(6), 2005 (article)

Abstract
There is significant motivation to develop media with negative refractive indices at optical frequencies, but efforts in this direction are hampered by the weakness of the magnetic response at such frequencies. We show theoretically that a nonmagnetic medium with two atomic or molecular constituents can exhibit a negative refractive index. A negative index is possible even when the real parts of both the permittivity and permeability are positive. This surprising result provides a route to isotropic negative-index media at optical frequencies.

pf

DOI [BibTex]

DOI [BibTex]