Header logo is


2018


A Value-Driven Eldercare Robot: Virtual and Physical Instantiations of a Case-Supported Principle-Based Behavior Paradigm
A Value-Driven Eldercare Robot: Virtual and Physical Instantiations of a Case-Supported Principle-Based Behavior Paradigm

Anderson, M., Anderson, S., Berenz, V.

Proceedings of the IEEE, pages: 1,15, October 2018 (article)

Abstract
In this paper, a case-supported principle-based behavior paradigm is proposed to help ensure ethical behavior of autonomous machines. We argue that ethically significant behavior of autonomous systems should be guided by explicit ethical principles determined through a consensus of ethicists. Such a consensus is likely to emerge in many areas in which autonomous systems are apt to be deployed and for the actions they are liable to undertake. We believe that this is the case since we are more likely to agree on how machines ought to treat us than on how human beings ought to treat one another. Given such a consensus, particular cases of ethical dilemmas where ethicists agree on the ethically relevant features and the right course of action can be used to help discover principles that balance these features when they are in conflict. Such principles not only help ensure ethical behavior of complex and dynamic systems but also can serve as a basis for justification of this behavior. The requirements, methods, implementation, and evaluation components of the paradigm are detailed as well as its instantiation in both a simulated and real robot functioning in the domain of eldercare.

am

link (url) DOI [BibTex]

2018



Probabilistic Solutions To Ordinary Differential Equations As Non-Linear Bayesian Filtering: A New Perspective
Probabilistic Solutions To Ordinary Differential Equations As Non-Linear Bayesian Filtering: A New Perspective

Tronarp, F., Kersting, H., Särkkä, S., Hennig, P.

ArXiv preprint 2018, arXiv:1810.03440 [stat.ME], October 2018 (article)

Abstract
We formulate probabilistic numerical approximations to solutions of ordinary differential equations (ODEs) as problems in Gaussian process (GP) regression with non-linear measurement functions. This is achieved by defining the measurement sequence to consists of the observations of the difference between the derivative of the GP and the vector field evaluated at the GP---which are all identically zero at the solution of the ODE. When the GP has a state-space representation, the problem can be reduced to a Bayesian state estimation problem and all widely-used approximations to the Bayesian filtering and smoothing problems become applicable. Furthermore, all previous GP-based ODE solvers, which were formulated in terms of generating synthetic measurements of the vector field, come out as specific approximations. We derive novel solvers, both Gaussian and non-Gaussian, from the Bayesian state estimation problem posed in this paper and compare them with other probabilistic solvers in illustrative experiments.

pn

link (url) Project Page [BibTex]


Playful: Reactive Programming for Orchestrating Robotic Behavior
Playful: Reactive Programming for Orchestrating Robotic Behavior

Berenz, V., Schaal, S.

IEEE Robotics Automation Magazine, 25(3):49-60, September 2018 (article) In press

Abstract
For many service robots, reactivity to changes in their surroundings is a must. However, developing software suitable for dynamic environments is difficult. Existing robotic middleware allows engineers to design behavior graphs by organizing communication between components. But because these graphs are structurally inflexible, they hardly support the development of complex reactive behavior. To address this limitation, we propose Playful, a software platform that applies reactive programming to the specification of robotic behavior.

am

playful website playful_IEEE_RAM link (url) DOI [BibTex]


ClusterNet: Instance Segmentation in RGB-D Images
ClusterNet: Instance Segmentation in RGB-D Images

Shao, L., Tian, Y., Bohg, J.

arXiv, September 2018, Submitted to ICRA'19 (article) Submitted

Abstract
We propose a method for instance-level segmentation that uses RGB-D data as input and provides detailed information about the location, geometry and number of {\em individual\/} objects in the scene. This level of understanding is fundamental for autonomous robots. It enables safe and robust decision-making under the large uncertainty of the real-world. In our model, we propose to use the first and second order moments of the object occupancy function to represent an object instance. We train an hourglass Deep Neural Network (DNN) where each pixel in the output votes for the 3D position of the corresponding object center and for the object's size and pose. The final instance segmentation is achieved through clustering in the space of moments. The object-centric training loss is defined on the output of the clustering. Our method outperforms the state-of-the-art instance segmentation method on our synthesized dataset. We show that our method generalizes well on real-world data achieving visually better segmentation results.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Convergence Rates of Gaussian ODE Filters

Kersting, H., Sullivan, T. J., Hennig, P.

arXiv preprint 2018, arXiv:1807.09737 [math.NA], July 2018 (article)

Abstract
A recently-introduced class of probabilistic (uncertainty-aware) solvers for ordinary differential equations (ODEs) applies Gaussian (Kalman) filtering to initial value problems. These methods model the true solution $x$ and its first $q$ derivatives a priori as a Gauss--Markov process $\boldsymbol{X}$, which is then iteratively conditioned on information about $\dot{x}$. We prove worst-case local convergence rates of order $h^{q+1}$ for a wide range of versions of this Gaussian ODE filter, as well as global convergence rates of order $h^q$ in the case of $q=1$ and an integrated Brownian motion prior, and analyse how inaccurate information on $\dot{x}$ coming from approximate evaluations of $f$ affects these rates. Moreover, we present explicit formulas for the steady states and show that the posterior confidence intervals are well calibrated in all considered cases that exhibit global convergence---in the sense that they globally contract at the same rate as the truncation error.

pn

link (url) Project Page [BibTex]

link (url) Project Page [BibTex]


Real-time Perception meets Reactive Motion Generation
Real-time Perception meets Reactive Motion Generation

(Best Systems Paper Finalists - Amazon Robotics Best Paper Awards in Manipulation)

Kappler, D., Meier, F., Issac, J., Mainprice, J., Garcia Cifuentes, C., Wüthrich, M., Berenz, V., Schaal, S., Ratliff, N., Bohg, J.

IEEE Robotics and Automation Letters, 3(3):1864-1871, July 2018 (article)

Abstract
We address the challenging problem of robotic grasping and manipulation in the presence of uncertainty. This uncertainty is due to noisy sensing, inaccurate models and hard-to-predict environment dynamics. Our approach emphasizes the importance of continuous, real-time perception and its tight integration with reactive motion generation methods. We present a fully integrated system where real-time object and robot tracking as well as ambient world modeling provides the necessary input to feedback controllers and continuous motion optimizers. Specifically, they provide attractive and repulsive potentials based on which the controllers and motion optimizer can online compute movement policies at different time intervals. We extensively evaluate the proposed system on a real robotic platform in four scenarios that exhibit either challenging workspace geometry or a dynamic environment. We compare the proposed integrated system with a more traditional sense-plan-act approach that is still widely used. In 333 experiments, we show the robustness and accuracy of the proposed system.

am

arxiv video video link (url) DOI Project Page [BibTex]


no image
Distributed Event-Based State Estimation for Networked Systems: An LMI Approach

Muehlebach, M., Trimpe, S.

IEEE Transactions on Automatic Control, 63(1):269-276, January 2018 (article)

am ics

arXiv (extended version) DOI Project Page [BibTex]

arXiv (extended version) DOI Project Page [BibTex]


no image
Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences

Kanagawa, M., Hennig, P., Sejdinovic, D., Sriperumbudur, B. K.

Arxiv e-prints, arXiv:1805.08845v1 [stat.ML], 2018 (article)

Abstract
This paper is an attempt to bridge the conceptual gaps between researchers working on the two widely used approaches based on positive definite kernels: Bayesian learning or inference using Gaussian processes on the one side, and frequentist kernel methods based on reproducing kernel Hilbert spaces on the other. It is widely known in machine learning that these two formalisms are closely related; for instance, the estimator of kernel ridge regression is identical to the posterior mean of Gaussian process regression. However, they have been studied and developed almost independently by two essentially separate communities, and this makes it difficult to seamlessly transfer results between them. Our aim is to overcome this potential difficulty. To this end, we review several old and new results and concepts from either side, and juxtapose algorithmic quantities from each framework to highlight close similarities. We also provide discussions on subtle philosophical and theoretical differences between the two approaches.

pn

arXiv [BibTex]

arXiv [BibTex]


no image
Numerical Quadrature for Probabilistic Policy Search

Vinogradska, J., Bischoff, B., Achterhold, J., Koller, T., Peters, J.

IEEE Transactions on Pattern Analysis and Machine Intelligence, pages: 1-1, 2018 (article)

ev

DOI [BibTex]

DOI [BibTex]


no image
Transmission x-ray microscopy at low temperatures: Irregular supercurrent flow at small length scales

Simmendinger, J., Ruoss, S., Stahl, C., Weigand, M., Gräfe, J., Schütz, G., Albrecht, J.

{Physical Review B}, 97(13), American Physical Society, Woodbury, NY, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


Combining learned and analytical models for predicting action effects
Combining learned and analytical models for predicting action effects

Kloss, A., Schaal, S., Bohg, J.

arXiv, 2018 (article) Submitted

Abstract
One of the most basic skills a robot should possess is predicting the effect of physical interactions with objects in the environment. This enables optimal action selection to reach a certain goal state. Traditionally, dynamics are approximated by physics-based analytical models. These models rely on specific state representations that may be hard to obtain from raw sensory data, especially if no knowledge of the object shape is assumed. More recently, we have seen learning approaches that can predict the effect of complex physical interactions directly from sensory input. It is however an open question how far these models generalize beyond their training data. In this work, we investigate the advantages and limitations of neural network based learning approaches for predicting the effects of actions based on sensory input and show how analytical and learned models can be combined to leverage the best of both worlds. As physical interaction task, we use planar pushing, for which there exists a well-known analytical model and a large real-world dataset. We propose to use a convolutional neural network to convert raw depth images or organized point clouds into a suitable representation for the analytical model and compare this approach to using neural networks for both, perception and prediction. A systematic evaluation of the proposed approach on a very large real-world dataset shows two main advantages of the hybrid architecture. Compared to a pure neural network, it significantly (i) reduces required training data and (ii) improves generalization to novel physical interaction.

am

arXiv pdf link (url) [BibTex]


no image
Counterfactual Mean Embedding: A Kernel Method for Nonparametric Causal Inference

Muandet, K., Kanagawa, M., Saengkyongam, S., Marukata, S.

Arxiv e-prints, arXiv:1805.08845v1 [stat.ML], 2018 (article)

Abstract
This paper introduces a novel Hilbert space representation of a counterfactual distribution---called counterfactual mean embedding (CME)---with applications in nonparametric causal inference. Counterfactual prediction has become an ubiquitous tool in machine learning applications, such as online advertisement, recommendation systems, and medical diagnosis, whose performance relies on certain interventions. To infer the outcomes of such interventions, we propose to embed the associated counterfactual distribution into a reproducing kernel Hilbert space (RKHS) endowed with a positive definite kernel. Under appropriate assumptions, the CME allows us to perform causal inference over the entire landscape of the counterfactual distribution. The CME can be estimated consistently from observational data without requiring any parametric assumption about the underlying distributions. We also derive a rate of convergence which depends on the smoothness of the conditional mean and the Radon-Nikodym derivative of the underlying marginal distributions. Our framework can deal with not only real-valued outcome, but potentially also more complex and structured outcomes such as images, sequences, and graphs. Lastly, our experimental results on off-policy evaluation tasks demonstrate the advantages of the proposed estimator.

ei pn

arXiv [BibTex]

arXiv [BibTex]


no image
Model-based Kernel Sum Rule: Kernel Bayesian Inference with Probabilistic Models

Nishiyama, Y., Kanagawa, M., Gretton, A., Fukumizu, K.

Arxiv e-prints, arXiv:1409.5178v2 [stat.ML], 2018 (article)

Abstract
Kernel Bayesian inference is a powerful nonparametric approach to performing Bayesian inference in reproducing kernel Hilbert spaces or feature spaces. In this approach, kernel means are estimated instead of probability distributions, and these estimates can be used for subsequent probabilistic operations (as for inference in graphical models) or in computing the expectations of smooth functions, for instance. Various algorithms for kernel Bayesian inference have been obtained by combining basic rules such as the kernel sum rule (KSR), kernel chain rule, kernel product rule and kernel Bayes' rule. However, the current framework only deals with fully nonparametric inference (i.e., all conditional relations are learned nonparametrically), and it does not allow for flexible combinations of nonparametric and parametric inference, which are practically important. Our contribution is in providing a novel technique to realize such combinations. We introduce a new KSR referred to as the model-based KSR (Mb-KSR), which employs the sum rule in feature spaces under a parametric setting. Incorporating the Mb-KSR into existing kernel Bayesian framework provides a richer framework for hybrid (nonparametric and parametric) kernel Bayesian inference. As a practical application, we propose a novel filtering algorithm for state space models based on the Mb-KSR, which combines the nonparametric learning of an observation process using kernel mean embedding and the additive Gaussian noise model for a state transition process. While we focus on additive Gaussian noise models in this study, the idea can be extended to other noise models, such as the Cauchy and alpha-stable noise models.

pn

arXiv [BibTex]

arXiv [BibTex]


no image
Assessment methodology of promising porous materials for near ambient temperature hydrogen storage applications

Minuto, F. D., Balderas-Xicohténcatl, R., Policicchio, A., Hirscher, M., Agostino, R. G.

{International Journal of Hydrogen Energy}, 43(31):14550-14556, Elsevier, Amsterdam, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


A probabilistic model for the numerical solution of initial value problems
A probabilistic model for the numerical solution of initial value problems

Schober, M., Särkkä, S., Philipp Hennig,

Statistics and Computing, Springer US, 2018 (article)

Abstract
We study connections between ordinary differential equation (ODE) solvers and probabilistic regression methods in statistics. We provide a new view of probabilistic ODE solvers as active inference agents operating on stochastic differential equation models that estimate the unknown initial value problem (IVP) solution from approximate observations of the solution derivative, as provided by the ODE dynamics. Adding to this picture, we show that several multistep methods of Nordsieck form can be recast as Kalman filtering on q-times integrated Wiener processes. Doing so provides a family of IVP solvers that return a Gaussian posterior measure, rather than a point estimate. We show that some such methods have low computational overhead, nontrivial convergence order, and that the posterior has a calibrated concentration rate. Additionally, we suggest a step size adaptation algorithm which completes the proposed method to a practically useful implementation, which we experimentally evaluate using a representative set of standard codes in the DETEST benchmark set.

pn

PDF Code DOI Project Page [BibTex]


no image
Incorporation of Terbium into a Microalga Leads to Magnetotactic Swimmers

Santomauro, G., Singh, A., Park, B. W., Mohammadrahimi, M., Erkoc, P., Goering, E., Schütz, G., Sitti, M., Bill, J.

Advanced Biosystems, 2(12):1800039, 2018 (article)

mms pi

[BibTex]

[BibTex]


no image
Thermodynamics, kinetics and selectivity of H2 and D2 on zeolite 5A below 77K

Xiong, R., Balderas-Xicohténcatl, R., Zhang, L., Li, P., Yao, Y., Sang, G., Chen, C., Tang, T., Luo, D., Hirscher, M.

{Microporous and Mesoporous Materials}, 264, pages: 22-27, Elsevier, Amsterdam, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Volumetric hydrogen storage capacity in metal-organic frameworks

Balderas-Xicohténcatl, R., Schlichtenmayer, M., Hirscher, M.

{Energy Technology}, 6(3):578-582, Wiley-VCH, Weinheim, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
3D nanoprinted plastic kinoform x-ray optics

Sanli, U. T., Ceylan, H., Bykova, I., Weigand, M., Sitti, M., Schütz, G., Keskinbora, K.

{Advanced Materials}, 30(36), Wiley-VCH, Weinheim, 2018 (article)

mms pi

DOI [BibTex]

DOI [BibTex]


no image
High volumetric hydrogen storage capacity using interpenetrated metal-organic frameworks

Balderas-Xicohténcatl, R., Schmieder, P., Denysenko, D., Volkmer, D., Hirscher, M.

{Energy Technology}, 6(3):510-512, Wiley-VCH, Weinheim, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Thick permalloy films for the imaging of spin texture dynamics in perpendicularly magnetized systems

Finizio, S., Wintz, S., Bracher, D., Kirk, E., Semisalova, A. S., Förster, J., Zeissler, K., We\ssels, T., Weigand, M., Lenz, K., Kleibert, A., Raabe, J.

{Physical Review B}, 98(10), American Physical Society, Woodbury, NY, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Dynamic Janus metasurfaces in the visible spectral region

Yu, P., Li, J., Zhang, S., Jin, Z., Schütz, G., Qiu, C., Hirscher, M., Liu, N.

{Nano Letters}, 18(7):4584-4589, American Chemical Society, Washington, DC, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Review of ultrafast demagnetization after femtosecond laser pulses: A complex interaction of light with quantum matter

Fähnle, M., Haag, M., Illg, C., Müller, B. Y., Weng, W., Tsatsoulis, T., Huang, H., Briones Paz, J. Z., Teeny, N., Zhang, L., Kuhn, T.

{American Journal of Modern Physics}, 7(2):68-74, Science Publishing Group, New York, NY, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Direct observation of Zhang-Li torque expansion of magnetic droplet solitons

Chung, S., Tuan Le, Q., Ahlberg, M., Awad, A. A., Weigand, M., Bykova, I., Khymyn, R., Dvornik, M., Mazraati, H., Houshang, A., Jiang, S., Nguyen, T. N. A., Goering, E., Schütz, G., Gräfe, J., \AAkerman, J.

{Physical Review Letters}, 120(21), American Physical Society, Woodbury, N.Y., 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Current-induced skyrmion generation through morphological thermal transitions in chiral ferromagnetic heterostructures

Lemesh, I., Litzius, K., Böttcher, M., Bassirian, P., Kerber, N., Heinze, D., Zázvorka, J., Büttner, F., Caretta, L., Mann, M., Weigand, M., Finizio, S., Raabe, J., Im, M., Stoll, H., Schütz, G., Dupé, B., Kläui, M., Beach, G. S. D.

{Advanced Materials}, 30(49), Wiley-VCH, Weinheim, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
3d nanofabrication of high-resolution multilayer Fresnel zone plates

Sanli, U. T., Jiao, C., Baluktsian, M., Grévent, C., Hahn, K., Wang, Y., Srot, V., Richter, G., Bykova, I., Weigand, M., Schütz, G., Keskinbora, K.

{Advanced Science}, 5(9), Wiley-VCH, Weinheim, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Photocatalytic CO2 reduction by Cr-substituted Ba2(In2-xCrx)O5\mbox⋅(H2O)δ(0.04 ≤x ≤0.60)

Yoon, S., Gaul, M., Sharma, S., Son, K., Hagemann, H., Ziegenbalg, D., Schwingenschlogl, U., Widenmeyer, M., Weidenkaff, A.

{Solid State Sciences}, 78, pages: 22-29, Elsevier Masson SAS, Paris, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Correction of axial position uncertainty and systematic detector errors in ptychographic diffraction imaging

Loetgering, L., Rose, M., Keskinbora, K., Baluktsian, M., Dogan, G., Sanli, U., Bykova, I., Weigand, M., Schütz, G., Wilhein, T.

{Optical Engineering}, 57(8), The Society, Redondo Beach, Calif., 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
The role of surface oxides on hydrogen sorption kinetics in titanium thin films

Hadjixenophontos, E., Michalek, L., Roussel, M., Hirscher, M., Schmitz, G.

{Applied Surface Science}, 441, pages: 324-330, Elsevier B.V., Amsterdam, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Ferromagnetism in nitrogen and fluorine substituted BaTiO3

Yoon, S., Son, K., Ebbinghaus, S. G., Widenmeyer, M., Weidenkaff, A.

{Journal of Alloys and Compounds}, 749, pages: 628-633, Elsevier B.V., Lausanne, Switzerland, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
New concepts for 3d optics in x-ray microscopy

Sanli, U., Ceylan, H., Jiao, C., Baluktsian, M., Grevent, C., Hahn, K., Wang, Y., Srot, V., Richter, G., Bykova, I., Weigand, M., Sitti, M., Schütz, G., Keskinbora, K.

{Microscopy and Microanalysis}, 24(Suppl 2):288-289, Cambridge University Press, New York, NY, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Spin-wave interference in magnetic vortex stacks

Behncke, C., Adolff, C. F., Lenzing, N., Hänze, M., Schulte, B., Weigand, M., Schütz, G., Meier, G.

{Communications Physics}, 1, Nature Publishing Group, London, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
High-throughput synthesis of modified Fresnel zone plate arrays via ion beam lithography

Keskinbora, K., Sanli, U. T., Baluktsian, M., Grévent, C., Weigand, M., Schütz, G.

{Beilstein Journal of Nanotechnology}, 9, pages: 2049-2056, Beilstein-Institut, Frankfurt am Main, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Deterministic creation and deletion of a single magnetic skyrmion observed by direct time-resolved X-ray microscopy

Woo, S., Song, K. M., Zhang, X., Ezawa, M., Zhou, Y., Liu, X., Weigand, M., Finizio, S., Raabe, J., Park, M.-C., Lee, K.-Y., Choi, J. W., Min, B.-C., Koo, H. C., Chang, J.

{Nature Electronics}, 1(5):288-296, Springer Nature, London, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Magnetic skyrmion as a nonlinear resistive element: A potential building block for reservoir computing

Prychynenko, D., Sitte, M., Litzius, K., Krüger, B., Bourianoff, G., Kläui, M., Sinova, J., Everschor-Sitte, K.

{Physical Review Applied}, 9(1), American Physical Society, College Park, Md. [u.a.], 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Tunable geometrical frustration in magnoic vortex crystals

Behncke, C., Adolff, C. F., Wintz, S., Hänze, M., Schulte, B., Weigand, M., Finizio, S., Raabe, J., Meier, G.

{Scientific Reports}, 8, Nature Publishing Group, London, UK, 2018 (article)

mms

DOI [BibTex]

DOI [BibTex]

2017


Interactive Perception: Leveraging Action in Perception and Perception in Action
Interactive Perception: Leveraging Action in Perception and Perception in Action

Bohg, J., Hausman, K., Sankaran, B., Brock, O., Kragic, D., Schaal, S., Sukhatme, G.

IEEE Transactions on Robotics, 33, pages: 1273-1291, December 2017 (article)

Abstract
Recent approaches in robotics follow the insight that perception is facilitated by interactivity with the environment. These approaches are subsumed under the term of Interactive Perception (IP). We argue that IP provides the following benefits: (i) any type of forceful interaction with the environment creates a new type of informative sensory signal that would otherwise not be present and (ii) any prior knowledge about the nature of the interaction supports the interpretation of the signal. This is facilitated by knowledge of the regularity in the combined space of sensory information and action parameters. The goal of this survey is to postulate this as a principle and collect evidence in support by analyzing and categorizing existing work in this area. We also provide an overview of the most important applications of Interactive Perception. We close this survey by discussing the remaining open questions. Thereby, we hope to define a field and inspire future work.

am

arXiv DOI Project Page [BibTex]

2017


arXiv DOI Project Page [BibTex]


Probabilistic Line Searches for Stochastic Optimization
Probabilistic Line Searches for Stochastic Optimization

Mahsereci, M., Hennig, P.

Journal of Machine Learning Research, 18(119):1-59, November 2017 (article)

pn

link (url) Project Page [BibTex]

link (url) Project Page [BibTex]


Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning
Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning

Li, W., Bohg, J., Fritz, M.

arXiv, November 2017 (article) Submitted

Abstract
Understanding physical phenomena is a key component of human intelligence and enables physical interaction with previously unseen environments. In this paper, we study how an artificial agent can autonomously acquire this intuition through interaction with the environment. We created a synthetic block stacking environment with physics simulation in which the agent can learn a policy end-to-end through trial and error. Thereby, we bypass to explicitly model physical knowledge within the policy. We are specifically interested in tasks that require the agent to reach a given goal state that may be different for every new trial. To this end, we propose a deep reinforcement learning framework that learns policies which are parametrized by a goal. We validated the model on a toy example navigating in a grid world with different target positions and in a block stacking task with different target structures of the final tower. In contrast to prior work, our policies show better generalization across different goals.

am

arXiv [BibTex]


no image
Event-based State Estimation: An Emulation-based Approach

Trimpe, S.

IET Control Theory & Applications, 11(11):1684-1693, July 2017 (article)

Abstract
An event-based state estimation approach for reducing communication in a networked control system is proposed. Multiple distributed sensor agents observe a dynamic process and sporadically transmit their measurements to estimator agents over a shared bus network. Local event-triggering protocols ensure that data is transmitted only when necessary to meet a desired estimation accuracy. The event-based design is shown to emulate the performance of a centralised state observer design up to guaranteed bounds, but with reduced communication. The stability results for state estimation are extended to the distributed control system that results when the local estimates are used for feedback control. Results from numerical simulations and hardware experiments illustrate the effectiveness of the proposed approach in reducing network communication.

am ics

arXiv Supplementary material PDF DOI Project Page [BibTex]

arXiv Supplementary material PDF DOI Project Page [BibTex]


Probabilistic Articulated Real-Time Tracking for Robot Manipulation
Probabilistic Articulated Real-Time Tracking for Robot Manipulation

(Best Paper of RA-L 2017, Finalist of Best Robotic Vision Paper Award of ICRA 2017)

Garcia Cifuentes, C., Issac, J., Wüthrich, M., Schaal, S., Bohg, J.

IEEE Robotics and Automation Letters (RA-L), 2(2):577-584, April 2017 (article)

Abstract
We propose a probabilistic filtering method which fuses joint measurements with depth images to yield a precise, real-time estimate of the end-effector pose in the camera frame. This avoids the need for frame transformations when using it in combination with visual object tracking methods. Precision is achieved by modeling and correcting biases in the joint measurements as well as inaccuracies in the robot model, such as poor extrinsic camera calibration. We make our method computationally efficient through a principled combination of Kalman filtering of the joint measurements and asynchronous depth-image updates based on the Coordinate Particle Filter. We quantitatively evaluate our approach on a dataset recorded from a real robotic platform, annotated with ground truth from a motion capture system. We show that our approach is robust and accurate even under challenging conditions such as fast motion, significant and long-term occlusions, and time-varying biases. We release the dataset along with open-source code of our approach to allow for quantitative comparison with alternative approaches.

am

arXiv video code and dataset video PDF DOI Project Page [BibTex]


no image
Anticipatory Action Selection for Human-Robot Table Tennis

Wang, Z., Boularias, A., Mülling, K., Schölkopf, B., Peters, J.

Artificial Intelligence, 247, pages: 399-414, 2017, Special Issue on AI and Robotics (article)

Abstract
Abstract Anticipation can enhance the capability of a robot in its interaction with humans, where the robot predicts the humans' intention for selecting its own action. We present a novel framework of anticipatory action selection for human-robot interaction, which is capable to handle nonlinear and stochastic human behaviors such as table tennis strokes and allows the robot to choose the optimal action based on prediction of the human partner's intention with uncertainty. The presented framework is generic and can be used in many human-robot interaction scenarios, for example, in navigation and human-robot co-manipulation. In this article, we conduct a case study on human-robot table tennis. Due to the limited amount of time for executing hitting movements, a robot usually needs to initiate its hitting movement before the opponent hits the ball, which requires the robot to be anticipatory based on visual observation of the opponent's movement. Previous work on Intention-Driven Dynamics Models (IDDM) allowed the robot to predict the intended target of the opponent. In this article, we address the problem of action selection and optimal timing for initiating a chosen action by formulating the anticipatory action selection as a Partially Observable Markov Decision Process (POMDP), where the transition and observation are modeled by the \{IDDM\} framework. We present two approaches to anticipatory action selection based on the \{POMDP\} formulation, i.e., a model-free policy learning method based on Least-Squares Policy Iteration (LSPI) that employs the \{IDDM\} for belief updates, and a model-based Monte-Carlo Planning (MCP) method, which benefits from the transition and observation model by the IDDM. Experimental results using real data in a simulated environment show the importance of anticipatory action selection, and that \{POMDPs\} are suitable to formulate the anticipatory action selection problem by taking into account the uncertainties in prediction. We also show that existing algorithms for POMDPs, such as \{LSPI\} and MCP, can be applied to substantially improve the robot's performance in its interaction with humans.

am ei

DOI Project Page [BibTex]

DOI Project Page [BibTex]


Early Stopping Without a Validation Set
Early Stopping Without a Validation Set

Mahsereci, M., Balles, L., Lassner, C., Hennig, P.

arXiv preprint arXiv:1703.09580, 2017 (article)

Abstract
Early stopping is a widely used technique to prevent poor generalization performance when training an over-expressive model by means of gradient-based optimization. To find a good point to halt the optimizer, a common practice is to split the dataset into a training and a smaller validation set to obtain an ongoing estimate of the generalization performance. In this paper we propose a novel early stopping criterion which is based on fast-to-compute, local statistics of the computed gradients and entirely removes the need for a held-out validation set. Our experiments show that this is a viable approach in the setting of least-squares and logistic regression as well as neural networks.

ps pn

link (url) Project Page Project Page [BibTex]


no image
Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning

Roos, F. D., Hennig, P.

arXiv preprint arXiv:1706.00241, 2017 (article)

Abstract
Solving symmetric positive definite linear problems is a fundamental computational task in machine learning. The exact solution, famously, is cubicly expensive in the size of the matrix. To alleviate this problem, several linear-time approximations, such as spectral and inducing-point methods, have been suggested and are now in wide use. These are low-rank approximations that choose the low-rank space a priori and do not refine it over time. While this allows linear cost in the data-set size, it also causes a finite, uncorrected approximation error. Authors from numerical linear algebra have explored ways to iteratively refine such low-rank approximations, at a cost of a small number of matrix-vector multiplications. This idea is particularly interesting in the many situations in machine learning where one has to solve a sequence of related symmetric positive definite linear problems. From the machine learning perspective, such deflation methods can be interpreted as transfer learning of a low-rank approximation across a time-series of numerical tasks. We study the use of such methods for our field. Our empirical results show that, on regression and classification problems of intermediate size, this approach can interpolate between low computational cost and numerical precision.

pn

link (url) Project Page [BibTex]


no image
Convergence Analysis of Deterministic Kernel-Based Quadrature Rules in Misspecified Settings

Kanagawa, M., Sriperumbudur, B. K., Fukumizu, K.

Arxiv e-prints, arXiv:1709.00147v1 [math.NA], 2017 (article)

Abstract
This paper presents convergence analysis of kernel-based quadrature rules in misspecified settings, focusing on deterministic quadrature in Sobolev spaces. In particular, we deal with misspecified settings where a test integrand is less smooth than a Sobolev RKHS based on which a quadrature rule is constructed. We provide convergence guarantees based on two different assumptions on a quadrature rule: one on quadrature weights, and the other on design points. More precisely, we show that convergence rates can be derived (i) if the sum of absolute weights remains constant (or does not increase quickly), or (ii) if the minimum distance between distance design points does not decrease very quickly. As a consequence of the latter result, we derive a rate of convergence for Bayesian quadrature in misspecified settings. We reveal a condition on design points to make Bayesian quadrature robust to misspecification, and show that, under this condition, it may adaptively achieve the optimal rate of convergence in the Sobolev space of a lesser order (i.e., of the unknown smoothness of a test integrand), under a slightly stronger regularity condition on the integrand.

pn

arXiv [BibTex]

arXiv [BibTex]


no image
New Directions for Learning with Kernels and Gaussian Processes (Dagstuhl Seminar 16481)

Gretton, A., Hennig, P., Rasmussen, C., Schölkopf, B.

Dagstuhl Reports, 6(11):142-167, 2017 (book)

ei pn

DOI [BibTex]

DOI [BibTex]


no image
Functionalised metal-organic frameworks: a novel approach to stabilising single metal atoms

Szilágyi, P. Á., Rogers, D. M., Zaiser, I., Callini, E., Turner, S., Borgschulte, A., Züttel, A., Geerlings, H., Hirscher, M., Dam, B.

{Journal of Materials Chemistry A}, 5(30):15559-15566, Royal Society of Chemistry, Cambridge, UK, 2017 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Exploiting diffusion barrier and chemical affinity of metal-organic frameworks for efficient hydrogen isotope separation

Kim, J. Y., Balderas-Xicohténcatl, R., Zhang, L., Kang, S. G., Hirscher, M., Oh, H., Moon, H. R.

{Journal of the American Chemical Society}, 139(42):15135-15141, American Chemical Society, Washington, DC, 2017 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Facile fabrication of mesoporous silica micro-jets with multi-functionalities

Vilela, D., Hortelao, A. C., Balderas-Xicohténcatl, R., Hirscher, M., Hahn, K., Ma, X., Sánchez, S.

{Nanoscale}, 9(37):13990-13997, Royal Society of Chemistry, Cambridge, UK, 2017 (article)

mms

DOI [BibTex]

DOI [BibTex]