Header logo is


2017


On the Design of {LQR} Kernels for Efficient Controller Learning
On the Design of LQR Kernels for Efficient Controller Learning

Marco, A., Hennig, P., Schaal, S., Trimpe, S.

Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), pages: 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (conference)

Abstract
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

am ics pn

arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]

2017


arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI Project Page [BibTex]


Coupling Adaptive Batch Sizes with Learning Rates
Coupling Adaptive Batch Sizes with Learning Rates

Balles, L., Romero, J., Hennig, P.

In Proceedings Conference on Uncertainty in Artificial Intelligence (UAI) 2017, pages: 410-419, (Editors: Gal Elidan and Kristian Kersting), Association for Uncertainty in Artificial Intelligence (AUAI), Conference on Uncertainty in Artificial Intelligence (UAI), August 2017 (inproceedings)

Abstract
Mini-batch stochastic gradient descent and variants thereof have become standard for large-scale empirical risk minimization like the training of neural networks. These methods are usually used with a constant batch size chosen by simple empirical inspection. The batch size significantly influences the behavior of the stochastic optimization algorithm, though, since it determines the variance of the gradient estimates. This variance also changes over the optimization process; when using a constant batch size, stability and convergence is thus often enforced by means of a (manually tuned) decreasing learning rate schedule. We propose a practical method for dynamic batch size adaptation. It estimates the variance of the stochastic gradients and adapts the batch size to decrease the variance proportionally to the value of the objective function, removing the need for the aforementioned learning rate decrease. In contrast to recent related work, our algorithm couples the batch size to the learning rate, directly reflecting the known relationship between the two. On three image classification benchmarks, our batch size adaptation yields faster optimization convergence, while simultaneously simplifying learning rate tuning. A TensorFlow implementation is available.

ps pn

Code link (url) Project Page [BibTex]

Code link (url) Project Page [BibTex]


no image
Dynamic Time-of-Flight

Schober, M., Adam, A., Yair, O., Mazor, S., Nowozin, S.

Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, pages: 170-179, IEEE, Piscataway, NJ, USA, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017 (conference)

ei pn

DOI [BibTex]

DOI [BibTex]


Virtual vs. {R}eal: Trading Off Simulations and Physical Experiments in Reinforcement Learning with {B}ayesian Optimization
Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pages: 1557-1563, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

am ics pn

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]

PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI Project Page [BibTex]


Scalable Pneumatic and Tendon Driven Robotic Joint Inspired by Jumping Spiders
Scalable Pneumatic and Tendon Driven Robotic Joint Inspired by Jumping Spiders

Sproewitz, A., Göttler, C., Sinha, A., Caer, C., Öztekin, M. U., Petersen, K., Sitti, M.

In Proceedings 2017 IEEE International Conference on Robotics and Automation (ICRA), pages: 64-70, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (inproceedings)

dlg

Video link (url) DOI Project Page [BibTex]

Video link (url) DOI Project Page [BibTex]


Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets
Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets

Klein, A., Falkner, S., Bartels, S., Hennig, P., Hutter, F.

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS 2017), 54, pages: 528-536, Proceedings of Machine Learning Research, (Editors: Sign, Aarti and Zhu, Jerry), PMLR, April 2017 (conference)

pn

pdf link (url) Project Page [BibTex]

pdf link (url) Project Page [BibTex]


Linking {Mechanics} and {Learning}
Linking Mechanics and Learning

Heim, S., Grimminger, F., Drama, Ö., Spröwitz, A.

In Proceedings of Dynamic Walking 2017, 2017 (inproceedings)

dlg

[BibTex]

[BibTex]


Is Growing Good for Learning?
Is Growing Good for Learning?

Heim, S., Spröwitz, A.

Proceedings of the 8th International Symposium on Adaptive Motion of Animals and Machines AMAM2017, 2017 (conference)

dlg

[BibTex]

[BibTex]


Evaluation of the passive dynamics of compliant legs with inertia
Evaluation of the passive dynamics of compliant legs with inertia

Györfi, B.

University of Applied Science Pforzheim, Germany, 2017 (mastersthesis)

dlg

[BibTex]

[BibTex]


no image
Understanding FORC using synthetic micro-structured systems with variable coupling- and coercivefield distributions

Groß, Felix

Universität Stuttgart, Stuttgart, 2017 (mastersthesis)

mms

[BibTex]


no image
Adsorption von Wasserstoffmolekülen in nanoporösen Gerüststrukturen

Kotzur, Nadine

Universität Stuttgart, Stuttgart, 2017 (mastersthesis)

mms

[BibTex]

[BibTex]

2012


Quasi-Newton Methods: A New Direction
Quasi-Newton Methods: A New Direction

Hennig, P., Kiefel, M.

In Proceedings of the 29th International Conference on Machine Learning, pages: 25-32, ICML ’12, (Editors: John Langford and Joelle Pineau), Omnipress, New York, NY, USA, ICML, July 2012 (inproceedings)

Abstract
Four decades after their invention, quasi- Newton methods are still state of the art in unconstrained numerical optimization. Although not usually interpreted thus, these are learning algorithms that fit a local quadratic approximation to the objective function. We show that many, including the most popular, quasi-Newton methods can be interpreted as approximations of Bayesian linear regression under varying prior assumptions. This new notion elucidates some shortcomings of classical algorithms, and lights the way to a novel nonparametric quasi-Newton method, which is able to make more efficient use of available information at computational cost similar to its predecessors.

ei ps pn

website+code pdf link (url) [BibTex]

2012


website+code pdf link (url) [BibTex]


no image
Learning Tracking Control with Forward Models

Bócsi, B., Hennig, P., Csató, L., Peters, J.

In pages: 259 -264, IEEE International Conference on Robotics and Automation (ICRA), May 2012 (inproceedings)

Abstract
Performing task-space tracking control on redundant robot manipulators is a difficult problem. When the physical model of the robot is too complex or not available, standard methods fail and machine learning algorithms can have advantages. We propose an adaptive learning algorithm for tracking control of underactuated or non-rigid robots where the physical model of the robot is unavailable. The control method is based on the fact that forward models are relatively straightforward to learn and local inversions can be obtained via local optimization. We use sparse online Gaussian process inference to obtain a flexible probabilistic forward model and second order optimization to find the inverse mapping. Physical experiments indicate that this approach can outperform state-of-the-art tracking control algorithms in this context.

ei pn

PDF Web DOI [BibTex]

PDF Web DOI [BibTex]


no image
Approximate Gaussian Integration using Expectation Propagation

Cunningham, J., Hennig, P., Lacoste-Julien, S.

In pages: 1-11, -, January 2012 (inproceedings) Submitted

Abstract
While Gaussian probability densities are omnipresent in applied mathematics, Gaussian cumulative probabilities are hard to calculate in any but the univariate case. We offer here an empirical study of the utility of Expectation Propagation (EP) as an approximate integration method for this problem. For rectangular integration regions, the approximation is highly accurate. We also extend the derivations to the more general case of polyhedral integration regions. However, we find that in this polyhedral case, EP's answer, though often accurate, can be almost arbitrarily wrong. These unexpected results elucidate an interesting and non-obvious feature of EP not yet studied in detail, both for the problem of Gaussian probabilities and for EP more generally.

ei pn

Web [BibTex]

Web [BibTex]


no image
Kernel Topic Models

Hennig, P., Stern, D., Herbrich, R., Graepel, T.

In Fifteenth International Conference on Artificial Intelligence and Statistics, 22, pages: 511-519, JMLR Proceedings, (Editors: Lawrence, N. D. and Girolami, M.), JMLR.org, AISTATS , 2012 (inproceedings)

Abstract
Latent Dirichlet Allocation models discrete data as a mixture of discrete distributions, using Dirichlet beliefs over the mixture weights. We study a variation of this concept, in which the documents' mixture weight beliefs are replaced with squashed Gaussian distributions. This allows documents to be associated with elements of a Hilbert space, admitting kernel topic models (KTM), modelling temporal, spatial, hierarchical, social and other structure between documents. The main challenge is efficient approximate inference on the latent Gaussian. We present an approximate algorithm cast around a Laplace approximation in a transformed basis. The KTM can also be interpreted as a type of Gaussian process latent variable model, or as a topic model conditional on document features, uncovering links between earlier work in these areas.

ei pn

PDF Web [BibTex]

PDF Web [BibTex]


Development of a Minimalistic Pneumatic Quadruped Robot for Fast Locomotion
Development of a Minimalistic Pneumatic Quadruped Robot for Fast Locomotion

Narioka, K., Rosendo, A., Spröwitz, A., Hosoda, K.

In Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2012, pages: 307-311, IEEE, Guangzhou, 2012 (inproceedings)

Abstract
In this paper, we describe the development of the quadruped robot ”Ken” with the minimalistic and lightweight body design for achieving fast locomotion. We use McKibben pneumatic artificial muscles as actuators, providing high frequency and wide stride motion of limbs, also avoiding problems with overheating. We conducted a preliminary experiment, finding out that the robot can swing its limb over 7.5 Hz without amplitude reduction, nor heat problems. Moreover, the robot realized a several steps of bouncing gait by using simple CPG-based open loop controller, indicating that the robot can generate enough torque to kick the ground and limb contraction to avoid stumbling.

dlg

DOI [BibTex]

DOI [BibTex]


Locomotion through Reconfiguration based on Motor Primitives for Roombots Self-Reconfigurable Modular Robots
Locomotion through Reconfiguration based on Motor Primitives for Roombots Self-Reconfigurable Modular Robots

Bonardi, S., Moeckel, R., Spröwitz, A., Vespignani, M., Ijspeert, A. J.

In Robotics; Proceedings of ROBOTIK 2012; 7th German Conference on, pages: 1-6, 2012 (inproceedings)

Abstract
We present the hardware and reconfiguration experiments for an autonomous self-reconfigurable modular robot called Roombots (RB). RB were designed to form the basis for self-reconfigurable furniture. Each RB module contains three degrees of freedom that have been carefully selected to allow a single module to reach any position on a 2-dimensional grid and to overcome concave corners in a 3-dimensional grid. For the first time we demonstrate locomotion capabilities of single RB modules through reconfiguration with real hardware. The locomotion through reconfiguration is controlled by a planner combining the well-known D* algorithm and composed motor primitives. The novelty of our approach is the use of an online running hierarchical planner closely linked to the real hardware.

dlg

link (url) [BibTex]

link (url) [BibTex]


no image
Wasserstoffspeicherkapazität poröser Materialien in Kryoadsorptionstanks

Schlichtenmayer, M.

Universität Stuttgart, Stuttgart, 2012 (phdthesis)

mms

link (url) [BibTex]

link (url) [BibTex]


no image
Behandlung stark nichtkollinearer Magnetisierungsstrukturen mit der Spin-Cluster-Entwicklung

Dietermann, F.

Universität Stuttgart, Stuttgart, 2012 (phdthesis)

mms

link (url) [BibTex]

link (url) [BibTex]


no image
Spin wave mediated magnetic vortex core reversal

Stoll, H.

In 8461, San Diego, California, USA, 2012 (inproceedings)

mms

DOI [BibTex]

DOI [BibTex]


no image
Spinwelleninduziertes Schalten magnetischer Vortexkerne

Kammerer, M.

Universität Stuttgart, Stuttgart, 2012 (phdthesis)

mms

link (url) [BibTex]

link (url) [BibTex]


no image
Die Stabilität des stromtragenden Zustands in MgB2 Schichten mit modifizierter Mikrostruktur

Treiber, S.

Universität Stuttgart, Stuttgart, 2012 (phdthesis)

mms

[BibTex]

[BibTex]


no image
Hartmagnetische L10-FePt basierte gro\ssflächige Nanomuster mittels Nanoimprint-Lithografie

Bublat, T.

Universität Stuttgart, Stuttgart, 2012 (phdthesis)

mms

[BibTex]

[BibTex]

2005


Adaptation of Central Pattern Generators to Preexisting Mechanical Structure
Adaptation of Central Pattern Generators to Preexisting Mechanical Structure

Spröwitz, A.

Technische Universität Ilmenau, Ilmenau, 2005 (mastersthesis)

dlg

[BibTex]

2005


[BibTex]


no image
Untersuchung über die Grenzflächenmagnetisierung an (FM) Kobalt- und (AFM) Eisen-Mangan-Schichten

Harlander, M.

Stuttgart, Universität Stuttgart, 2005 (mastersthesis)

mms

[BibTex]

[BibTex]


no image
Magnetization reversal behavior of nanogranular CoCrPt alloy thin films studied with magnetic transmission X-ray microscopy

Fischer, P., Im, M., Eimüller, T., Schütz, G., Shin, S.

In 286, pages: 311-314, Boulder, CO, USA, 2005 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Defects distribution of Pr2Fe14B hard magnetic magnet from amorphous to nanostructures characterized by positron annihilation spectroscopy

Wu, Y. C., Sprengel, W., Reimann, K., Reichle, K. J., Goll, D., Würschum, R., Schaefer, H. E.

In PRICM 5. Proceedings of the Fifth Pacific RIM International Conference on Advanced Materials and Processing, 475-479, pages: 2123-2126, Materials Science Forum, Trans Tech, Beijing, China, 2005 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Implementing sub-ns time resolution into magnetic X-ray microscopies

Puzic, A., Stoll, H., Fischer, P., Van Waeyenberge, B., Raabe, J., Denbeaux, G., Haug, T., Weiss, D., Schütz, G.

In T115, pages: 1029-1031, Malmö/Lund, Sweden, 2005 (inproceedings)

mms

[BibTex]

[BibTex]