Header logo is de


2013


Thumb xl humans3tracking
Markerless Motion Capture of Multiple Characters Using Multi-view Image Segmentation

Liu, Y., Gall, J., Stoll, C., Dai, Q., Seidel, H., Theobalt, C.

Transactions on Pattern Analysis and Machine Intelligence, 35(11):2720-2735, 2013 (article)

Abstract
Capturing the skeleton motion and detailed time-varying surface geometry of multiple, closely interacting peoples is a very challenging task, even in a multicamera setup, due to frequent occlusions and ambiguities in feature-to-person assignments. To address this task, we propose a framework that exploits multiview image segmentation. To this end, a probabilistic shape and appearance model is employed to segment the input images and to assign each pixel uniquely to one person. Given the articulated template models of each person and the labeled pixels, a combined optimization scheme, which splits the skeleton pose optimization problem into a local one and a lower dimensional global one, is applied one by one to each individual, followed with surface estimation to capture detailed nonrigid deformations. We show on various sequences that our approach can capture the 3D motion of humans accurately even if they move rapidly, if they wear wide apparel, and if they are engaged in challenging multiperson motions, including dancing, wrestling, and hugging.

ps

data and video pdf DOI Project Page [BibTex]

2013


data and video pdf DOI Project Page [BibTex]


Thumb xl perception
Viewpoint and pose in body-form adaptation

Sekunova, A., Black, M., Parkinson, L., Barton, J. J. S.

Perception, 42(2):176-186, 2013 (article)

Abstract
Faces and bodies are complex structures, perception of which can play important roles in person identification and inference of emotional state. Face representations have been explored using behavioural adaptation: in particular, studies have shown that face aftereffects show relatively broad tuning for viewpoint, consistent with origin in a high-level structural descriptor far removed from the retinal image. Our goals were to determine first, if body aftereffects also showed a degree of viewpoint invariance, and second if they also showed pose invariance, given that changes in pose create even more dramatic changes in the 2-D retinal image. We used a 3-D model of the human body to generate headless body images, whose parameters could be varied to generate different body forms, viewpoints, and poses. In the first experiment, subjects adapted to varying viewpoints of either slim or heavy bodies in a neutral stance, followed by test stimuli that were all front-facing. In the second experiment, we used the same front-facing bodies in neutral stance as test stimuli, but compared adaptation from bodies in the same neutral stance to adaptation with the same bodies in different poses. We found that body aftereffects were obtained over substantial viewpoint changes, with no significant decline in aftereffect magnitude with increasing viewpoint difference between adapting and test images. Aftereffects also showed transfer across one change in pose but not across another. We conclude that body representations may have more viewpoint invariance than faces, and demonstrate at least some transfer across pose, consistent with a high-level structural description. Keywords: aftereffect, shape, face, representation

ps

pdf from publisher abstract pdf link (url) Project Page [BibTex]

pdf from publisher abstract pdf link (url) Project Page [BibTex]


no image
Magnetically Actuated Soft Capsule With the Multimodal Drug Release Function

Yim, S., Goyal, K., Sitti, M.

IEEE/ASME Trans. on Mechatronics, 18(4):1413-1418, IEEE, 2013 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
A simulation and design tool for a passive rotation flapping wing mechanism

Arabagi, V., Hines, L., Sitti, M.

IEEE/ASME Transactions on Mechatronics, 18(2):787-798, 2013 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
GECKO-INSPIRED POLYMER ADHESIVES

Menguc, Yigit, Metin, Metin

Polymer Adhesion, Friction, and Lubrication, pages: 351, Wiley, 2013 (article)

pi

[BibTex]

[BibTex]


no image
Near and far-wall effects on the three-dimensional motion of bacteria-driven microbeads

Edwards, M. R., Wright Carlsen, R., Sitti, M.

Applied Physics Letters, 102(14):143701, AIP, 2013 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
SoftCubes: towards a soft modular matter

Yim, S., Sitti, M.

In Robotics and Automation (ICRA), 2013 IEEE International Conference on, pages: 530-536, 2013 (inproceedings)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Self-organized state formation in magnonic vortex crystals

Adolff, C. F., Hänze, M., Vogel, A., Weigand, M., Martens, M., Meier, G.

{Physical Review B}, 88(22), American Physical Society, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Erratum: Generalized Gilbert equation including inertial damping: Derivation from an extended breathing Fermi surface model [Phys. Rev. B 84, 172403 (2011)]

Fähnle, M., Steiauf, D., Illg, C.

{Physical Review B}, 88, American Physical Society, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Strain and composition dependence of orbital polarization in nickel oxide superlattices

Wu, M., Benckiser, E., Haverkort, M. W., Franco, A., Lu, J., Nwankwo, U., Brück, S., Audehm, P., Goering, E., Macke, S., Hinkov, V., Wochner, P., Christiani, G., Heinze, S., Logvenov, G., Habermeier, H., Keimer, B.

{Physical Review B}, 88, Published by the American Physical Society through the American Institute of Physics, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Efficient focusing of 8 keV X-rays with multilayer Fresnel zone plates fabricated by atomic layer deposition and focused ion beam milling

Mayer, M., Keskinbora, K., Grévent, C., Szeghalmi, A., Knez, M., Weigand, M., Snigirev, A., Snigereva, I., Schütz, G.

{Journal of Synchrotron Radiation}, 20, pages: 433-440, Published for the International Union of Crystallography by Munksgaard, Copenhagen, Denmark, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Rapid prototyping of Fresnel zone plates via direct Ga+ ion beam lithography for high-resolution x-ray imaging

Keskinbora, K., Grévent, C., Eigenthaler, U., Weigand, M., Schütz, G.

{ACS Nano}, 7(11):9788-9797, American Chemical Society, Washington, DC, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Eine kryoflexible kovalente organische Gerüststruktur für die effiziente Trennung von Wasserstoffisotopien durch Quantensieben

Oh, H., Kalidindi, S. B., Um, Y., Bureekaew, S., Schmid, R., Fischer, R. A., Hirscher, M.

{Angewandte Chemie}, 125(50):13461-13464, Wiley-VCH Verl., Weinheim, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Ultrafast demagnetization after laser irradiation in transition metals: Ab initio calculations of the spin-flip electron-phonon scattering with reduced exchange splitting

Illg, C., Haag, M., Fähnle, M.

{Physical Review B}, 88, American Physical Society, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Phase diagram for magnetic vortex core switching studied by ferromagnetic absorption spectroscopy and time-resolved transmission x-ray microscopy

Martens, M., Kamionka, T., Weigand, M., Stoll, H., Tyliszczak, T., Meier, G.

{Physical Review B}, 87, Published by the American Physical Society through the American Institute of Physics, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Abstraction in Decision-Makers with Limited Information Processing Capabilities

Genewein, T, Braun, DA

pages: 1-9, NIPS Workshop Planning with Information Constraints for Control, Reinforcement Learning, Computational Neuroscience, Robotics and Games, December 2013 (conference)

Abstract
A distinctive property of human and animal intelligence is the ability to form abstractions by neglecting irrelevant information which allows to separate structure from noise. From an information theoretic point of view abstractions are desirable because they allow for very efficient information processing. In artificial systems abstractions are often implemented through computationally costly formations of groups or clusters. In this work we establish the relation between the free-energy framework for decision-making and rate-distortion theory and demonstrate how the application of rate-distortion for decision-making leads to the emergence of abstractions. We argue that abstractions are induced due to a limit in information processing capacity.

ei

link (url) [BibTex]

link (url) [BibTex]


Thumb xl houghforest
Class-Specific Hough Forests for Object Detection

Gall, J., Lempitsky, V.

In Decision Forests for Computer Vision and Medical Image Analysis, pages: 143-157, 11, (Editors: Criminisi, A. and Shotton, J.), Springer, 2013 (incollection)

ps

code Project Page [BibTex]

code Project Page [BibTex]


no image
Tank-like module-based climbing robot using passive compliant joints

Seo, T., Sitti, M.

IEEE/ASME Transactions on Mechatronics, 18(1):397-408, 2013 (article)

pi

[BibTex]

[BibTex]


no image
Flapping wings via direct-driving by DC motors

Azhar, M., Campolo, D., Lau, G., Hines, L., Sitti, M.

In Robotics and Automation (ICRA), 2013 IEEE International Conference on, pages: 1397-1402, 2013 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Three dimensional independent control of multiple magnetic microrobots

Diller, E., Giltinan, J., Jena, P., Sitti, M.

In Robotics and Automation (ICRA), 2013 IEEE International Conference on, pages: 2576-2581, 2013 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Enhanced fabrication and characterization of gecko-inspired mushroom-tipped microfiber adhesives

Song, J., Mengüç, Y., Sitti, M.

Journal of Adhesion Science and Technology, 27(17):1921-1932, Routledge, 2013 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

Zahedi, K., Martius, G., Ay, N.

Frontiers in Psychology, 4(801), 2013 (article)

Abstract
One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviours. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary step in which we investigate the predictive information (the mutual information of the past and future of the sensor stream) as an intrinsic drive, ideally supporting any kind of task acquisition. Previous experiments have shown that the predictive information (PI) is a good candidate to support autonomous, open-ended learning of complex behaviours, because a maximisation of the PI corresponds to an exploration of morphology- and environment-dependent behavioural regularities. The idea is that these regularities can then be exploited in order to solve any given task. Three different experiments are presented and their results lead to the conclusion that the linear combination of the one-step PI with an external reward function is not generally recommended in an episodic policy gradient setting. Only for hard tasks a great speed-up can be achieved at the cost of an asymptotic performance lost.

al

link (url) DOI [BibTex]


no image
Switching modes in easy and hard axis magnetic reversal in a self-assembled antidot array

Haering, F., Wiedwald, U., Nothelfer, S., Koslowski, B., Ziemann, P., Lechner, L., Wallucks, A., Lebecki, K., Nowak, U., Gräfe, J., Goering, E., Schütz, G.

{Nanotechnology}, 24, IOP Pub., Bristol, UK, 2013 (article)

mms

DOI Project Page [BibTex]

DOI Project Page [BibTex]


no image
Time-resolved imaging of nonlinear magnetic domain-wall dynamics in ferromagnetic nanowires

Stein, F.-U., Bocklage, L., Weigand, M., Meier, G.

{Scientific Reports}, 3, Nature Publishing Group, London, UK, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
A cryogenically flexible covalent organic framework for efficient hydrogen isotrope separation by quantum sieving

Oh, H., Kalidindi, S. B., Um, Y., Bureekaew, S., Schmid, R., Fischer, R. A., Hirscher, M.

{Angewandte Chemie International Edition in English}, 52(50):13219-13222, Wiley-VCH Verlag GmbH & Co. KGaA, D-69451 Weinheim, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Unexpected room-temperature ferromagnetism in bulk ZnO

Chen, Y., Goering, E., Jeurgens, L., Wang, Z., Phillipp, F., Baier, J., Tietze, T., Schütz, G.

{Applied Physics Letters}, (103), American Institute of Physics, Melville, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Large-area hard magnetic L10-FePt and composite L10-FePt based nanopatterns

Goll, D., Bublat, T.

{Physica Status Solidi A-Applications and Materials Science}, 210(7):1261-1271, Wiley-VCH, Weinheim, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Wave modes of collective vortex gyration in dipolar-coupled-dot-array magnonic crystals

Han, D., Vogel, A., Jung, H., Lee, K., Weigand, M., Stoll, H., Schütz, G., Fischer, P., Meier, G., Kim, S.

{Scientific Reports}, 3, Nature Publishing Group, London, UK, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Controlled Reduction with Unactuated Cyclic Variables: Application to 3D Bipedal Walking with Passive Yaw Rotation

Gregg, R., Righetti, L.

IEEE Transactions on Automatic Control, 58(10):2679-2685, October 2013 (article)

Abstract
This technical note shows that viscous damping can shape momentum conservation laws in a manner that stabilizes yaw rotation and enables steering for underactuated 3D walking. We first show that unactuated cyclic variables can be controlled by passively shaped conservation laws given a stabilizing controller in the actuated coordinates. We then exploit this result to realize controlled geometric reduction with multiple unactuated cyclic variables. We apply this underactuated control strategy to a five-link 3D biped to produce exponentially stable straight-ahead walking and steering in the presence of passive yawing.

mg

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
Learning Task Error Models for Manipulation

Pastor, P., Kalakrishnan, M., Binney, J., Kelly, J., Righetti, L., Sukhatme, G. S., Schaal, S.

In 2013 IEEE Conference on Robotics and Automation, IEEE, Karlsruhe, Germany, 2013 (inproceedings)

Abstract
Precise kinematic forward models are important for robots to successfully perform dexterous grasping and manipulation tasks, especially when visual servoing is rendered infeasible due to occlusions. A lot of research has been conducted to estimate geometric and non-geometric parameters of kinematic chains to minimize reconstruction errors. However, kinematic chains can include non-linearities, e.g. due to cable stretch and motor-side encoders, that result in significantly different errors for different parts of the state space. Previous work either does not consider such non-linearities or proposes to estimate non-geometric parameters of carefully engineered models that are robot specific. We propose a data-driven approach that learns task error models that account for such unmodeled non-linearities. We argue that in the context of grasping and manipulation, it is sufficient to achieve high accuracy in the task relevant state space. We identify this relevant state space using previously executed joint configurations and learn error corrections for those. Therefore, our system is developed to generate subsequent executions that are similar to previous ones. The experiments show that our method successfully captures the non-linearities in the head kinematic chain (due to a counterbalancing spring) and the arm kinematic chains (due to cable stretch) of the considered experimental platform, see Fig. 1. The feasibility of the presented error learning approach has also been evaluated in independent DARPA ARM-S testing contributing to successfully complete 67 out of 72 grasping and manipulation tasks.

am mg

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
Bounded Rational Decision-Making in Changing Environments

Grau-Moya, J, Braun, DA

pages: 1-9, NIPS Workshop Planning with Information Constraints for Control, Reinforcement Learning, Computational Neuroscience, Robotics and Games, December 2013 (conference)

Abstract
A perfectly rational decision-maker chooses the best action with the highest utility gain from a set of possible actions. The optimality principles that describe such decision processes do not take into account the computational costs of finding the optimal action. Bounded rational decision-making addresses this problem by specifically trading off information-processing costs and expected utility. Interestingly, a similar trade-off between energy and entropy arises when describing changes in thermodynamic systems. This similarity has been recently used to describe bounded rational agents. Crucially, this framework assumes that the environment does not change while the decision-maker is computing the optimal policy. When this requirement is not fulfilled, the decision-maker will suffer inefficiencies in utility, that arise because the current policy is optimal for an environment in the past. Here we borrow concepts from non-equilibrium thermodynamics to quantify these inefficiencies and illustrate with simulations its relationship with computational resources.

ei

link (url) [BibTex]

link (url) [BibTex]


Thumb xl 2013 ivc rkek teaser
Non-parametric hand pose estimation with object context

Romero, J., Kjellström, H., Ek, C. H., Kragic, D.

Image and Vision Computing , 31(8):555 - 564, 2013 (article)

Abstract
In the spirit of recent work on contextual recognition and estimation, we present a method for estimating the pose of human hands, employing information about the shape of the object in the hand. Despite the fact that most applications of human hand tracking involve grasping and manipulation of objects, the majority of methods in the literature assume a free hand, isolated from the surrounding environment. Occlusion of the hand from grasped objects does in fact often pose a severe challenge to the estimation of hand pose. In the presented method, object occlusion is not only compensated for, it contributes to the pose estimation in a contextual fashion; this without an explicit model of object shape. Our hand tracking method is non-parametric, performing a nearest neighbor search in a large database (.. entries) of hand poses with and without grasped objects. The system that operates in real time, is robust to self occlusions, object occlusions and segmentation errors, and provides full hand pose reconstruction from monocular video. Temporal consistency in hand pose is taken into account, without explicitly tracking the hand in the high-dim pose space. Experiments show the non-parametric method to outperform other state of the art regression methods, while operating at a significantly lower computational cost than comparable model-based hand tracking methods.

ps

Publisher site pdf link (url) [BibTex]

Publisher site pdf link (url) [BibTex]


no image
A Perching Mechanism for Flying Robots Using a Fibre-Based Adhesive

Daler, L., Klaptocz, A., Briod, A., Sitti, M., Floreano, D.

In Robotics and Automation (ICRA), 2013 IEEE International Conference on, 2013 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Micro-scale mobile robotics

Diller, E., Sitti, M.

Foundations and Trends in Robotics, 2(3):143-259, Now Publishers Incorporated, 2013 (article)

pi

[BibTex]

[BibTex]


no image
Survey and Introduction to the Focused Section on Bio-Inspired Mechatronics

Sitti, M., Menciassi, A., Ijspeert, A., Low, K. H., Kim, S.

Mechatronics, IEEE/ASME Transactions on, 18(2):409-418, DOI: 10.1109/TMECH.2012. 2233492, 2013 (article)

pi

[BibTex]

[BibTex]


no image
Bonding methods for modular micro-robotic assemblies

Diller, E., Zhang, N., Sitti, M.

In Robotics and Automation (ICRA), 2013 IEEE International Conference on, pages: 2588-2593, 2013 (inproceedings)

pi

[BibTex]

[BibTex]


no image
Robustness of guided self-organization against sensorimotor disruptions

Martius, G.

Advances in Complex Systems, 16(02n03):1350001, 2013 (article)

Abstract
Self-organizing processes are crucial for the development of living beings. Practical applications in robots may benefit from the self-organization of behavior, e.g.~to increase fault tolerance and enhance flexibility, provided that external goals can also be achieved. We present results on the guidance of self-organizing control by visual target stimuli and show a remarkable robustness to sensorimotor disruptions. In a proof of concept study an autonomous wheeled robot is learning an object finding and ball-pushing task from scratch within a few minutes in continuous domains. The robustness is demonstrated by the rapid recovery of the performance after severe changes of the sensor configuration.

al

DOI [BibTex]

DOI [BibTex]


no image
Ferromagnetism of zinc oxide nanograined films

Straumal, B. B., Protasova, S. G., Mazilkin, A. A., Schütz, G., Goering, E., Baretzky, B., Straumal, P. B.

{Journal of Experimental and Theoretical Physics Letters}, 97(6):367-377, Pleiades Publishing, Inc., 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Hydrogen adsorption properties of platinum decorated hierarchically structured templated carbons

Oh, H., Gennett, T., Atanassov, P., Kurttepeli, M., Bals, S., Hurst, K. E., Hirscher, M.

{Microporous and Mesoporous Materials}, pages: 66-74, Elsevier, Amsterdam, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Extended s-d models for the dynamics of noncollinear magnetization: Short review of two different approaches

Fähnle, M., Zhang, S.

{Journal of Magnetism and Magnetic Materials}, 326, pages: 232-234, NH, Elsevier, Amsterdam, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Correlation between spin structure oscillations and domain wall velocities

Bisig, A., Stärk, M., Mawass, M., Moutafis, C., Rhensius, J., Heidler, J., Büttner, F., Noske, M., Weigand, M., Eisebitt, S., Tyliszczak, T., Van Wayenberge, B., Stoll, H., Schütz, G., Kläui, M.

{Nature Communications}, 4, Nature Publishing Group, London, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Recent advances in use of atomic layer deposition and focused ion beams for fabrication of Fresnel zone plates for hard x-rays

Keskinbora, K., Robisch, A., Mayer, M., Grévent, C., Szeghalmi, A. V., Knez, M., Weigand, M., Snigireva, I., Snigirev, A., Salditt, T., Schütz, G.

{Proceedings of SPIE (The International Society for Optical Engineering)}, 8851, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Magnetic states in low-pinning high-anisotropy material nanostructures suitable for dynamic imaging

Büttner, F., Moutafis, C., Bisig, A., Wohlhüter, P., Günther, C. M., Mohanty, J., Geilhufe, J., Schneider, M., v. Korff Schmising, C., Schaffert, S., Pfau, B., Hantschmann, M., Riemeier, M., Emmel, M., Finizio, S., Jakob, G., Weigand, M., Rhensius, J., Franken, J. H., Lavrijsen, R., Swagten, H. J. M., Stoll, H., Eisebitt, S., Kläui, M.

{Physical Review B}, 87, Published by the American Physical Society through the American Institute of Physics, Woodbury, NY, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]


no image
Experimental and theoretical study of D2/H2 quantum sieving in a carbon molecular sieve

Gotzias, A., Charalambopoulou, G., Ampoumogli, A., Krkljus, I., Hirscher, M., Steriotis, T.

{Adsorption}, 19(2-4):373-379, Springer Science+Business Media, New York, 2013 (article)

mms

DOI [BibTex]

DOI [BibTex]

1997


no image
Locally weighted learning

Atkeson, C. G., Moore, A. W., Schaal, S.

Artificial Intelligence Review, 11(1-5):11-73, 1997, clmc (article)

Abstract
This paper surveys locally weighted learning, a form of lazy learning and memory-based learning, and focuses on locally weighted linear regression. The survey discusses distance functions, smoothing parameters, weighting functions, local model structures, regularization of the estimates and bias, assessing predictions, handling noisy data and outliers, improving the quality of predictions by tuning fit parameters, interference between old and new data, implementing locally weighted learning efficiently, and applications of locally weighted learning. A companion paper surveys how locally weighted learning can be used in robot learning and control. Keywords: locally weighted regression, LOESS, LWR, lazy learning, memory-based learning, least commitment learning, distance functions, smoothing parameters, weighting functions, global tuning, local tuning, interference.

am

link (url) [BibTex]

1997


link (url) [BibTex]


no image
Locally weighted learning for control

Atkeson, C. G., Moore, A. W., Schaal, S.

Artificial Intelligence Review, 11(1-5):75-113, 1997, clmc (article)

Abstract
Lazy learning methods provide useful representations and training algorithms for learning about complex phenomena during autonomous adaptive control of complex systems. This paper surveys ways in which locally weighted learning, a type of lazy learning, has been applied by us to control tasks. We explain various forms that control tasks can take, and how this affects the choice of learning paradigm. The discussion section explores the interesting impact that explicitly remembering all previous experiences has on the problem of learning to control. Keywords: locally weighted regression, LOESS, LWR, lazy learning, memory-based learning, least commitment learning, forward models, inverse models, linear quadratic regulation (LQR), shifting setpoint algorithm, dynamic programming.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Learning from demonstration

Schaal, S.

In Advances in Neural Information Processing Systems 9, pages: 1040-1046, (Editors: Mozer, M. C.;Jordan, M.;Petsche, T.), MIT Press, Cambridge, MA, 1997, clmc (inproceedings)

Abstract
By now it is widely accepted that learning a task from scratch, i.e., without any prior knowledge, is a daunting undertaking. Humans, however, rarely attempt to learn from scratch. They extract initial biases as well as strategies how to approach a learning problem from instructions and/or demonstrations of other humans. For learning control, this paper investigates how learning from demonstration can be applied in the context of reinforcement learning. We consider priming the Q-function, the value function, the policy, and the model of the task dynamics as possible areas where demonstrations can speed up learning. In general nonlinear learning problems, only model-based reinforcement learning shows significant speed-up after a demonstration, while in the special case of linear quadratic regulator (LQR) problems, all methods profit from the demonstration. In an implementation of pole balancing on a complex anthropomorphic robot arm, we demonstrate that, when facing the complexities of real signal processing, model-based reinforcement learning offers the most robustness for LQR problems. Using the suggested methods, the robot learns pole balancing in just a single trial after a 30 second long demonstration of the human instructor. 

am

link (url) [BibTex]

link (url) [BibTex]


no image
Robot learning from demonstration

Atkeson, C. G., Schaal, S.

In Machine Learning: Proceedings of the Fourteenth International Conference (ICML ’97), pages: 12-20, (Editors: Fisher Jr., D. H.), Morgan Kaufmann, Nashville, TN, July 8-12, 1997, 1997, clmc (inproceedings)

Abstract
The goal of robot learning from demonstration is to have a robot learn from watching a demonstration of the task to be performed. In our approach to learning from demonstration the robot learns a reward function from the demonstration and a task model from repeated attempts to perform the task. A policy is computed based on the learned reward function and task model. Lessons learned from an implementation on an anthropomorphic robot arm using a pendulum swing up task include 1) simply mimicking demonstrated motions is not adequate to perform this task, 2) a task planner can use a learned model and reward function to compute an appropriate policy, 3) this model-based planning process supports rapid learning, 4) both parametric and nonparametric models can be learned and used, and 5) incorporating a task level direct learning component, which is non-model-based, in addition to the model-based planner, is useful in compensating for structural modeling errors and slow model learning. 

am

link (url) [BibTex]

link (url) [BibTex]


no image
Local dimensionality reduction for locally weighted learning

Vijayakumar, S., Schaal, S.

In International Conference on Computational Intelligence in Robotics and Automation, pages: 220-225, Monteray, CA, July10-11, 1997, 1997, clmc (inproceedings)

Abstract
Incremental learning of sensorimotor transformations in high dimensional spaces is one of the basic prerequisites for the success of autonomous robot devices as well as biological movement systems. So far, due to sparsity of data in high dimensional spaces, learning in such settings requires a significant amount of prior knowledge about the learning task, usually provided by a human expert. In this paper we suggest a partial revision of the view. Based on empirical studies, it can been observed that, despite being globally high dimensional and sparse, data distributions from physical movement systems are locally low dimensional and dense. Under this assumption, we derive a learning algorithm, Locally Adaptive Subspace Regression, that exploits this property by combining a local dimensionality reduction as a preprocessing step with a nonparametric learning technique, locally weighted regression. The usefulness of the algorithm and the validity of its assumptions are illustrated for a synthetic data set and data of the inverse dynamics of an actual 7 degree-of-freedom anthropomorphic robot arm.

am

link (url) [BibTex]

link (url) [BibTex]