Header logo is


2011


no image
Learning, planning, and control for quadruped locomotion over challenging terrain

Kalakrishnan, Mrinal, Buchli, Jonas, Pastor, Peter, Mistry, Michael, Schaal, S.

International Journal of Robotics Research, 30(2):236-258, February 2011 (article)

am

[BibTex]

2011


[BibTex]


no image
Design and application of a wire-driven bidirectional telescopic mechanism for workspace expansion with a focus on shipbuilding tasks

Lee, D., Chang, D., Shin, Y., Son, D., Kim, T., Lee, K., Kim, J.

Advanced Robotics, 25, 2011 (article)

pi

[BibTex]

[BibTex]


no image
Bayesian robot system identification with input and output noise

Ting, J., D’Souza, A., Schaal, S.

Neural Networks, 24(1):99-108, 2011, clmc (article)

Abstract
For complex robots such as humanoids, model-based control is highly beneficial for accurate tracking while keeping negative feedback gains low for compliance. However, in such multi degree-of-freedom lightweight systems, conventional identification of rigid body dynamics models using CAD data and actuator models is inaccurate due to unknown nonlinear robot dynamic effects. An alternative method is data-driven parameter estimation, but significant noise in measured and inferred variables affects it adversely. Moreover, standard estimation procedures may give physically inconsistent results due to unmodeled nonlinearities or insufficiently rich data. This paper addresses these problems, proposing a Bayesian system identification technique for linear or piecewise linear systems. Inspired by Factor Analysis regression, we develop a computationally efficient variational Bayesian regression algorithm that is robust to ill-conditioned data, automatically detects relevant features, and identifies input and output noise. We evaluate our approach on rigid body parameter estimation for various robotic systems, achieving an error of up to three times lower than other state-of-the-art machine learning methods

am

link (url) [BibTex]

link (url) [BibTex]


no image
Learning variable impedance control

Buchli, J., Stulp, F., Theodorou, E., Schaal, S.

International Journal of Robotics Research, 2011, clmc (article)

Abstract
One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not trivial to derive variable impedance controllers for practical high degree-of-freedom (DOF) robotic tasks. In this contribution, we accomplish such variable impedance control with the reinforcement learning (RL) algorithm PISq ({f P}olicy {f I}mprovement with {f P}ath {f I}ntegrals). PISq is a model-free, sampling based learning method derived from first principles of stochastic optimal control. The PISq algorithm requires no tuning of algorithmic parameters besides the exploration noise. The designer can thus fully focus on cost function design to specify the task. From the viewpoint of robotics, a particular useful property of PISq is that it can scale to problems of many DOFs, so that reinforcement learning on real robotic systems becomes feasible. We sketch the PISq algorithm and its theoretical properties, and how it is applied to gain scheduling for variable impedance control. We evaluate our approach by presenting results on several simulated and real robots. We consider tasks involving accurate tracking through via-points, and manipulation tasks requiring physical contact with the environment. In these tasks, the optimal strategy requires both tuning of a reference trajectory emph{and} the impedance of the end-effector. The results show that we can use path integral based reinforcement learning not only for planning but also to derive variable gain feedback controllers in realistic scenarios. Thus, the power of variable impedance control is made available to a wide variety of robotic systems and practical applications.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Iterative path integral stochastic optimal control: Theory and applications to motor control

Theodorou, E. A.

University of Southern California, University of Southern California, Los Angeles, CA, 2011 (phdthesis)

am

PDF [BibTex]

PDF [BibTex]


no image
Learning of grasp selection based on shape-templates

Herzog, A.

Karlsruhe Institute of Technology, 2011 (mastersthesis)

am

[BibTex]

[BibTex]


no image
Waalbot II: Adhesion recovery and improved performance of a climbing robot using fibrillar adhesives

Murphy, M. P., Kute, C., Mengüç, Y., Sitti, M.

The International Journal of Robotics Research, 30(1):118-133, SAGE Publications Sage UK: London, England, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Automated 2-D nanoparticle manipulation using atomic force microscopy

Onal, C. D., Ozcan, O., Sitti, M.

IEEE Transactions on Nanotechnology, 10(3):472-481, IEEE, 2011 (article)

pi

[BibTex]

[BibTex]


no image
Biaxial mechanical modeling of the small intestine

Bellini, C., Glass, P., Sitti, M., Di Martino, E. S.

Journal of the mechanical behavior of biomedical materials, 4(8):1727-1740, Elsevier, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Understanding haptics by evolving mechatronic systems

Loeb, G. E., Tsianos, G.A., Fishel, J.A., Wettels, N., Schaal, S.

Progress in Brain Research, 192, pages: 129, 2011 (article)

am

[BibTex]

[BibTex]


no image
Assembly and disassembly of magnetic mobile micro-robots towards deterministic 2-D reconfigurable micro-systems

Diller, E., Pawashe, C., Floyd, S., Sitti, M.

The International Journal of Robotics Research, 30(14):1667-1680, SAGE Publications Sage UK: London, England, 2011 (article)

pi

[BibTex]

[BibTex]


no image
Modeling of stochastic motion of bacteria propelled spherical microbeads

Arabagi, V., Behkam, B., Cheung, E., Sitti, M.

Journal of Applied Physics, 109(11):114702, AIP, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
The effect of aspect ratio on adhesion and stiffness for soft elastic fibres

Aksak, B., Hui, C., Sitti, M.

Journal of The Royal Society Interface, 8(61):1166-1175, The Royal Society, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Enhancing adhesion of biologically inspired polymer microfibers with a viscous oil coating

Cheung, E., Sitti, M.

The Journal of Adhesion, 87(6):547-557, Taylor & Francis Group, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Piezoelectric polymer fiber arrays for tactile sensing applications

Sümer, B., Aksak, B., Şsahin, K., Chuengsatiansup, K., Sitti, M.

Sensor Letters, 9(2):457-463, American Scientific Publishers, 2011 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Control methodologies for a heterogeneous group of untethered magnetic micro-robots

Floyd, S., Diller, E., Pawashe, C., Sitti, M.

The International Journal of Robotics Research, 30(13):1553-1565, SAGE Publications, 2011 (article)

pi

[BibTex]

[BibTex]

2008


no image
Learning to control in operational space

Peters, J., Schaal, S.

International Journal of Robotics Research, 27, pages: 197-212, 2008, clmc (article)

Abstract
One of the most general frameworks for phrasing control problems for complex, redundant robots is operational space control. However, while this framework is of essential importance for robotics and well-understood from an analytical point of view, it can be prohibitively hard to achieve accurate control in face of modeling errors, which are inevitable in com- plex robots, e.g., humanoid robots. In this paper, we suggest a learning approach for opertional space control as a direct inverse model learning problem. A first important insight for this paper is that a physically cor- rect solution to the inverse problem with redundant degrees-of-freedom does exist when learning of the inverse map is performed in a suitable piecewise linear way. The second crucial component for our work is based on the insight that many operational space controllers can be understood in terms of a constrained optimal control problem. The cost function as- sociated with this optimal control problem allows us to formulate a learn- ing algorithm that automatically synthesizes a globally consistent desired resolution of redundancy while learning the operational space controller. From the machine learning point of view, this learning problem corre- sponds to a reinforcement learning problem that maximizes an immediate reward. We employ an expectation-maximization policy search algorithm in order to solve this problem. Evaluations on a three degrees of freedom robot arm are used to illustrate the suggested approach. The applica- tion to a physically realistic simulator of the anthropomorphic SARCOS Master arm demonstrates feasibility for complex high degree-of-freedom robots. We also show that the proposed method works in the setting of learning resolved motion rate control on real, physical Mitsubishi PA-10 medical robotics arm.

am ei

link (url) DOI [BibTex]

2008


link (url) DOI [BibTex]


no image
ENHANCED ADHESION OF PDMS SURFACES FUNCTIONALIZED BY POLY (n-BUTYL ACRYLATE) BRUSHES INSPIRED BY GECKO FOOT HAIRS

Nese, A., Lee, H., Dong, H., Aksak, B., Cusick, B., Kowalewski, T., Matyjaszewski, K., Sitti, M.

Polymer Preprints, 49(2):107, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Design and development of the lifting and propulsion mechanism for a biologically inspired water runner robot

Floyd, S., Sitti, M.

IEEE transactions on robotics, 24(3):698-709, IEEE, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Control of Cell Behavior by Aligned Micro/Nanofibrous Biomaterial Scaffolds Fabricated by Spinneret-Based Tunable Engineered Parameters (STEP) Technique

Nain, A. S., Phillippi, J. A., Sitti, M., MacKrell, J., Campbell, P. G., Amon, C.

Small, 4(8):1153-1159, Wiley Online Library, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Adaptation to a sub-optimal desired trajectory

M. Mistry, E. A. G. L. T. Y. S. S. M. K.

Advances in Computational Motor Control VII, Symposium at the Society for Neuroscience Meeting, Washington DC, 2008, 2008, clmc (article)

am

PDF [BibTex]

PDF [BibTex]


no image
Rolling and spinning friction characterization of fine particles using lateral force microscopy based contact pushing

Sümer, B., Sitti, M.

Journal of Adhesion Science and Technology, 22(5-6):481-506, Taylor & Francis Group, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Modeling the soft backing layer thickness effect on adhesion of elastic microfiber arrays

Long, R., Hui, C., Kim, S., Sitti, M.

Journal of Applied Physics, 104(4):044301, AIP, 2008 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Cross-talk compensation in atomic force microscopy

Onal, C. D., Sümer, B., Sitti, M.

Review of scientific instruments, 79(10):103706, AIP, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Operational space control: A theoretical and emprical comparison

Nakanishi, J., Cory, R., Mistry, M., Peters, J., Schaal, S.

International Journal of Robotics Research, 27(6):737-757, 2008, clmc (article)

Abstract
Dexterous manipulation with a highly redundant movement system is one of the hallmarks of hu- man motor skills. From numerous behavioral studies, there is strong evidence that humans employ compliant task space control, i.e., they focus control only on task variables while keeping redundant degrees-of-freedom as compliant as possible. This strategy is robust towards unknown disturbances and simultaneously safe for the operator and the environment. The theory of operational space con- trol in robotics aims to achieve similar performance properties. However, despite various compelling theoretical lines of research, advanced operational space control is hardly found in actual robotics imple- mentations, in particular new kinds of robots like humanoids and service robots, which would strongly profit from compliant dexterous manipulation. To analyze the pros and cons of different approaches to operational space control, this paper focuses on a theoretical and empirical evaluation of different methods that have been suggested in the literature, but also some new variants of operational space controllers. We address formulations at the velocity, acceleration and force levels. First, we formulate all controllers in a common notational framework, including quaternion-based orientation control, and discuss some of their theoretical properties. Second, we present experimental comparisons of these approaches on a seven-degree-of-freedom anthropomorphic robot arm with several benchmark tasks. As an aside, we also introduce a novel parameter estimation algorithm for rigid body dynamics, which ensures physical consistency, as this issue was crucial for our successful robot implementations. Our extensive empirical results demonstrate that one of the simplified acceleration-based approaches can be advantageous in terms of task performance, ease of parameter tuning, and general robustness and compliance in face of inevitable modeling errors.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Adhesion of biologically inspired oil-coated polymer micropillars

Cheung, E., Sitti, M.

Journal of Adhesion Science and Technology, 22(5-6):569-589, Taylor & Francis Group, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Vision-based feedback strategy for controlled pushing of microparticles

Lynch, N. A., Onal, C. D., Schuster, E., Sitti, M.

Journal of Micro-Nano Mechatronics, 4(1-2):73-83, Springer-Verlag, 2008 (article)

pi

[BibTex]

[BibTex]


no image
Effect of quantity and configuration of attached bacteria on bacterial propulsion of microbeads

Behkam, B., Sitti, M.

Applied Physics Letters, 93(22):223901, AIP, 2008 (article)

pi

[BibTex]

[BibTex]


no image
A library for locally weighted projection regression

Klanke, S., Vijayakumar, S., Schaal, S.

Journal of Machine Learning Research, 9, pages: 623-626, 2008, clmc (article)

Abstract
In this paper we introduce an improved implementation of locally weighted projection regression (LWPR), a supervised learning algorithm that is capable of handling high-dimensional input data. As the key features, our code supports multi-threading, is available for multiple platforms, and provides wrappers for several programming languages.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Preface to the Journal of Micro-Nano Mechatronics

Dario, P., Fukuda, T., Sitti, M.

Journal of Micro-Nano Mechatronics, 4(1-2):1-1, Springer-Verlag, 2008 (article)

pi

[BibTex]

[BibTex]


no image
A legged anchoring mechanism for capsule endoscopes using micropatterned adhesives

Glass, P., Cheung, E., Sitti, M.

IEEE Transactions on Biomedical Engineering, 55(12):2759-2767, IEEE, 2008 (article)

pi

Project Page [BibTex]

Project Page [BibTex]


no image
Optimization strategies in human reinforcement learning

Hoffmann, H., Theodorou, E., Schaal, S.

Advances in Computational Motor Control VII, Symposium at the Society for Neuroscience Meeting, Washington DC, 2008, 2008, clmc (article)

am

PDF [BibTex]

PDF [BibTex]


no image
Dynamic modeling of stick slip motion in an untethered magnetic microrobot

Pawashe, C., Floyd, S., Sitti, M.

Proceedings of Robotics: Science and Systems IV, Zurich, Switzerland, 2008 (article)

pi

[BibTex]

[BibTex]

2005


no image
Composite adaptive control with locally weighted statistical learning

Nakanishi, J., Farrell, J. A., Schaal, S.

Neural Networks, 18(1):71-90, January 2005, clmc (article)

Abstract
This paper introduces a provably stable learning adaptive control framework with statistical learning. The proposed algorithm employs nonlinear function approximation with automatic growth of the learning network according to the nonlinearities and the working domain of the control system. The unknown function in the dynamical system is approximated by piecewise linear models using a nonparametric regression technique. Local models are allocated as necessary and their parameters are optimized on-line. Inspired by composite adaptive control methods, the proposed learning adaptive control algorithm uses both the tracking error and the estimation error to update the parameters. We first discuss statistical learning of nonlinear functions, and motivate our choice of the locally weighted learning framework. Second, we begin with a class of first order SISO systems for theoretical development of our learning adaptive control framework, and present a stability proof including a parameter projection method that is needed to avoid potential singularities during adaptation. Then, we generalize our adaptive controller to higher order SISO systems, and discuss further extension to MIMO problems. Finally, we evaluate our theoretical control framework in numerical simulations to illustrate the effectiveness of the proposed learning adaptive controller for rapid convergence and high accuracy of control.

am

link (url) [BibTex]

2005


link (url) [BibTex]


no image
A model of smooth pursuit based on learning of the target dynamics using only retinal signals

Shibata, T., Tabata, H., Schaal, S., Kawato, M.

Neural Networks, 18, pages: 213-225, 2005, clmc (article)

Abstract
While the predictive nature of the primate smooth pursuit system has been evident through several behavioural and neurophysiological experiments, few models have attempted to explain these results comprehensively. The model we propose in this paper in line with previous models employing optimal control theory; however, we hypothesize two new issues: (1) the medical superior temporal (MST) area in the cerebral cortex implements a recurrent neural network (RNN) in order to predict the current or future target velocity, and (2) a forward model of the target motion is acquired by on-line learning. We use stimulation studies to demonstrate how our new model supports these hypotheses.

am

link (url) [BibTex]

link (url) [BibTex]


no image
Parametric and Non-Parametric approaches for nonlinear tracking of moving objects

Hidaka, Y, Theodorou, E.

Technical Report-2005-1, 2005, clmc (article)

am

PDF [BibTex]

PDF [BibTex]

1992


no image
Ins CAD integrierte Kostenkalkulation (CAD-Integrated Cost Calculation)

Ehrlenspiel, K., Schaal, S.

Konstruktion 44, 12, pages: 407-414, 1992, clmc (article)

am

[BibTex]

1992


[BibTex]