2011


Applications of AFM Based Nanorobotic Systems

Xie, H., Onal, C., Régnier, S., Sitti, M.

In Atomic Force Microscopy Based Nanorobotics, pages: 313-342, Springer Berlin Heidelberg, 2011 (incollection)

pi

[BibTex]

Modeling of stochastic motion of bacteria propelled spherical microbeads

Arabagi, V., Behkam, B., Cheung, E., Sitti, M.

Journal of Applied Physics, 109(11):114702, AIP, 2011 (article)

pi

Project Page [BibTex]

The effect of aspect ratio on adhesion and stiffness for soft elastic fibres

Aksak, B., Hui, C., Sitti, M.

Journal of The Royal Society Interface, 8(61):1166-1175, The Royal Society, 2011 (article)

pi

Project Page [BibTex]

Steerable random fields for image restoration and inpainting

Roth, S., Black, M. J.

In Markov Random Fields for Vision and Image Processing, pages: 377-387, (Editors: Blake, A. and Kohli, P. and Rother, C.), MIT Press, 2011 (incollection)

Abstract
This chapter introduces the concept of a Steerable Random Field (SRF). In contrast to traditional Markov random field (MRF) models in low-level vision, the random field potentials of a SRF are defined in terms of filter responses that are steered to the local image structure. This steering uses the structure tensor to obtain derivative responses that are either aligned with, or orthogonal to, the predominant local image structure. Analysis of the statistics of these steered filter responses in natural images leads to the model proposed here. Clique potentials are defined over steered filter responses using a Gaussian scale mixture model and are learned from training data. The SRF model connects random fields with anisotropic regularization and provides a statistical motivation for the latter. Steering the random field to the local image structure improves image denoising and inpainting performance compared with traditional pairwise MRFs.

ps

publisher site [BibTex]

Large hidden orbital moments in magnetite

Goering, E.

Physica Status Solidi B, 248(10):2345-2351, 2011 (article)

mms

DOI [BibTex]

Cr magnetization reversal at the CrO2/RuO2 interface: Origin of the reduced GMR effect

Zafar, K., Audehm, P., Schütz, G., Goering, E., Pathak, M., Chetry, K. B., LeClair, P. R., Gupta, A.

Physical Review B, 84, 2011 (article)

mms

DOI [BibTex]

Magnetocaloric effect, magnetic domain structure and spin-reorientation transitions in HoCo5 single crystals

Skokov, K. P., Pastushenkov, Y. G., Koshkid'ko, Y. S., Schütz, G., Goll, D., Ivanova, T. I., Nikitin, S. A., Semenova, E. M., Petrenko, A. V.

Journal of Magnetism and Magnetic Materials, 323(5):447-450, 2011 (article)

mms

DOI [BibTex]

Elucidating gating effects for hydrogen sorption in MFU-4-type triazolate-based metal-organic frameworks featuring different pore sizes

Denysenko, D., Grzywa, M., Tonigold, M., Streppel, B., Krkljus, I., Hirscher, M., Mugnaioli, E., Kolb, U., Hanss, J., Volkmer, D.

Chemistry - A European Journal, 17(6):1837-1848, 2011 (article)

mms

DOI [BibTex]

BET specific surface area and pore structure of MOFs determined by hydrogen adsorption at 20 K

Streppel, B., Hirscher, M.

Physical Chemistry Chemical Physics, 13(8):3220-3222, 2011 (article)

mms

DOI [BibTex]

High contrast magnetic and nonmagnetic sample current microscopy for bulk and transparent samples using soft X-rays

Nolle, D., Weigand, M., Schütz, G., Goering, E.

Microscopy and Microanalysis, 17, pages: 834-842, 2011 (article)

mms

DOI [BibTex]

Magnetic vortex core reversal by rotating magnetic fields generated on micrometer length scales

Curcic, M., Stoll, H., Weigand, M., Sackmann, V., Jüllig, P., Kammerer, M., Noske, M., Sproll, M., Van Waeyenberge, B., Vansteenkiste, A., Woltersdorf, G., Tyliszczak, T., Schütz, G.

Physica Status Solidi B, 248(10):2317-2322, 2011 (article)

mms

DOI [BibTex]

Nanomechanics of AFM based nanomanipulation

Xie, H., Onal, C., Régnier, S., Sitti, M.

In Atomic Force Microscopy Based Nanorobotics, pages: 87-143, Springer Berlin Heidelberg, 2011 (incollection)

pi

[BibTex]

Enhancing adhesion of biologically inspired polymer microfibers with a viscous oil coating

Cheung, E., Sitti, M.

The Journal of Adhesion, 87(6):547-557, Taylor & Francis Group, 2011 (article)

pi

Project Page [BibTex]

Formation of two amorphous phases in the Ni60Nb18Y22 alloy after high pressure torsion

Straumal, B. B., Mazilkin, A. A., Protasova, S. G., Goll, D., Baretzky, B., Bakai, A. S., Dobatkin, S. V.

Kovove Materialy-Metallic Materials, 49(1):17-22, 2011 (article)

mms

link (url) [BibTex]

Structure and properties of nanograined Fe-C alloys after severe plastic deformation

Straumal, B. B., Dobatkin, S. V., Rodin, A. O., Protasova, S. G., Mazilkin, A. A., Goll, D., Baretzky, B.

Advanced Engineering Materials, 13(6):463-469, 2011 (article)

mms

DOI [BibTex]

Increased flux pinning in YBa2Cu3O7-δ thin-film devices through embedding of Au nano crystals

Katzer, C., Schmidt, M., Michalowski, P., Kuhwald, D., Schmidl, F., Grosse, V., Treiber, S., Stahl, C., Albrecht, J., Hübner, U., Undisz, A., Rettenmayr, M., Schütz, G., Seidel, P.

Europhysics Letters, 95(6), 2011 (article)

mms

DOI [BibTex]

Signal transfer in a chain of stray-field coupled ferromagnetic squares

Vogel, A., Martens, M., Weigand, M., Meier, G.

Applied Physics Letters, 99, 2011 (article)

mms

DOI [BibTex]

Electron theory of magnetoelectric effects in metallic ferromagnetic nanostructures

Subkow, S., Fähnle, M.

Physical Review B, 84, 2011 (article)

mms

DOI [BibTex]

Magnetic antivortex-core reversal by rotating magnetic fields

Kamionka, T., Martens, M., Chou, K., Drews, A., Tyliszczak, T., Stoll, H., Van Waeyenberge, B., Meier, G.

Physical Review B, 83, 2011 (article)

mms

DOI [BibTex]

Magnetic properties of exchange-spring composite films

Kronmüller, H., Goll, D.

Physica Status Solidi B, 248(10):2361-2367, 2011 (article)

mms

DOI [BibTex]

Wetting transition of grain boundaries in the Sn-rich part of the Sn-Bi phase diagram

Yeh, C.-H., Chang, L.-S., Straumal, B. B.

Journal of Materials Science, 46(5):1557-1562, 2011 (article)

mms

DOI [BibTex]

Instrumentation Issues of an AFM Based Nanorobotic System

Xie, H., Onal, C., Régnier, S., Sitti, M.

In Atomic Force Microscopy Based Nanorobotics, pages: 31-86, Springer Berlin Heidelberg, 2011 (incollection)

pi

[BibTex]

Piezoelectric polymer fiber arrays for tactile sensing applications

Sümer, B., Aksak, B., Şahin, K., Chuengsatiansup, K., Sitti, M.

Sensor Letters, 9(2):457-463, American Scientific Publishers, 2011 (article)

pi

Project Page [BibTex]

Control methodologies for a heterogeneous group of untethered magnetic micro-robots

Floyd, S., Diller, E., Pawashe, C., Sitti, M.

The International Journal of Robotics Research, 30(13):1553-1565, SAGE Publications, 2011 (article)

pi

[BibTex]

Projected Newton-type methods in machine learning

Schmidt, M., Kim, D., Sra, S.

In Optimization for Machine Learning, pages: 305-330, MIT Press, Cambridge, MA, USA, 2011 (incollection)

Abstract
We consider projected Newton-type methods for solving large-scale optimization problems arising in machine learning and related fields. We first introduce an algorithmic framework for projected Newton-type methods by reviewing a canonical projected (quasi-)Newton method. This method, while conceptually pleasing, has a high computation cost per iteration. Thus, we discuss two variants that are more scalable, namely, two-metric projection and inexact projection methods. Finally, we show how to apply the Newton-type framework to handle non-smooth objectives. Examples are provided throughout the chapter to illustrate machine learning applications of our framework.

mms

link (url) [BibTex]

Influence of dot size and annealing on the magnetic properties of large-area L10-FePt nanopatterns

Bublat, T., Goll, D.

Journal of Applied Physics, 110(7), 2011 (article)

mms

DOI [BibTex]


The temperature-dependent magnetization profile across an epitaxial bilayer of ferromagnetic La2/3Ca1/3MnO3 and superconducting YBa2Cu3O7-δ

Brück, S., Treiber, S., Macke, S., Audehm, P., Christiani, G., Soltan, S., Habermeier, H., Goering, E., Albrecht, J.

New Journal of Physics, 13(3), 2011 (article)

mms

DOI [BibTex]

Spin interactions in bcc and fcc Fe beyond the Heisenberg model

Singer, R., Dietermann, F., Fähnle, M.

Physical Review Letters, 107, 2011 (article)

mms

DOI [BibTex]

Route to a family of robust, non-interpenetrated metal-organic frameworks with pto-like topology

Klein, N., Senkovska, I., Baburin, I. A., Grünker, R., Stoeck, U., Schlichtenmayer, M., Streppel, B., Mueller, U., Leoni, S., Hirscher, M., Kaskel, S.

Chemistry - A European Journal, 17(46):13007-13016, 2011 (article)

mms

DOI [BibTex]

Initial stages of growth of iron on silicon for spin injection through Schottky barrier

Dash, S. P., Carstanjen, H. D.

Physica Status Solidi B, 248(10):2300-2304, 2011 (article)

mms

DOI [BibTex]

Fe3O4/ZnO: A high-quality magnetic oxide-semiconductor heterostructure by reactive deposition

Paul, M., Kufer, D., Müller, A., Brück, S., Goering, E., Kamp, M., Verbeeck, J., Tian, H., Van Tendeloo, G., Ingle, N. J. C., Sing, M., Claessen, R.

Applied Physics Letters, 98, 2011 (article)

mms

DOI [BibTex]

Influence of texture on the ferromagnetic properties of nanograined ZnO films

Straumal, B., Mazilkin, A., Protasova, S., Myatiev, A., Straumal, P., Goering, E., Baretzky, B.

Physica Status Solidi B, 248(7):1581-1586, 2011 (article)

mms

DOI [BibTex]

Control of spin configuration in half-metallic La0.7Sr0.3MnO3 nano-structures

Rhensius, J., Vaz, C. A. F., Bisig, A., Schweitzer, S., Heidler, J., Körner, H. S., Locatelli, A., Niño, M. A., Weigand, M., Méchin, L., Gaucher, F., Goering, E., Heyderman, L. J., Kläui, M.

Applied Physics Letters, 99(6), 2011 (article)

mms

DOI [BibTex]

Comparison of various sol-gel derived metal oxide layers for inverted organic solar cells

Oh, H., Krantz, J., Litzov, I., Stubhan, T., Pinna, L., Brabec, C. J.

Solar Energy Materials & Solar Cells, 95(8):2194-2199, 2011 (article)

mms

DOI [BibTex]


2005


Kernel Methods for Measuring Independence

Gretton, A., Herbrich, R., Smola, A., Bousquet, O., Schölkopf, B.

Journal of Machine Learning Research, 6, pages: 2075-2129, December 2005 (article)

Abstract
We introduce two new functionals, the constrained covariance and the kernel mutual information, to measure the degree of independence of random variables. These quantities are both based on the covariance between functions of the random variables in reproducing kernel Hilbert spaces (RKHSs). We prove that when the RKHSs are universal, both functionals are zero if and only if the random variables are pairwise independent. We also show that the kernel mutual information is an upper bound near independence on the Parzen window estimate of the mutual information. Analogous results apply for two correlation-based dependence functionals introduced earlier: we show the kernel canonical correlation and the kernel generalised variance to be independence measures for universal kernels, and prove the latter to be an upper bound on the mutual information near independence. The performance of the kernel dependence functionals in measuring independence is verified in the context of independent component analysis.

ei

PDF PostScript PDF [BibTex]

A Unifying View of Sparse Approximate Gaussian Process Regression

Quinonero Candela, J., Rasmussen, C.

Journal of Machine Learning Research, 6, pages: 1935-1959, December 2005 (article)

Abstract
We provide a new unifying view, including all existing proper probabilistic sparse approximations for Gaussian process regression. Our approach relies on expressing the effective prior which the methods are using. This allows new insights to be gained, and highlights the relationship between existing methods. It also allows for a clear theoretically justified ranking of the closeness of the known approximations to the corresponding full GPs. Finally we point directly to designs of new better sparse approximations, combining the best of the existing strategies, within attractive computational constraints.

ei

PDF [BibTex]

Maximal Margin Classification for Metric Spaces

Hein, M., Bousquet, O., Schölkopf, B.

Journal of Computer and System Sciences, 71(3):333-359, October 2005 (article)

Abstract
In order to apply the maximum margin method in arbitrary metric spaces, we suggest embedding the metric space into a Banach or Hilbert space and performing linear classification in this space. We propose several embeddings and recall that an isometric embedding in a Banach space is always possible while an isometric embedding in a Hilbert space is only possible for certain metric spaces. As a result, we obtain a general maximum margin classification algorithm for arbitrary metric spaces (whose solution is approximated by an algorithm of Graepel). Interestingly enough, the embedding approach, when applied to a metric which can be embedded into a Hilbert space, yields the SVM algorithm, which emphasizes the fact that its solution depends on the metric and not on the kernel. Furthermore, we give upper bounds on the capacity of the function classes corresponding to both embeddings in terms of Rademacher averages. Finally, we compare the capacities of these function classes directly.

ei

PDF PDF DOI [BibTex]

Selective integration of multiple biological data for supervised network inference

Kato, T., Tsuda, K., Asai, K.

Bioinformatics, 21(10):2488, October 2005 (article)

ei

PDF [BibTex]

Assessing Approximate Inference for Binary Gaussian Process Classification

Kuss, M., Rasmussen, C.

Journal of Machine Learning Research, 6, pages: 1679, October 2005 (article)

Abstract
Gaussian process priors can be used to define flexible, probabilistic classification models. Unfortunately, exact Bayesian inference is analytically intractable and various approximation techniques have been proposed. In this work we review and compare Laplace's method and Expectation Propagation for approximate Bayesian inference in the binary Gaussian process classification model. We present a comprehensive comparison of the approximations, their predictive performance and marginal likelihood estimates to results obtained by MCMC sampling. We explain theoretically and corroborate empirically the advantages of Expectation Propagation compared to Laplace's method.

ei

PDF PDF [BibTex]

Clustering on the Unit Hypersphere using von Mises-Fisher Distributions

Banerjee, A., Dhillon, I., Ghosh, J., Sra, S.

Journal of Machine Learning Research, 6, pages: 1345-1382, September 2005 (article)

Abstract
Several large scale data mining applications, such as text categorization and gene expression analysis, involve high-dimensional data that is also inherently directional in nature. Often such data is L2 normalized so that it lies on the surface of a unit hypersphere. Popular models such as (mixtures of) multi-variate Gaussians are inadequate for characterizing such data. This paper proposes a generative mixture-model approach to clustering directional data based on the von Mises-Fisher (vMF) distribution, which arises naturally for data distributed on the unit hypersphere. In particular, we derive and analyze two variants of the Expectation Maximization (EM) framework for estimating the mean and concentration parameters of this mixture. Numerical estimation of the concentration parameters is non-trivial in high dimensions since it involves functional inversion of ratios of Bessel functions. We also formulate two clustering algorithms corresponding to the variants of EM that we derive. Our approach provides a theoretical basis for the use of cosine similarity that has been widely employed by the information retrieval community, and obtains the spherical kmeans algorithm (kmeans with cosine similarity) as a special case of both variants. Empirical results on clustering of high-dimensional text and gene-expression data based on a mixture of vMF distributions show that the ability to estimate the concentration parameter for each vMF component, which is not present in existing approaches, yields superior results, especially for difficult clustering tasks in high-dimensional spaces.

ei

PDF [BibTex]

Support Vector Machines for 3D Shape Processing

Steinke, F., Schölkopf, B., Blanz, V.

Computer Graphics Forum, 24(3, EUROGRAPHICS 2005):285-294, September 2005 (article)

Abstract
We propose statistical learning methods for approximating implicit surfaces and computing dense 3D deformation fields. Our approach is based on Support Vector (SV) Machines, which are state of the art in machine learning. It is straightforward to implement and computationally competitive; its parameters can be automatically set using standard machine learning methods. The surface approximation is based on a modified Support Vector regression. We present applications to 3D head reconstruction, including automatic removal of outliers and hole filling. In a second step, we build on our SV representation to compute dense 3D deformation fields between two objects. The fields are computed using a generalized SV Machine enforcing correspondence between the previously learned implicit SV object representations, as well as correspondences between feature points if such points are available. We apply the method to the morphing of 3D heads and other objects.

ei

PDF [BibTex]

Fast Protein Classification with Multiple Networks

Tsuda, K., Shin, H., Schölkopf, B.

Bioinformatics, 21(Suppl. 2):59-65, September 2005 (article)

Abstract
Support vector machines (SVM) have been successfully used to classify proteins into functional categories. Recently, to integrate multiple data sources, a semidefinite programming (SDP) based SVM method was introduced (Lanckriet et al., 2004). In SDP/SVM, multiple kernel matrices corresponding to each of the data sources are combined with weights obtained by solving an SDP. However, when trying to apply SDP/SVM to large problems, the computational cost can become prohibitive, since both converting the data to a kernel matrix for the SVM and solving the SDP are time and memory demanding. Another application-specific drawback arises when some of the data sources are protein networks. A common method of converting the network to a kernel matrix is the diffusion kernel method, which has time complexity of O(n^3), and produces a dense matrix of size n x n. We propose an efficient method of protein classification using multiple protein networks. Available protein networks, such as a physical interaction network or a metabolic network, can be directly incorporated. Vectorial data can also be incorporated after conversion into a network by means of neighbor point connection. Similarly to the SDP/SVM method, the combination weights are obtained by convex optimization. Due to the sparsity of network edges, the computation time is nearly linear in the number of edges of the combined network. Additionally, the combination weights provide information useful for discarding noisy or irrelevant networks. Experiments on function prediction of 3588 yeast proteins show promising results: the computation time is enormously reduced, while the accuracy is still comparable to the SDP/SVM method.

ei

PDF Web DOI [BibTex]

Iterative Kernel Principal Component Analysis for Image Modeling

Kim, K., Franz, M., Schölkopf, B.

IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(9):1351-1366, September 2005 (article)

Abstract
In recent years, Kernel Principal Component Analysis (KPCA) has been suggested for various image processing tasks requiring an image model such as, e.g., denoising or compression. The original form of KPCA, however, can only be applied to strongly restricted image classes due to the limited number of training examples that can be processed. We therefore propose a new iterative method for performing KPCA, the Kernel Hebbian Algorithm, which iteratively estimates the Kernel Principal Components with only linear order memory complexity. In our experiments, we compute models for complex image classes such as faces and natural images which require a large number of training examples. The resulting image models are tested in single-frame super-resolution and denoising applications. The KPCA model is not specifically tailored to these tasks; in fact, the same model can be used in super-resolution with variable input resolution, or denoising with unknown noise characteristics. In spite of this, both super-resolution and denoising performance are comparable to existing methods.

ei

Web DOI [BibTex]

Phenotypic characterization of chondrosarcoma-derived cell lines

Schorle, C., Finger, F., Zien, A., Block, J., Gebhard, P., Aigner, T.

Cancer Letters, 226(2):143-154, August 2005 (article)

Abstract
Gene expression profiling of three chondrosarcoma derived cell lines (AD, SM, 105KC) showed an increased proliferative activity and a reduced expression of chondrocytic-typical matrix products compared to primary chondrocytes. The incapability to maintain an adequate matrix synthesis as well as a notable proliferative activity at the same time is comparable to neoplastic chondrosarcoma cells in vivo which cease largely cartilage matrix formation as soon as their proliferative activity increases. Thus, the investigated cell lines are of limited value as substitute of primary chondrocytes but might have a much higher potential to investigate the behavior of neoplastic chondrocytes, i.e. chondrosarcoma biology.

ei

Web [BibTex]

Local Rademacher Complexities

Bartlett, P., Bousquet, O., Mendelson, S.

The Annals of Statistics, 33(4):1497-1537, August 2005 (article)

Abstract
We propose new bounds on the error of learning algorithms in terms of a data-dependent notion of complexity. The estimates we establish give optimal rates and are based on a local and empirical version of Rademacher averages, in the sense that the Rademacher averages are computed from the data, on a subset of functions with small empirical error. We present some applications to classification and prediction with convex function classes, and with kernel classes in particular.

ei

PDF PostScript Web [BibTex]

Learning the Kernel with Hyperkernels

Ong, CS., Smola, A., Williamson, R.

Journal of Machine Learning Research, 6, pages: 1043-1071, July 2005 (article)

Abstract
This paper addresses the problem of choosing a kernel suitable for estimation with a Support Vector Machine, hence further automating machine learning. This goal is achieved by defining a Reproducing Kernel Hilbert Space on the space of kernels itself. Such a formulation leads to a statistical estimation problem similar to the problem of minimizing a regularized risk functional. We state the equivalent representer theorem for the choice of kernels and present a semidefinite programming formulation of the resulting optimization problem. Several recipes for constructing hyperkernels are provided, as well as the details of common machine learning problems. Experimental results for classification, regression and novelty detection on UCI data show the feasibility of our approach.

ei

PDF [BibTex]

Image Reconstruction by Linear Programming

Tsuda, K., Rätsch, G.

IEEE Transactions on Image Processing, 14(6):737-744, June 2005 (article)

Abstract
One way of image denoising is to project a noisy image to the subspace of admissible images derived, for instance, by PCA. However, a major drawback of this method is that all pixels are updated by the projection, even when only a few pixels are corrupted by noise or occlusion. We propose a new method to identify the noisy pixels by l1-norm penalization and to update the identified pixels only. The identification and updating of noisy pixels are formulated as one linear program which can be efficiently solved. In particular, one can apply the ν-trick to directly specify the fraction of pixels to be reconstructed. Moreover, we extend the linear program to be able to exploit prior knowledge that occlusions often appear in contiguous blocks (e.g., sunglasses on faces). The basic idea is to penalize boundary points and interior points of the occluded area differently. We are also able to show the ν-property for this extended LP, leading to a method which is easy to use. Experimental results demonstrate the power of our approach.

ei

PDF DOI [BibTex]

RASE: recognition of alternatively spliced exons in C. elegans

Rätsch, G., Sonnenburg, S., Schölkopf, B.

Bioinformatics, 21(Suppl. 1):i369-i377, June 2005 (article)

ei

PDF Web DOI [BibTex]

Matrix Exponentiated Gradient Updates for On-line Learning and Bregman Projection

Tsuda, K., Rätsch, G., Warmuth, M.

Journal of Machine Learning Research, 6, pages: 995-1018, June 2005 (article)

Abstract
We address the problem of learning a symmetric positive definite matrix. The central issue is to design parameter updates that preserve positive definiteness. Our updates are motivated with the von Neumann divergence. Rather than treating the most general case, we focus on two key applications that exemplify our methods: on-line learning with a simple square loss, and finding a symmetric positive definite matrix subject to linear constraints. The updates generalize the exponentiated gradient (EG) update and AdaBoost, respectively: the parameter is now a symmetric positive definite matrix of trace one instead of a probability vector (which in this context is a diagonal positive definite matrix with trace one). The generalized updates use matrix logarithms and exponentials to preserve positive definiteness. Most importantly, we show how the derivation and the analyses of the original EG update and AdaBoost generalize to the non-diagonal case. We apply the resulting matrix exponentiated gradient (MEG) update and DefiniteBoost to the problem of learning a kernel matrix from distance measurements.

ei

PDF [BibTex]
