| |
Greenwood, R. & Thesmar, D., 2009. Stock Price Fragility.
Abstract: We investigate the relationship between ownership structure of financial assets and nonfundamental risk. We define an asset to be fragile if it susceptible to non-fundamental trading shocks. An asset can be fragile because of concentrated ownership, or because its owners face correlated liquidity shocks, ie., they must buy or sell at the same time. Two assets are “cofragile” if their owners have correlated trading needs, even if the holdings of these owners do not directly overlap. We formalize this idea and apply it to the ownership of US stocks between 1990 and 2007. Consistent with our predictions, fragility strongly predicts future price volatility, and co-fragility predicts cross-stock return comovement.
Keywords: FinancialRatios
|
|
Mirowski, P. & LeCun, Y., 2009. Dynamic Factor Graphs for Time Series Modeling.
Abstract: This article presents a method for training Dynamic Fac- tor Graphs (DFG) with continuous latent state variables. A DFG in- cludes factors modeling joint probabilities between hidden and observed variables, and factors modeling dynamical constraints on hidden vari- ables. The DFG assigns a scalar energy to each configuration of hidden and observed variables. A gradient-based inference procedure finds the minimum-energy state sequence for a given observation sequence. Be- cause the factors are designed to ensure a constant partition function, they can be trained by minimizing the expected energy over training sequences with respect to the factors’ parameters. These alternated in- ference and parameter updates can be seen as a deterministic EM-like procedure. Using smoothing regularizers, DFGs are shown to reconstruct chaotic attractors and to separate a mixture of independent oscillatory sources perfectly. DFGs outperform the best known algorithm on the CATS competition benchmark for time series prediction. DFGs also suc- cessfully reconstruct missing motion capture data.
Keywords: Bayesian
|
|
Hoyle, D.C., 2008. Automatic PCA Dimension Selection for High Dimensional Data and Small Sample Sizes, Journal of Machine Learning Research, 9, p. 2733–2759.
Abstract: Bayesian inference from high-dimensional data involves the integration over a large number of model parameters. Accurate evaluation of such high-dimensional integrals raises a unique set of issues. These issues are illustrated using the exemplar of model selection for principal component analysis (PCA). A Bayesian model selection criterion, based on a Laplace approximation to the model evidence for determining the number of signal principal components present in a data set, has previously been show to perform well on various test data sets. Using simulated data we show that for d-dimensional data and small sample sizes, N, the accuracy of this model selection method is strongly affected by increasing values of d. By taking proper account of the contribution to the evidence from the large number of model parameters we show that model selection accuracy is substantially improved. The accuracy of the improved model evidence is studied in the asymptotic limit d ! ¥ at fixed ratio a = N=d, with a < 1. In this limit, model selection based upon the improved model evidence agrees with a frequentist hypothesis testing approach.
Keywords: DimReduction
|
|
Dalalyan, A.S., Juditsky, A. & Spokoiny, V., 2008. A New Algorithm for Estimating the Effective Dimension-Reduction Subspace, Journal of Machine Learning Research, 9, p. 1647–1678.
Abstract: The statistical problem of estimating the effective dimension-reduction (EDR) subspace in the multi-index regression model with deterministic design and additive noise is considered. A new procedure for recovering the directions of the EDR subspace is proposed. Many methods for estimating the EDR subspace perform principal component analysis on a family of vectors, say ˆb 1; : : : ;ˆbL, nearly lying in the EDR subspace. This is in particular the case for the structure-adaptive approach proposed by Hristache et al. (2001a). In the present work, we propose to estimate the projector onto the EDR subspace by the solution to the optimization problem minimize max `=1;:::;L ˆb >` (IA)ˆb` subject to A 2 Am ; where Am is the set of all symmetric matrices with eigenvalues in [0;1] and trace less than or equal to m, with m being the true structural dimension. Under mild assumptions, pn-consistency of the proposed procedure is proved (up to a logarithmic factor) in the case when the structural dimension is not larger than 4. Moreover, the stochastic error of the estimator of the projector onto the EDR subspace is shown to depend on L logarithmically. This enables us to use a large number of vectors ˆb ` for estimating the EDR subspace. The empirical behavior of the algorithm is studied through numerical simulations.
Keywords: DimReduction
|
|
Krupka, E., Navot, A. & Tishby, N., 2008. Learning to Select Features using their Properties, Journal of Machine Learning Research, 9, p. 2349–2376.
Abstract: Feature selection is the task of choosing a small subset of features that is sufficient to predict the target labels well. Here, instead of trying to directly determine which features are better, we attempt to learn the properties of good features. For this purpose we assume that each feature is represented by a set of properties, referred to as meta-features. This approach enables prediction of the quality of features without measuring their value on the training instances. We use this ability to devise new selection algorithms that can efficiently search for new good features in the presence of a huge number of features, and to dramatically reduce the number of feature measurements needed. We demonstrate our algorithms on a handwritten digit recognition problem and a visual object category recognition problem. In addition, we show how this novel viewpoint enables derivation of better generalization bounds for the joint learning problem of selection and classification, and how it contributes to a better understanding of the problem. Specifically, in the context of object recognition, previous works showed that it is possible to find one set of features which fits most object categories (aka a universal dictionary). Here we use our framework to analyze one such universal dictionary and find that the quality of features in this dictionary can be predicted accurately by its meta-features.
Keywords: DimReduction
|
|
|