|
|
2008. Mathematics for Anaylsis of Petascale Data. ASCR, Office of Science, Department of Enegy.
Keywords: DataMiningGeneral; LargeScaleLearning
|
|
|
|
Arcioni, G., 2008. Using self-similarity and renormalization group to analyze time series.
Abstract: An algorithm based on Renormalization Group (RG) to analyze time series forecasting was proposed in cond-mat/0110285. In this paper we explicitly code and test it. We choose in particular some financial time series (stocks, indexes and commodities) with daily data and compute one step ahead forecasts. We then construct some indicators to evaluate performances. The algorithm is supposed to prescribe the future development of the time series by using the selfsimilarity property intrinsically present in RG approach. This property could be potentially very attractive for the purpose of building winning trading systems. We discuss some relevant points along this direction. Although current performances have to be improved the algorithm seems quite reactive to various combinations of input parameters and different past values sequences. This makes it a potentially good candidate to detect sharp market movements. We finally mention current drawbacks and sketch how to improve them.
Keywords: DataMiningGeneral; TechnicalAnalysis
|
|
|
|
Arlot, S. & Celisse, A., 2009. A survey of cross-validation procedures for model selection, 907.4728.
Abstract: Used to estimate the risk of an estimator or to perform model selection, cross-validation is a widespread strategy because of its simplicity and its apparent universality. Many results exist on the model selection performances of cross-validation procedures. This survey intends to relate these results to the most recent advances of model selection theory, with a particular emphasis on distinguishing empirical statements from rigorous theoretical results. As a conclusion, guidelines are provided for choosing the best cross-validation procedure according to the particular features of the problem in hand.
Keywords: DataMiningGeneral; Ensembles
|
|
|
|
Barnston, A., 1992. Correspondence among the Correlation, RMSE, and Heidke Forecast Varification Measures; Refinement of the Heidke Score,.
Keywords: DataMiningGeneral
|
|
|
|
Bengio, Y. & LeCun, Y., 2007. Scalaing Learning Algorithms towards AI.
Abstract: One long-term goal of machine learning research is to produce methods that are applicable to highly complex tasks, such as perception (vision, audition), reasoning, intelligent control, and other artificially intelligent behaviors. We argue that in order to progress toward this goal, the Machine Learning community must endeavor to discover algorithms that can learn highly complex functions, with minimal need for prior knowledge, and with minimal human intervention. We present mathematical and empirical evidence suggesting that many popular approaches to non-parametric learning, particularly kernel methods, are fundamentally limited in their ability to learn complex high-dimensional functions. Our analysis focuses on two problems. First, kernel machines are shallow architectures, in which one large layer of simple template matchers is followed by a single layer of trainable coefficients. We argue that shallow architectures can be very inefficient in terms of required number of computational elements and examples. Second, we analyze a limitation of kernel machines with a local kernel, linked to the curse of dimensionality, that applies to supervised, unsupervised (manifold learning) and semi-supervised kernel machines. Using empirical results on invariant image recognition tasks, kernel methods are compared with deep architectures, in which lower-level features or concepts are progressively combined into more abstract and higher-level representations. We argue that deep architectures have the potential to generalize in non-local ways, i.e., beyond immediate neighbors, and that this is crucial in order to make progress on the kind of complex tasks required for artificial intelligence.
Keywords: DataMiningGeneral; LargeScaleLearning
|
|
|
|
Blum, A.. On-Line Algorithms in Machine Learning.
Keywords: DataMiningGeneral
|
|
|
|
Bohm, C., Haegler, K., Muller, N.S. & Plant, C., 2009. CoCo: Coding Cost For Parameter-Free Outlier Detection. Paris, France: ACM Press.
Abstract: How can we automatically spot all outstanding observations in a data set? This question arises in a large variety of applications, e.g. in economy, biology and medicine. Existing approaches to outlier detection suer from one or more of the following drawbacks: The results of many methods strongly depend on suitable parameter settings being very dicult to estimate without background knowledge on the data, e.g. the minimum cluster size or the number of desired outliers. Many methods implicitly assume Gaussian or uniformly distributed data, and/or their result is di- cult to interpret. To cope with these problems, we propose CoCo, a technique for parameter-free outlier detection. The basic idea of our technique relates outlier detection to data compression: Outliers are objects which can not be eectively compressed given the data set. To avoid the assumption of a certain data distribution, CoCo relies on a very general data model combining the Exponential Power Distribution with Independent Components. We dene an intuitive outlier factor based on the principle of the Minimum Description Length together with an novel algorithm for outlier detection. An extensive experimental evaluation on synthetic and real world data demonstrates the benets of our technique.
Keywords: DataMiningGeneral
|
|
|
|
Bromberg, F. & Margaritis, D., 2009. Improving the Reliability of Causal Discovery from Small Data Sets Using Argumentation, Journal of Machine Learning Research, 10, p. 301–340.
Abstract: We address the problem of improving the reliability of independence-based causal discovery algorithms that results from the execution of statistical independence tests on small data sets, which typically have low reliability. We model the problem as a knowledge base containing a set of independence facts that are related through Pearl’s well-known axioms. Statistical tests on finite data sets may result in errors in these tests and inconsistencies in the knowledge base. We resolve these inconsistencies through the use of an instance of the class of defeasible logics called argumentation, augmented with a preference function, that is used to reason about and possibly correct errors in these tests. This results in a more robust conditional independence test, called an argumentative independence test. Our experimental evaluation shows clear positive improvements in the accuracy of argumentative over purely statistical tests. We also demonstrate significant improvements on the accuracy of causal structure discovery from the outcomes of independence tests both on sampled data from randomly generated causal models and on real-world data sets.
Keywords: DataMiningGeneral
|
|