Pesquisa | Portal Regional da BVS

1.

A bootstrap evaluation of the effect of data splitting on financial time series.

LeBaron, B; Weigend, A S.

IEEE Trans Neural Netw ; 9(1): 213-20, 1998.

Artigo em Inglês | MEDLINE | ID: mdl-18252443

RESUMO

Exposes problems of the commonly used technique of splitting the available data into training, validation, and test sets that are held fixed, warns about drawing too strong conclusions from such static splits, and shows potential pitfalls of ignoring variability across splits. Using a bootstrap or resampling method, we compare the uncertainty in the solution stemming from the data splitting with neural-network specific uncertainties (parameter initialization, choice of number of hidden units, etc.). We present two results on data from the New York Stock Exchange. First, the variation due to different resamplings is significantly larger than the variation due to different network conditions. This result implies that it is important to not over-interpret a model (or an ensemble of models) estimated on one specific split of the data. Second, on each split, the neural-network solution with early stopping is very close to a linear model; no significant nonlinearities are extracted.

2.

Data mining in finance: introducing the special issue of IJNS.

Srivastava, A N; Weigend, A S.

Int J Neural Syst ; 8(4): 367-71, 1997 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-9730014

Assuntos

Economia , Modelos Econômicos , Redes Neurais de Computação

3.

Modeling volatility using state space models.

Timmer, J; Weigend, A S.

Int J Neural Syst ; 8(4): 385-98, 1997 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-9730016

RESUMO

In time series problems, noise can be divided into two categories: dynamic noise which drives the process, and observational noise which is added in the measurement process, but does not influence future values of the system. In this framework, we show that empirical volatilities (the squared relative returns of prices) exhibit a significant amount of observational noise. To model and predict their time evolution adequately, we estimate state space models that explicitly include observational noise. We obtain relaxation times for shocks in the logarithm of volatility ranging from three weeks (for foreign exchange) to three to five months (for stock indices). In most cases, a two-dimensional hidden state is required to yield residuals that are consistent with white noise. We compare these results with ordinary autoregressive models (without a hidden state) and find that autoregressive models underestimate the relaxation times by about two orders of magnitude since they do not distinguish between observational and dynamic noise. This new interpretation of the dynamics of volatility in terms of relaxators in a state space model carries over to stochastic volatility models and to GARCH models, and is useful for several problems in finance, including risk management and the pricing of derivative securities. Data sets used: Olsen & Associates high frequency DEM/USD foreign exchange rates (8 years). Nikkei 225 index (40 years). Dow Jones Industrial Average (25 years).

Assuntos

Inteligência Artificial , Modelos Econômicos , Redes Neurais de Computação , Modelos Lineares

4.

Nonlinear trading models through Sharpe Ratio maximization.

Choey, M; Weigend, A S.

Int J Neural Syst ; 8(4): 417-31, 1997 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-9730018

RESUMO

While many trading strategies are based on price prediction, traders in financial markets are typically interested in optimizing risk-adjusted performance such as the Sharpe Ratio, rather than the price predictions themselves. This paper introduces an approach which generates a nonlinear strategy that explicitly maximizes the Sharpe Ratio. It is expressed as a neural network model whose output is the position size between a risky and a risk-free asset. The iterative parameter update rules are derived and compared to alternative approaches. The resulting trading strategy is evaluated and analyzed on both computer-generated data and real world data (DAX, the daily German equity index). Trading based on Sharpe Ratio maximization compares favorably to both profit optimization and probability matching (through cross-entropy optimization). The results show that the goal of optimizing out-of-sample risk-adjusted profit can indeed be achieved with this nonlinear approach.

Assuntos

Inteligência Artificial , Modelos Econômicos , Redes Neurais de Computação , Dinâmica não Linear , Algoritmos , Interpretação Estatística de Dados

5.

A first application of independent component analysis to extracting structure from stock returns.

Back, A D; Weigend, A S.

Int J Neural Syst ; 8(4): 473-84, 1997 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-9730022

RESUMO

This paper explores the application of a signal processing technique known as independent component analysis (ICA) or blind source separation to multivariate financial time series such as a portfolio of stocks. The key idea of ICA is to linearly map the observed multivariate time series into a new space of statistically independent components (ICs). We apply ICA to three years of daily returns of the 28 largest Japanese stocks and compare the results with those obtained using principal component analysis. The results indicate that the estimated ICs fall into two categories, (i) infrequent large shocks (responsible for the major changes in the stock prices), and (ii) frequent smaller fluctuations (contributing little to the overall level of the stocks). We show that the overall stock price can be reconstructed surprisingly well by using a small number of thresholded weighted ICs. In contrast, when using shocks derived from principal components instead of independent components, the reconstructed price is less similar to the original one. ICA is shown to be a potentially powerful method of analyzing and understanding driving mechanisms in financial time series. The application to portfolio optimization is described in Chin and Weigend (1998).

Assuntos

Inteligência Artificial , Modelos Econômicos , Redes Neurais de Computação , Algoritmos

6.

Nonlinear gated experts for time series: discovering regimes and avoiding overfitting.

Weigend, A S; Mangeas, M; Srivastava, A N.

Int J Neural Syst ; 6(4): 373-99, 1995 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-8963468

RESUMO

In the analysis and prediction of real-world systems, two of the key problems are nonstationarity (often in the form of switching between regimes) and overfitting (particularly serious for noisy processes). This article addresses these problems using gated experts, consisting of a (nonlinear) gating network, and several (also nonlinear) competing experts. Each expert learns to predict the conditional mean, and each expert adapts its width to match the noise level in its regime. The gating network learns to predict the probability of each expert, given the input. This article focuses on the case where the gating network bases its decision on information from the inputs. This can be contrasted to hidden Markov models where the decision is based on the previous state(s) (i.e. on the output of the gating network at the previous time step), as well as to averaging over several predictors. In contrast, gated experts soft-partition the input space, only learning to model their region. This article discusses the underlying statistical assumptions, derives the weight update rules, and compares the performance of gated experts to standard methods on three time series: (1) a computer-generated series, obtained by randomly switching between two nonlinear processes; (2) a time series from the Santa Fe Time Series Competition (the light intensity of a laser in chaotic state); and (3) the daily electricity demand of France, a real-world multivariate problem with structure on several time scales. The main results are: (1) the gating network correctly discovers the different regimes of the process; (2) the widths associated with each expert are important for the segmentation task (and they can be used to characterize the sub-processes); and (3) there is less overfitting compared to single networks (homogeneous multilayer perceptrons), since the experts learn to match their variances to the (local) noise levels. This can be viewed as matching the local complexity of the model to the local complexity of the data.

Assuntos

Redes Neurais de Computação , Dinâmica não Linear , Computadores , Modelos Teóricos

7.

Predicting conditional probability distributions: a connectionist approach.

Weigend, A S; Srivastava, A N.

Int J Neural Syst ; 6(2): 109-18, 1995 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-7496584

RESUMO

Most traditional prediction techniques deliver a single point, usually the mean of a probability distribution. For multimodal processes, instead of predicting the mean, it is important to predict the full distribution. This article presents a new connectionist method to predict the conditional probability distribution in response to an input. The main idea is to transform the problem from a regression problem to a classification problem. The conditional probability distribution network can perform both direct predictions and iterated predictions, the latter task being specific for time series problems. We compare this new method to fuzzy logic and discuss important differences, and also demonstrate the architecture on two time series. The first is the benchmark laser series used in the Santa Fe competition, a deterministic chaotic system. The second is a time series from a Markov process which exhibits structure on two time scales. The network produces multimodal predictions for this series. We compare the predictions of the network with a nearest-neighbor predictor and find that the conditional probability network is more than twice as likely a model.

Assuntos

Redes Neurais de Computação , Teoria da Probabilidade , Análise de Regressão , Processos Estocásticos

8.

Computing second derivatives in feed-forward networks: a review.

Buntine, W L; Weigend, A S.

IEEE Trans Neural Netw ; 5(3): 480-8, 1994.

Artigo em Inglês | MEDLINE | ID: mdl-18267814

RESUMO

The calculation of second derivatives is required by recent training and analysis techniques of connectionist networks, such as the elimination of superfluous weights, and the estimation of confidence intervals both for weights and network outputs. We review and develop exact and approximate algorithms for calculating second derivatives. For networks with |w| weights, simply writing the full matrix of second derivatives requires O(|w|(2)) operations. For networks of radial basis units or sigmoid units, exact calculation of the necessary intermediate terms requires of the order of 2h+2 backward/forward-propagation passes where h is the number of hidden units in the network. We also review and compare three approximations (ignoring some components of the second derivative, numerical differentiation, and scoring). The algorithms apply to arbitrary activation functions, networks, and error functions.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA