Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Appl Stat ; 51(8): 1427-1445, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38863805

RESUMO

Datasets that are big with regard to their volume, variety and velocity are becoming increasingly common. However, limitations in computer processing can often restrict analysis performed on them. Nonuniform subsampling methods are effective in reducing computational loads for massive data. However, the variance of the estimator of nonuniform subsampling methods becomes large when the subsampling probabilities are highly heterogenous. To this end, we develop two new algorithms to improve the estimation method for massive data logistic regression based on a chosen hard threshold value and combining subsamples, respectively. The basic idea of the hard threshold method is to carefully select a threshold value and then replace subsampling probabilities lower than the threshold value with the chosen value itself. The main idea behind the combining subsamples method is to better exploit information in the data without hitting the computation bottleneck by generating many subsamples and then combining estimates constructed from the subsamples. The combining subsamples method obtains the standard error of the parameter estimator without estimating the sandwich matrix, which provides convenience for statistical inference in massive data, and can significantly improve the estimation efficiency. Asymptotic properties of the resultant estimators are established. Simulations and analysis of real data are conducted to assess and showcase the practical performance of the proposed methods.

2.
J Appl Stat ; 51(7): 1318-1343, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38835830

RESUMO

Autoregressive models in time series are useful in various areas. In this article, we propose a skew-t autoregressive model. We estimate its parameters using the expectation-maximization (EM) method and develop the influence methodology based on local perturbations for its validation. We obtain the normal curvatures for four perturbation strategies to identify influential observations, and then to assess their performance through Monte Carlo simulations. An example of financial data analysis is presented to study daily log-returns for Brent crude futures and investigate possible impact by the COVID-19 pandemic.

3.
Entropy (Basel) ; 25(8)2023 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-37628194

RESUMO

This paper introduces a novel approach, called causal relation quantification, based on change-point detection to address the issue of harmonic responsibility division in power systems. The proposed method focuses on determining the causal effect of chronological continuous treatment, enabling the identification of crucial treatment intervals. Within each interval, three propensity-score-based algorithms are executed to assess their respective causal effects. By integrating the results from each interval, the overall causal effect of a chronological continuous treatment variable can be calculated. This calculated overall causal effect represents the causal responsibility of each harmonic customer. The effectiveness of the proposed method is evaluated through a simulation study and demonstrated in an empirical harmonic application. The results of the simulation study indicate that our method provides accurate and robust estimates, while the calculated results in the harmonic application align closely with the real-world scenario as verified by on-site investigations.

4.
J Appl Stat ; 49(5): 1323-1347, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35707504

RESUMO

In the present study, we provide a motivating example with a financial application under COVID-19 pandemic to investigate autoregressive (AR) modeling and its diagnostics based on asymmetric distributions. The objectives of this work are: (i) to formulate asymmetric AR models and their estimation and diagnostics; (ii) to assess the performance of the parameters estimators and of the local influence technique for these models; and (iii) to provide a tool to show how data following an asymmetric distribution under an AR structure should be analyzed. We take the advantages of the stochastic representation of the skew-normal distribution to estimate the parameters of the corresponding AR model efficiently with the expectation-maximization algorithm. Diagnostic analytics are conducted by using the local influence technique with four perturbation schemes. By employing Monte Carlo simulations, we evaluate the statistical behavior of the corresponding estimators and of the local influence technique. An illustration with financial data updated until 2020, analyzed using the methodology introduced in the present work, is presented as an example of effective applications, from where it is possible to explain atypical cases from the COVID-19 pandemic.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...