Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
J Appl Stat ; 50(14): 2914-2933, 2023.
Article in English | MEDLINE | ID: mdl-37808617

ABSTRACT

This article concerns predictive modeling for spatio-temporal data as well as model interpretation using data information in space and time. We develop a novel approach based on supervised dimension reduction for such data in order to capture nonlinear mean structures without requiring a prespecified parametric model. In addition to prediction as a common interest, this approach emphasizes the exploration of geometric information from the data. The method of Pairwise Directions Estimation (PDE) is implemented in our approach as a data-driven function searching for spatial patterns and temporal trends. The benefit of using geometric information from the method of PDE is highlighted, which aids effectively in exploring data structures. We further enhance PDE, referring to it as PDE+, by incorporating kriging to estimate the random effects not explained in the mean functions. Our proposal can not only increase prediction accuracy but also improve the interpretation for modeling. Two simulation examples are conducted and comparisons are made with several existing methods. The results demonstrate that the proposed PDE+ method is very useful for exploring and interpreting the patterns and trends for spatio-temporal data. Illustrative applications to two real datasets are also presented.

2.
J Appl Stat ; 50(6): 1418-1434, 2023.
Article in English | MEDLINE | ID: mdl-37025283

ABSTRACT

In a systematic review of a diagnostic performance, summarizing performance metrics is crucial. There are various summary models in the literature, and hence model selection becomes inevitable. However, most existing large-sample-based model selection approaches may not fit in a meta-analysis of diagnostic studies, typically having a rather small sample size. Researchers need to effectively determine the final model for further inference, which motivates this article to investigate existing methods and to suggest a more robust method for this need. We considered models covering several widely-used methods for bivariate summary of sensitivity and specificity. Simulation studies were conducted based on different number of studies and different population sensitivity and specificity. Then final models were selected using several existing criteria, and we compared the summary receiver operating characteristic (sROC) curves to the theoretical ROC curve given the generating model. Even though parametric likelihood-based criteria are often applied in practice for their asymptotic property, they fail to consistently choose appropriate models under the limited number of studies. When the number of studies is as small as 10 or 5, our suggestion is best in different scenarios. An example for summary ROC curves for chemiluminescence immunoassay (CLIA) used in COVID-19 diagnosis is also illustrated.

3.
Biometrics ; 79(4): 3637-3649, 2023 12.
Article in English | MEDLINE | ID: mdl-36594650

ABSTRACT

The Taiwan air quality monitoring network (TAQMN) and the AirBox network both monitor PM2.5 in Taiwan. The TAQMN, managed by Taiwan's Environmental Protection Administration (EPA), provides high-quality PM2.5 measurements at 77 monitoring stations. The AirBox network launched more recently consists of low-cost, small internet-of-things (IoT) microsensors (i.e., AirBoxes) at thousands of locations. While the AirBox network provides broad spatial coverage, its measurements are unreliable and require calibrations. However, applying a universal calibration procedure to all AirBoxes does not work well because the calibration line varies with local factors, including the chemical compositions of PM2.5 , which are not homogeneous in space. Therefore, different calibrations are needed at different locations to adapt to their local environments. Unfortunately, AirBoxes and EPA locations are misaligned, challenging the calibration task. In this paper, we propose a spatial model with spatially varying coefficients to account for the heterogeneity in the data. Our method gives spatially adaptive calibrations of AirBoxes and produces accurate PM2.5 concentration estimates with their error bars at any location, incorporating two types of measurements. In addition, the proposed method is robust to outliers, requires no colocated data, and provides calibration formulas for new AirBoxes once they are added to the network. We illustrate our approach using hourly PM2.5 data in 2020. After the calibration, the results show that the PM2.5 prediction improves by about 38%-68% in root-mean-squared prediction error. Once the calibration formulas are established, we can obtain reliable PM2.5 values even if we ignore EPA data.


Subject(s)
Air Pollutants , Air Pollution , Air Pollutants/analysis , Particulate Matter/analysis , Calibration , Environmental Monitoring/methods , Air Pollution/analysis
4.
PLoS One ; 17(2): e0263516, 2022.
Article in English | MEDLINE | ID: mdl-35134072

ABSTRACT

The ability to determine a sound's location is critical in everyday life. However, sound source localization is severely compromised for patients with hearing loss who receive bilateral cochlear implants (BiCIs). Several patient factors relate to poorer performance in listeners with BiCIs, associated with auditory deprivation, experience, and age. Critically, characteristic errors are made by patients with BiCIs (e.g., medial responses at lateral target locations), and the relationship between patient factors and the type of errors made by patients has seldom been investigated across individuals. In the present study, several different types of analysis were used to understand localization errors and their relationship with patient-dependent factors (selected based on their robustness of prediction). Binaural hearing experience is required for developing accurate localization skills, auditory deprivation is associated with degradation of the auditory periphery, and aging leads to poorer temporal resolution. Therefore, it was hypothesized that earlier onsets of deafness would be associated with poorer localization acuity and longer periods without BiCI stimulation or older age would lead to greater amounts of variability in localization responses. A novel machine learning approach was introduced to characterize the types of errors made by listeners with BiCIs, making them simple to interpret and generalizable to everyday experience. Sound localization performance was measured in 48 listeners with BiCIs using pink noise trains presented in free-field. Our results suggest that older age at testing and earlier onset of deafness are associated with greater average error, particularly for sound sources near the center of the head, consistent with previous research. The machine learning analysis revealed that variability of localization responses tended to be greater for individuals with earlier compared to later onsets of deafness. These results suggest that early bilateral hearing is essential for best sound source localization outcomes in listeners with BiCIs.


Subject(s)
Hearing Loss, Bilateral/physiopathology , Sound Localization/physiology , Acoustic Stimulation/methods , Adult , Age Factors , Age of Onset , Aged , Aged, 80 and over , Auditory Perception/physiology , Cochlear Implantation/methods , Cochlear Implants/adverse effects , Cues , Deafness/physiopathology , Female , Hearing/physiology , Hearing Loss/physiopathology , Hearing Tests , Humans , Male , Middle Aged , Sound
5.
Stat Med ; 40(5): 1243-1261, 2021 02 28.
Article in English | MEDLINE | ID: mdl-33336451

ABSTRACT

Quantitative imaging biomarkers (QIB) are extracted from medical images in radiomics for a variety of purposes including noninvasive disease detection, cancer monitoring, and precision medicine. The existing methods for QIB extraction tend to be ad hoc and not reproducible. In this article, a general and flexible statistical approach is proposed for handling up to three-dimensional medical images and reasonably capturing features with respect to specific spatial patterns. In particular, a model-based spatial process decomposition is developed where the random weights are unique to individual patients for component functions common across patients. Model fitting and selection are based on maximum likelihood, while feature extractions are via optimal prediction of the underlying true image. Simulation studies are conducted to investigate the properties of the proposed methodology. For illustration, a cancer image data set is analyzed and QIBs are extracted in association with a clinical endpoint.


Subject(s)
Neoplasms , Biomarkers , Humans , Imaging, Three-Dimensional , Neoplasms/diagnostic imaging , Precision Medicine
6.
Biomark Med ; 12(8): 849-859, 2018 08.
Article in English | MEDLINE | ID: mdl-30022678

ABSTRACT

AIM: We develop a subgroup selection procedure using both prognostic and predictive biomarkers to identify four patient subpopulations: low- and high-risk responders, and low- and high-risk nonresponders. METHODS: We utilize three regression models to identify three sets of biomarkers: S, prognostic biomarkers; T, predictive biomarkers; and U, prognostic and predictive biomarkers. The prognostic signature C(S) combines with a predictive signature, either C(T) or C(U), to develop two procedures C(S,T) and C(S,U) for identification of four subgroups. RESULTS: Simulation experiment showed that proposed models for identifying the biomarker sets S and U performed well, as did the procedure C(S,U) for subgroup identification. CONCLUSION: The proposed model provides more comprehensive characterization of patient subpopulations, and better accuracy in patient treatment assignment.


Subject(s)
Clinical Decision-Making , Computer Simulation , Models, Theoretical , Biomarkers/metabolism , Humans
7.
Stat Methods Med Res ; 27(11): 3492-3504, 2018 11.
Article in English | MEDLINE | ID: mdl-28535712

ABSTRACT

Many studies measure the same type of information longitudinally on the same subject at multiple time points, and clustering of such functional data has many important applications. We propose a novel and easy method to implement dissimilarity measure for functional data clustering based on smoothing splines and smoothing parameter commutation. This method handles data observed at regular or irregular time points in the same way. We measure the dissimilarity between subjects based on varying curve estimates with pairwise commutation of smoothing parameters. The intuition is that smoothing parameters of smoothing splines reflect the inverse of the signal-to-noise ratios and that when applying an identical smoothing parameter the smoothed curves for two similar subjects are expected to be close. Our method takes into account the estimation uncertainty using smoothing parameter commutation and is not strongly affected by outliers. It can also be used for outlier detection. The effectiveness of our proposal is shown by simulations comparing it to other dissimilarity measures and by a real application to methadone dosage maintenance levels.


Subject(s)
Cluster Analysis , Data Accuracy , Longitudinal Studies , Algorithms , Bias , Humans , Methadone/administration & dosage , Opiate Substitution Treatment
8.
BMC Bioinformatics ; 8: 74, 2007 Mar 05.
Article in English | MEDLINE | ID: mdl-17338815

ABSTRACT

BACKGROUND: A microarray study may select different differentially expressed gene sets because of different selection criteria. For example, the fold-change and p-value are two commonly known criteria to select differentially expressed genes under two experimental conditions. These two selection criteria often result in incompatible selected gene sets. Also, in a two-factor, say, treatment by time experiment, the investigator may be interested in one gene list that responds to both treatment and time effects. RESULTS: We propose three layer ranking algorithms, point-admissible, line-admissible (convex), and Pareto, to provide a preference gene list from multiple gene lists generated by different ranking criteria. Using the public colon data as an example, the layer ranking algorithms are applied to the three univariate ranking criteria, fold-change, p-value, and frequency of selections by the SVM-RFE classifier. A simulation experiment shows that for experiments with small or moderate sample sizes (less than 20 per group) and detecting a 4-fold change or less, the two-dimensional (p-value and fold-change) convex layer ranking selects differentially expressed genes with generally lower FDR and higher power than the standard p-value ranking. Three applications are presented. The first application illustrates a use of the layer rankings to potentially improve predictive accuracy. The second application illustrates an application to a two-factor experiment involving two dose levels and two time points. The layer rankings are applied to selecting differentially expressed genes relating to the dose and time effects. In the third application, the layer rankings are applied to a benchmark data set consisting of three dilution concentrations to provide a ranking system from a long list of differentially expressed genes generated from the three dilution concentrations. CONCLUSION: The layer ranking algorithms are useful to help investigators in selecting the most promising genes from multiple gene lists generated by different filter, normalization, or analysis methods for various objectives.


Subject(s)
Databases, Genetic , Gene Expression Profiling/methods , Models, Genetic , Algorithms , Colonic Neoplasms/genetics , Humans , Oligonucleotide Array Sequence Analysis/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...