Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Alzheimers Dement ; 20(4): 3074-3079, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-38324244

RESUMO

This perspective outlines the Artificial Intelligence and Technology Collaboratories (AITC) at Johns Hopkins University, University of Pennsylvania, and University of Massachusetts, highlighting their roles in developing AI-based technologies for older adult care, particularly targeting Alzheimer's disease (AD). These National Institute on Aging (NIA) centers foster collaboration among clinicians, gerontologists, ethicists, business professionals, and engineers to create AI solutions. Key activities include identifying technology needs, stakeholder engagement, training, mentoring, data integration, and navigating ethical challenges. The objective is to apply these innovations effectively in real-world scenarios, including in rural settings. In addition, the AITC focuses on developing best practices for AI application in the care of older adults, facilitating pilot studies, and addressing ethical concerns related to technology development for older adults with cognitive impairment, with the ultimate aim of improving the lives of older adults and their caregivers. HIGHLIGHTS: Addressing the complex needs of older adults with Alzheimer's disease (AD) requires a comprehensive approach, integrating medical and social support. Current gaps in training, techniques, tools, and expertise hinder uniform access across communities and health care settings. Artificial intelligence (AI) and digital technologies hold promise in transforming care for this demographic. Yet, transitioning these innovations from concept to marketable products presents significant challenges, often stalling promising advancements in the developmental phase. The Artificial Intelligence and Technology Collaboratories (AITC) program, funded by the National Institute on Aging (NIA), presents a viable model. These Collaboratories foster the development and implementation of AI methods and technologies through projects aimed at improving care for older Americans, particularly those with AD, and promote the sharing of best practices in AI and technology integration. Why Does This Matter? The National Institute on Aging (NIA) Artificial Intelligence and Technology Collaboratories (AITC) program's mission is to accelerate the adoption of artificial intelligence (AI) and new technologies for the betterment of older adults, especially those with dementia. By bridging scientific and technological expertise, fostering clinical and industry partnerships, and enhancing the sharing of best practices, this program can significantly improve the health and quality of life for older adults with Alzheimer's disease (AD).


Assuntos
Doença de Alzheimer , Isotiocianatos , Estados Unidos , Humanos , Idoso , Doença de Alzheimer/terapia , Inteligência Artificial , Gerociência , Qualidade de Vida , Tecnologia
2.
Proc Mach Learn Res ; 216: 1047-1057, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-37724310

RESUMO

Just-in-Time Adaptive Interventions (JITAIs) are a class of personalized health interventions developed within the behavioral science community. JITAIs aim to provide the right type and amount of support by iteratively selecting a sequence of intervention options from a pre-defined set of components in response to each individual's time varying state. In this work, we explore the application of reinforcement learning methods to the problem of learning intervention option selection policies. We study the effect of context inference error and partial observability on the ability to learn effective policies. Our results show that the propagation of uncertainty from context inferences is critical to improving intervention efficacy as context uncertainty increases, while policy gradient algorithms can provide remarkable robustness to partially observed behavioral state information.

3.
Drug Alcohol Depend ; 250: 110898, 2023 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-37523916

RESUMO

BACKGROUND: Our group has established the feasibility of using on-body electrocardiographic (ECG) sensors to detect cocaine use in the human laboratory. The purpose of the current study was to test whether ECG sensors and features are capable of discriminating cocaine use from other non-cocaine sympathomimetics. METHODS: Eleven subjects with cocaine use disorder wore the Zephyr BioHarness™ 3 chest band under six experimental (drug and non-drug) conditions, including 1) laboratory, intravenous cocaine self-administration, 2) after a single oral dose of methylphenidate, 3) during aerobic exercise, 4) during tobacco use (N=7 who smoked tobacco), and 5) during routine activities of daily inpatient living (unit activity). Three ECG-derived feature sets served as primary outcome measures, including 1) the RR interval (i.e., heart rate), 2) a group of ECG interval proxies (i.e., PR, QS, QT and QTc intervals), and 3) the full ECG waveform. Discriminatory power between cocaine and non-cocaine conditions for each of the three outcomes measures was expressed as the area under the receiver operating characteristics (AUROC) curve. RESULTS: All three outcomes successfully discriminated cocaine use from unit activity, exercise, tobacco, and methylphenidate conditions with a mean AUROC values ranging from 0.66 to 0.99 and with least squares means values all statistically different/higher than 0.5 among all subjects [F(3, 99) = 3.38, p =0.02] and among those with tobacco use [F(4, 84) = 5.39, p = 0.0007]. CONCLUSIONS: These preliminary results support discriminatory power of wearable ECG sensors for detecting cocaine use.


Assuntos
Transtornos Relacionados ao Uso de Cocaína , Cocaína , Metilfenidato , Dispositivos Eletrônicos Vestíveis , Humanos , Simpatomiméticos , Eletrocardiografia , Transtornos Relacionados ao Uso de Cocaína/diagnóstico
4.
Transl Behav Med ; 13(1): 7-16, 2023 01 20.
Artigo em Inglês | MEDLINE | ID: mdl-36416389

RESUMO

The ILHBN is funded by the National Institutes of Health to collaboratively study the interactive dynamics of behavior, health, and the environment using Intensive Longitudinal Data (ILD) to (a) understand and intervene on behavior and health and (b) develop new analytic methods to innovate behavioral theories and interventions. The heterogenous study designs, populations, and measurement protocols adopted by the seven studies within the ILHBN created practical challenges, but also unprecedented opportunities to capitalize on data harmonization to provide comparable views of data from different studies, enhance the quality and utility of expensive and hard-won ILD, and amplify scientific yield. The purpose of this article is to provide a brief report of the challenges, opportunities, and solutions from some of the ILHBN's cross-study data harmonization efforts. We review the process through which harmonization challenges and opportunities motivated the development of tools and collection of metadata within the ILHBN. A variety of strategies have been adopted within the ILHBN to facilitate harmonization of ecological momentary assessment, location, accelerometer, and participant engagement data while preserving theory-driven heterogeneity and data privacy considerations. Several tools have been developed by the ILHBN to resolve challenges in integrating ILD across multiple data streams and time scales both within and across studies. Harmonization of distinct longitudinal measures, measurement tools, and sampling rates across studies is challenging, but also opens up new opportunities to address cross-cutting scientific themes of interest.


Health behavior changes, such as prevention of suicidal thoughts and behaviors, smoking, drug use, and alcohol use; and the promotion of mental health, sleep, and physical activities, and decreases in sedentary behavior, are difficult to sustain. The ILHBN is a cooperative agreement network funded jointly by seven participating units within the National Institutes of Health to collaboratively study how factors that occur in individuals' everyday life and in their natural environment influence the success of positive health behavior changes. This article discusses how information collected using smartphones, wearables, and other devices can provide helpful active and passive reflections of the participants' extent of risk and resources at the moment for an extended period of time. However, successful engagement and retention of participants also require tailored adaptations of study designs, measurement tools, measurement intervals, study span, and device choices that create hurdles in integrating (harmonizing) data from multiple studies. We describe some of the challenges, opportunities, and solutions that emerged from harmonizing intensive longitudinal data under heterogeneous study and participant characteristics within the ILHBN, and share some tools and recommendations to facilitate future data harmonization efforts.


Assuntos
Avaliação Momentânea Ecológica , Projetos de Pesquisa , Humanos , Necessidades e Demandas de Serviços de Saúde , Literatura de Revisão como Assunto
5.
Artigo em Inglês | MEDLINE | ID: mdl-35206455

RESUMO

Background: Recent advances in mobile and wearable technologies have led to new forms of interventions, called "Just-in-Time Adaptive Interventions" (JITAI). JITAIs interact with the individual at the most appropriate time and provide the most appropriate support depending on the continuously acquired Intensive Longitudinal Data (ILD) on participant physiology, behavior, and contexts. These advances raise an important question: How do we model these data to better understand and intervene on health behaviors? The HeartSteps II study, described here, is a Micro-Randomized Trial (MRT) intended to advance both intervention development and theory-building enabled by the new generation of mobile and wearable technology. Methods: The study involves a year-long deployment of HeartSteps, a JITAI for physical activity and sedentary behavior, with 96 sedentary, overweight, but otherwise healthy adults. The central purpose is twofold: (1) to support the development of modeling approaches for operationalizing dynamic, mathematically rigorous theories of health behavior; and (2) to serve as a testbed for the development of learning algorithms that JITAIs can use to individualize intervention provision in real time at multiple timescales. Discussion and Conclusions: We outline an innovative modeling paradigm to model and use ILD in real- or near-time to individually tailor JITIAs.


Assuntos
Comportamento Sedentário , Telemedicina , Adulto , Terapia Comportamental , Exercício Físico , Comportamentos Relacionados com a Saúde , Humanos , Ensaios Clínicos Controlados Aleatórios como Assunto , Telemedicina/métodos
6.
Artigo em Inglês | MEDLINE | ID: mdl-37736024

RESUMO

In this paper we present BayesLDM, a library for Bayesian longitudinal data modeling consisting of a high-level modeling language with specific features for modeling complex multivariate time series data coupled with a compiler that can produce optimized probabilistic program code for performing inference in the specified model. BayesLDM supports modeling of Bayesian network models with a specific focus on the efficient, declarative specification of dynamic Bayesian Networks (DBNs). The BayesLDM compiler combines a model specification with inspection of available data and outputs code for performing Bayesian inference for unknown model parameters while simultaneously handling missing data. These capabilities have the potential to significantly accelerate iterative modeling workflows in domains that involve the analysis of complex longitudinal data by abstracting away the process of producing computationally efficient probabilistic inference code. We describe the BayesLDM system components, evaluate the efficiency of representation and inference optimizations and provide an illustrative example of the application of the system to analyzing heterogeneous and partially observed mobile health data.

8.
IEEE Trans Image Process ; 26(11): 5435-5446, 2017 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-28783635

RESUMO

This paper proposes an out-of-sample extension framework for a global manifold learning algorithm (Isomap) that uses temporal information in out-of-sample points in order to make the embedding more robust to noise and artifacts. Given a set of noise-free training data and its embedding, the proposed framework extends the embedding for a noisy time series. This is achieved by adding a spatio-temporal compactness term to the optimization objective of the embedding. To the best of our knowledge, this is the first method for out-of-sample extension of manifold embeddings that leverages timing information available for the extension set. Experimental results demonstrate that our out-of-sample extension algorithm renders a more robust and accurate embedding of sequentially ordered image data in the presence of various noise and artifacts when compared with other timing-aware embeddings. Additionally, we show that an out-of-sample extension framework based on the proposed algorithm outperforms the state of the art in eye-gaze estimation.

9.
Artigo em Inglês | MEDLINE | ID: mdl-29417956

RESUMO

The ability to monitor eye closures and blink patterns has long been known to enable accurate assessment of fatigue and drowsiness in individuals. Many measures of the eye are known to be correlated with fatigue including coarse-grained measures like the rate of blinks as well as fine-grained measures like the duration of blinks and the extent of eye closures. Despite a plethora of research validating these measures, we lack wearable devices that can continually and reliably monitor them in the natural environment. In this work, we present a low-power system, iLid, that can continually sense fine-grained measures such as blink duration and Percentage of Eye Closures (PERCLOS) at high frame rates of 100fps. We present a complete solution including design of the sensing, signal processing, and machine learning pipeline; implementation on a prototype computational eyeglass platform; and extensive evaluation under many conditions including illumination changes, eyeglass shifts, and mobility. Our results are very encouraging, showing that we can detect blinks, blink duration, eyelid location, and fatigue-related metrics such as PERCLOS with less than a few percent error.

10.
KDD ; 2017: 1773-1781, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-29333328

RESUMO

In this paper, we present a new approach to learning cascaded classifiers for use in computing environments that involve networks of heterogeneous and resource-constrained, low-power embedded compute and sensing nodes. We present a generalization of the classical linear detection cascade to the case of tree-structured cascades where different branches of the tree execute on different physical compute nodes in the network. Different nodes have access to different features, as well as access to potentially different computation and energy resources. We concentrate on the problem of jointly learning the parameters for all of the classifiers in the cascade given a fixed cascade architecture and a known set of costs required to carry out the computation at each node. To accomplish the objective of joint learning of all detectors, we propose a novel approach to combining classifier outputs during training that better matches the hard cascade setting in which the learned system will be deployed. This work is motivated by research in the area of mobile health where energy efficient real time detectors integrating information from multiple wireless on-body sensors and a smart phone are needed for real-time monitoring and the delivery of just-in-time adaptive interventions. We evaluate our framework on mobile sensor-based human activity recognition and mobile health detector learning problems.

11.
Proc Mach Learn Res ; 54: 157-165, 2017 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-30465043

RESUMO

In this paper, we consider the problem of learning time series detection models from temporally imprecise labels. In this problem, the data consist of a set of input time series, and supervision is provided by a sequence of noisy time stamps corresponding to the occurrence of positive class events. Such temporally imprecise labels occur in areas like mobile health research when human annotators are tasked with labeling the occurrence of very short duration events. We propose a general learning framework for this problem that can accommodate different base classifiers and noise models. We present results on real mobile health data showing that the proposed framework significantly outperforms a number of alternatives including assuming that the label time stamps are noise-free, transforming the problem into the multiple instance learning framework, and learning on labels that were manually aligned.

12.
J Med Internet Res ; 18(11): e285, 2016 11 08.
Artigo em Inglês | MEDLINE | ID: mdl-27826134

RESUMO

BACKGROUND: Outside health care, content tailoring is driven algorithmically using machine learning compared to the rule-based approach used in current implementations of computer-tailored health communication (CTHC) systems. A special class of machine learning systems ("recommender systems") are used to select messages by combining the collective intelligence of their users (ie, the observed and inferred preferences of users as they interact with the system) and their user profiles. However, this approach has not been adequately tested for CTHC. OBJECTIVE: Our aim was to compare, in a randomized experiment, a standard, evidence-based, rule-based CTHC (standard CTHC) to a novel machine learning CTHC: Patient Experience Recommender System for Persuasive Communication Tailoring (PERSPeCT). We hypothesized that PERSPeCT will select messages of higher influence than our standard CTHC system. This standard CTHC was proven effective in motivating smoking cessation in a prior randomized trial of 900 smokers (OR 1.70, 95% CI 1.03-2.81). METHODS: PERSPeCT is an innovative hybrid machine learning recommender system that selects and sends motivational messages using algorithms that learn from message ratings from 846 previous participants (explicit feedback), and the prior explicit ratings of each individual participant. Current smokers (N=120) aged 18 years or older, English speaking, with Internet access were eligible to participate. These smokers were randomized to receive either PERSPeCT (intervention, n=74) or standard CTHC tailored messages (n=46). The study was conducted between October 2014 and January 2015. By randomization, we compared daily message ratings (mean of smoker ratings each day). At 30 days, we assessed the intervention's perceived influence, 30-day cessation, and changes in readiness to quit from baseline. RESULTS: The proportion of days when smokers agreed/strongly agreed (daily rating ≥4) that the messages influenced them to quit was significantly higher for PERSPeCT (73%, 23/30) than standard CTHC (44%, 14/30, P=.02). Among less educated smokers (n=49), this difference was even more pronounced for days strongly agree (intervention: 77%, 23/30; comparison: 23%, 7/30, P<.001). There was no significant difference in the frequency which PERSPeCT randomized smokers agreed or strongly agreed that the intervention influenced them to quit smoking (P=.07) and use nicotine replacement therapy (P=.09). Among those who completed follow-up, 36% (20/55) of PERSPeCT smokers and 32% (11/34) of the standard CTHC group stopped smoking for one day or longer (P=.70). CONCLUSIONS: Compared to standard CTHC with proven effectiveness, PERSPeCT outperformed in terms of influence ratings and resulted in similar cessation rates. CLINICALTRIAL: Clinicaltrials.gov NCT02200432; https://clinicaltrials.gov/ct2/show/NCT02200432 (Archived by WebCite at http://www.webcitation.org/6lEJY1KEd).


Assuntos
Comunicação em Saúde/métodos , Internet/estatística & dados numéricos , Aprendizado de Máquina , Abandono do Hábito de Fumar/métodos , Prática Clínica Baseada em Evidências , Feminino , Humanos , Masculino , Pessoa de Meia-Idade
13.
J Med Internet Res ; 18(3): e42, 2016 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-26952574

RESUMO

BACKGROUND: What is the next frontier for computer-tailored health communication (CTHC) research? In current CTHC systems, study designers who have expertise in behavioral theory and mapping theory into CTHC systems select the variables and develop the rules that specify how the content should be tailored, based on their knowledge of the targeted population, the literature, and health behavior theories. In collective-intelligence recommender systems (hereafter recommender systems) used by Web 2.0 companies (eg, Netflix and Amazon), machine learning algorithms combine user profiles and continuous feedback ratings of content (from themselves and other users) to empirically tailor content. Augmenting current theory-based CTHC with empirical recommender systems could be evaluated as the next frontier for CTHC. OBJECTIVE: The objective of our study was to uncover barriers and challenges to using recommender systems in health promotion. METHODS: We conducted a focused literature review, interviewed subject experts (n=8), and synthesized the results. RESULTS: We describe (1) limitations of current CTHC systems, (2) advantages of incorporating recommender systems to move CTHC forward, and (3) challenges to incorporating recommender systems into CTHC. Based on the evidence presented, we propose a future research agenda for CTHC systems. CONCLUSIONS: We promote discussion of ways to move CTHC into the 21st century by incorporation of recommender systems.


Assuntos
Comportamentos Relacionados com a Saúde , Comunicação em Saúde/métodos , Internet , Algoritmos , Computadores/tendências , Retroalimentação , Comunicação em Saúde/tendências , Humanos , Aprendizado de Máquina
14.
Proc ACM Int Conf Ubiquitous Comput ; 2016: 875-885, 2016 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-28090605

RESUMO

Mobile health research on illicit drug use detection typically involves a two-stage study design where data to learn detectors is first collected in lab-based trials, followed by a deployment to subjects in a free-living environment to assess detector performance. While recent work has demonstrated the feasibility of wearable sensors for illicit drug use detection in the lab setting, several key problems can limit lab-to-field generalization performance. For example, lab-based data collection often has low ecological validity, the ground-truth event labels collected in the lab may not be available at the same level of temporal granularity in the field, and there can be significant variability between subjects. In this paper, we present domain adaptation methods for assessing and mitigating potential sources of performance loss in lab-to-field generalization and apply them to the problem of cocaine use detection from wearable electrocardiogram sensor data.

15.
JMLR Workshop Conf Proc ; 48: 334-343, 2016 06.
Artigo em Inglês | MEDLINE | ID: mdl-28090606

RESUMO

The field of mobile health (mHealth) has the potential to yield new insights into health and behavior through the analysis of continuously recorded data from wearable health and activity sensors. In this paper, we present a hierarchical span-based conditional random field model for the key problem of jointly detecting discrete events in such sensor data streams and segmenting these events into high-level activity sessions. Our model includes higher-order cardinality factors and inter-event duration factors to capture domain-specific structure in the label space. We show that our model supports exact MAP inference in quadratic time via dynamic programming, which we leverage to perform learning in the structured support vector machine framework. We apply the model to the problems of smoking and eating detection using four real data sets. Our results show statistically significant improvements in segmentation performance relative to a hierarchical pairwise CRF.

16.
Proc Eye Track Res Appl Symp ; 2016: 313-314, 2016 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-29629433

RESUMO

The human eye offers a fascinating window into an individual's health, cognitive attention, and decision making, but we lack the ability to continually measure these parameters in the natural environment. We demonstrate CIDER, a system that operates in a highly optimized low-power mode under indoor settings by using a fast Search-Refine controller to track the eye, but detects when the environment switches to more challenging outdoor sunlight and switches models to operate robustly under this condition. Our design is holistic and tackles a) power consumption in digitizing pixels, estimating pupillary parameters, and illuminating the eye via near-infrared and b) error in estimating pupil center and pupil dilation. We demonstrate that CIDER can estimate pupil center with error less than two pixels (0.6°), and pupil diameter with error of one pixel (0.22mm). Our end-to-end results show that we can operate at power levels of roughly 7mW at a 4Hz eye tracking rate, or roughly 32mW at rates upwards of 250Hz.

17.
Proc ACM Int Conf Ubiquitous Comput ; 2015: 999-1010, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26543927

RESUMO

Recent researches have demonstrated the feasibility of detecting smoking from wearable sensors, but their performance on real-life smoking lapse detection is unknown. In this paper, we propose a new model and evaluate its performance on 61 newly abstinent smokers for detecting a first lapse. We use two wearable sensors - breathing pattern from respiration and arm movements from 6-axis inertial sensors worn on wrists. In 10-fold cross-validation on 40 hours of training data from 6 daily smokers, our model achieves a recall rate of 96.9%, for a false positive rate of 1.1%. When our model is applied to 3 days of post-quit data from 32 lapsers, it correctly pinpoints the timing of first lapse in 28 participants. Only 2 false episodes are detected on 20 abstinent days of these participants. When tested on 84 abstinent days from 28 abstainers, the false episode per day is limited to 1/6.

18.
J Am Med Inform Assoc ; 22(6): 1137-42, 2015 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-26555017

RESUMO

Mobile sensor data-to-knowledge (MD2K) was chosen as one of 11 Big Data Centers of Excellence by the National Institutes of Health, as part of its Big Data-to-Knowledge initiative. MD2K is developing innovative tools to streamline the collection, integration, management, visualization, analysis, and interpretation of health data generated by mobile and wearable sensors. The goal of the big data solutions being developed by MD2K is to reliably quantify physical, biological, behavioral, social, and environmental factors that contribute to health and disease risk. The research conducted by MD2K is targeted at improving health through early detection of adverse health events and by facilitating prevention. MD2K will make its tools, software, and training materials widely available and will also organize workshops and seminars to encourage their use by researchers and clinicians.


Assuntos
Pesquisa Biomédica/instrumentação , Conjuntos de Dados como Assunto , Telemedicina/instrumentação , Telemetria , Sistemas de Informação Geográfica/instrumentação , Humanos , National Institutes of Health (U.S.) , Estados Unidos
19.
Proc Annu Int Conf Mob Comput Netw ; 2015: 400-412, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-27042165

RESUMO

The human eye offers a fascinating window into an individual's health, cognitive attention, and decision making, but we lack the ability to continually measure these parameters in the natural environment. The challenges lie in: a) handling the complexity of continuous high-rate sensing from a camera and processing the image stream to estimate eye parameters, and b) dealing with the wide variability in illumination conditions in the natural environment. This paper explores the power-robustness tradeoffs inherent in the design of a wearable eye tracker, and proposes a novel staged architecture that enables graceful adaptation across the spectrum of real-world illumination. We propose CIDER, a system that operates in a highly optimized low-power mode under indoor settings by using a fast Search-Refine controller to track the eye, but detects when the environment switches to more challenging outdoor sunlight and switches models to operate robustly under this condition. Our design is holistic and tackles a) power consumption in digitizing pixels, estimating pupillary parameters, and illuminating the eye via near-infrared, b) error in estimating pupil center and pupil dilation, and c) model training procedures that involve zero effort from a user. We demonstrate that CIDER can estimate pupil center with error less than two pixels (0.6°), and pupil diameter with error of one pixel (0.22mm). Our end-to-end results show that we can operate at power levels of roughly 7mW at a 4Hz eye tracking rate, or roughly 32mW at rates upwards of 250Hz.

20.
MobiSys ; 2014: 82-94, 2014 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-26539565

RESUMO

Continuous, real-time tracking of eye gaze is valuable in a variety of scenarios including hands-free interaction with the physical world, detection of unsafe behaviors, leveraging visual context for advertising, life logging, and others. While eye tracking is commonly used in clinical trials and user studies, it has not bridged the gap to everyday consumer use. The challenge is that a real-time eye tracker is a power-hungry and computation-intensive device which requires continuous sensing of the eye using an imager running at many tens of frames per second, and continuous processing of the image stream using sophisticated gaze estimation algorithms. Our key contribution is the design of an eye tracker that dramatically reduces the sensing and computation needs for eye tracking, thereby achieving orders of magnitude reductions in power consumption and form-factor. The key idea is that eye images are extremely redundant, therefore we can estimate gaze by using a small subset of carefully chosen pixels per frame. We instantiate this idea in a prototype hardware platform equipped with a low-power image sensor that provides random access to pixel values, a low-power ARM Cortex M3 microcontroller, and a bluetooth radio to communicate with a mobile phone. The sparse pixel-based gaze estimation algorithm is a multi-layer neural network learned using a state-of-the-art sparsity-inducing regularization function that minimizes the gaze prediction error while simultaneously minimizing the number of pixels used. Our results show that we can operate at roughly 70mW of power, while continuously estimating eye gaze at the rate of 30 Hz with errors of roughly 3 degrees.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...