Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
IEEE Trans Neural Netw Learn Syst ; 34(11): 8566-8578, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-35226610

RESUMO

Mesh is a type of data structure commonly used for 3-D shapes. Representation learning for 3-D meshes is essential in many computer vision and graphics applications. The recent success of convolutional neural networks (CNNs) for structured data (e.g., images) suggests the value of adapting insights from CNN for 3-D shapes. However, 3-D shape data are irregular since each node's neighbors are unordered. Various graph neural networks for 3-D shapes have been developed with isotropic filters or predefined local coordinate systems to overcome the node inconsistency on graphs. However, isotropic filters or predefined local coordinate systems limit the representation power. In this article, we propose a local structure-aware anisotropic convolutional operation (LSA-Conv) that learns adaptive weighting matrices for each template's node according to its neighboring structure and performs shared anisotropic filters. In fact, the learnable weighting matrix is similar to the attention matrix in the random synthesizer-a new Transformer model for natural language processing (NLP). Since the learnable weighting matrices require large amounts of parameters for high-resolution 3-D shapes, we introduce a matrix factorization technique to notably reduce the parameter size, denoted as LSA-small. Furthermore, a residual connection with a linear transformation is introduced to improve the performance of our LSA-Conv. Comprehensive experiments demonstrate that our model produces significant improvement in 3-D shape reconstruction compared to state-of-the-art methods.

2.
IEEE Trans Cybern ; 52(7): 7094-7106, 2022 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-33315574

RESUMO

In the era of multimedia and Internet, the quick response (QR) code helps people obtain information from offline to online quickly. However, the QR code is often limited in many scenarios because of its random and dull appearance. Therefore, this article proposes a novel approach to embed hyperlinks into common images, making the hyperlinks invisible for human eyes but detectable for mobile devices equipped with a camera. Our approach is an end-to-end neural network with an encoder to hide messages and a decoder to extract messages. To maintain the hidden message resilient to cameras, we build a distortion network between the encoder and the decoder to augment the encoded images. The distortion network uses differentiable 3-D rendering operations, which can simulate the distortion introduced by camera imaging in both printing and display scenarios. To maintain the visual attraction of the image with hyperlinks, a loss function conforming to the human visual system (HVS) is used to supervise the training of the encoder. Experimental results show that the proposed approach outperforms the previous work on both robustness and quality. Based on the proposed approach, many applications become possible, for example, "image hyperlinks" for advertisement on TV, website, or poster, and "invisible watermark" for copyright protection on digital resources or product packagings.


Assuntos
Redes Neurais de Computação , Humanos
3.
Int J Comput Assist Radiol Surg ; 16(2): 323-330, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-33146848

RESUMO

PURPOSE: Thyroid-associated ophthalmopathy (TAO) might lead to blindness and orbital deformity. The early diagnosis and treatment are conducive to control disease progression, but currently, there is no effective screening method. The present study aimed to introduce an artificial intelligence (AI) model for screening and testing the model with TAO patients under clinical conditions. METHODS: A total of 1435 computed tomography (CT) scans were obtained from the hospital. These CT scans were preprocessed by resampling and extracting the region of interest. CT from 193 TAO patients and 715 healthy individuals were adopted for three-dimensional (3D)-ResNet model training, and 49 TAO patients and 178 healthy people were adopted for external verification. Data from 150 TAO patients and 150 healthy people were utilized for application tests under clinical conditions, including non-inferiority experiments and diagnostic tests, respectively. RESULTS: In the external verification of the model, the area under the receiver operating characteristic (ROC) curve (AUC) was 0.919, indicating a satisfactory classification effect. The accuracy, sensitivity, and specificity were 0.87, 088, and 0.85, respectively. In non-inferiority experiments: the accuracy was 85.67% in the AI group and 84.33% in the resident group. The model passed both non-inferiority experiments (p = 0.001) and diagnostic test (the AI group sensitivity = 0.87 and specificity = 0.84%). CONCLUSIONS: A promising orbital CT-based TAO screening AI model was established and passed application tests under clinical conditions. This may provide a new TAO screening tool with further validation.


Assuntos
Inteligência Artificial , Oftalmopatia de Graves/diagnóstico por imagem , Órbita/diagnóstico por imagem , Progressão da Doença , Oftalmopatia de Graves/diagnóstico , Humanos , Rede Nervosa , Tomografia Computadorizada por Raios X/métodos
4.
PLoS One ; 15(10): e0240661, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33057363

RESUMO

Stereoscopic 3D (S3D) geometric distortions can be introduced by mismatches among image capture, display, and viewing configurations. In previous work of S3D geometric models, geometric distortions have been analyzed from a third-person perspective based on the binocular depth cue (i.e., binocular disparity). A third-person perspective is different from what the viewer sees since monocular depth cues (e.g., linear perspective, occlusion, and shadows) from different perspectives are different. However, depth perception in a 3D space involves both monocular and binocular depth cues. Geometric distortions that are solely predicted by the binocular depth cue cannot describe what a viewer really perceives. In this paper, we combine geometric models and retinal disparity models to analyze geometric distortions from the viewer's perspective where both monocular and binocular depth cues are considered. Results show that binocular and monocular depth-cue conflicts in a geometrically distorted S3D space. Moreover, user-initiated head translations averting from the optimal viewing position in conventional S3D displays can also introduce geometric distortions, which are inconsistent with our natural 3D viewing condition. The inconsistency of depth cues in a dynamic scene may be a source of visually induced motions sickness.


Assuntos
Percepção de Profundidade/fisiologia , Fixação Ocular/fisiologia , Humanos , Modelos Biológicos , Retina/fisiologia , Visão Monocular/fisiologia
5.
IEEE Sens J ; 20(22): 13674-13681, 2020 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-37974650

RESUMO

Coronavirus Disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronaviruses 2 (SARS-CoV-2) has become a serious global pandemic in the past few months and caused huge loss to human society worldwide. For such a large-scale pandemic, early detection and isolation of potential virus carriers is essential to curb the spread of the pandemic. Recent studies have shown that one important feature of COVID-19 is the abnormal respiratory status caused by viral infections. During the pandemic, many people tend to wear masks to reduce the risk of getting sick. Therefore, in this paper, we propose a portable non-contact method to screen the health conditions of people wearing masks through analysis of the respiratory characteristics from RGB-infrared sensors. We first accomplish a respiratory data capture technique for people wearing masks by using face recognition. Then, a bidirectional GRU neural network with an attention mechanism is applied to the respiratory data to obtain the health screening result. The results of validation experiments show that our model can identify the health status of respiratory with 83.69% accuracy, 90.23% sensitivity and 76.31% specificity on the real-world dataset. This work demonstrates that the proposed RGB-infrared sensors on portable device can be used as a pre-scan method for respiratory infections, which provides a theoretical basis to encourage controlled clinical trials and thus helps fight the current COVID-19 pandemic. The demo videos of the proposed system are available at: https://doi.org/10.6084/m9.figshare.12028032.

6.
Biomed Eng Online ; 18(1): 111, 2019 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-31729983

RESUMO

BACKGROUND: Head-mounted displays (HMDs) and virtual reality (VR) have been frequently used in recent years, and a user's experience and computation efficiency could be assessed by mounting eye-trackers. However, in addition to visually induced motion sickness (VIMS), eye fatigue has increasingly emerged during and after the viewing experience, highlighting the necessity of quantitatively assessment of the detrimental effects. As no measurement method for the eye fatigue caused by HMDs has been widely accepted, we detected parameters related to optometry test. We proposed a novel computational approach for estimation of eye fatigue by providing various verifiable models. RESULTS: We implemented three classifications and two regressions to investigate different feature sets, which led to present two valid assessment models for eye fatigue by employing blinking features and eye movement features with the ground truth of indicators for optometry test. Three graded results and one continuous result were provided by each model, respectively, which caused the whole result to be repeatable and comparable. CONCLUSION: We showed differences between VIMS and eye fatigue, and we also presented a new scheme to assess eye fatigue of HMDs users by analysis of parameters of the eye tracker.


Assuntos
Astenopia/diagnóstico , Movimentos Oculares , Cabeça , Adulto , Astenopia/fisiopatologia , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Adulto Jovem
7.
PLoS One ; 13(10): e0205032, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30296289

RESUMO

Motion in a distorted virtual 3D space may cause visually induced motion sickness. Geometric distortions in stereoscopic 3D can result from mismatches among image capture, display, and viewing parameters. Three pairs of potential mismatches are considered, including 1) camera separation vs. eye separation, 2) camera field of view (FOV) vs. screen FOV, and 3) camera convergence distance (i.e., distance from the cameras to the point where the convergence axes intersect) vs. screen distance from the observer. The effect of the viewer's head positions (i.e., head lateral offset from the screen center) is also considered. The geometric model is expressed as a function of camera convergence distance, the ratios of the three parameter-pairs, and the offset of the head position. We analyze the impacts of these five variables separately and their interactions on geometric distortions. This model facilitates insights into the various distortions and leads to methods whereby the user can minimize geometric distortions caused by some parameter-pair mismatches through adjusting of other parameter pairs. For example, in postproduction, viewers can correct for a mismatch between camera separation and eye separation by adjusting their distance from the real screen and changing the effective camera convergence distance.


Assuntos
Imageamento Tridimensional/métodos , Movimento (Física) , Algoritmos , Humanos , Fenômenos Fisiológicos Oculares , Fotografação
8.
Iperception ; 8(4): 2041669517723929, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28835815

RESUMO

In Hwang and Peli (2014), few errors occurred in computing the angular disparities. The direction of peripheral depth distortion (the angular disparity differences between what it is in real-world 3D viewing and S3D viewing) is reversed when the computational errors were corrected, making the perception of the peripheral depth to be expanded, not compressed. This reply points to the error and provides the corrected figures. Correcting these errors does not affect the general conclusion that S3D viewed on single screen display induces peripheral depth distortion which may be a cause of visually induced motion sickness.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...