Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 580
Filtrar
1.
PeerJ Comput Sci ; 10: e2083, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38983190

RESUMO

Aiming to automatically monitor and improve stereoscopic image and video processing systems, stereoscopic image quality assessment approaches are becoming more and more important as 3D technology gains popularity. We propose a full-reference stereoscopic image quality assessment method that incorporate monocular and binocular features based on binocular competition and binocular integration. To start, we create a three-channel RGB fused view by fusing Gabor filter bank responses and disparity maps. Then, using the monocular view and the RGB fusion view, respectively, we extract monocular and binocular features. To alter the local features in the binocular features, we simultaneously estimate the saliency of the RGB fusion image. Finally, the monocular and binocular quality scores are calculated based on the monocular and binocular features, and the quality scores of the stereo image prediction are obtained by fusion. Performance testing in the LIVE 3D IQA database Phase I and Phase II. The results of the proposed method are compared with newer methods. The experimental results show good consistency and robustness.

2.
Sci Rep ; 14(1): 15901, 2024 Jul 10.
Artigo em Inglês | MEDLINE | ID: mdl-38987266

RESUMO

The rapid development of the logistics industry has driven innovations in parcel sorting technology, among which the swift and precise positioning and classification of parcels have become key to enhancing the performance of logistics systems. This study aims to address the limitations of traditional light curtain positioning methods in logistics sorting workshops amidst high-speed upgrades. This paper proposes a high-speed classification and location algorithm for logistics parcels utilizing a monocular camera. The algorithm combines traditional visual processing methods with an enhanced version of the lightweight YOLOv5 object detection algorithm, achieving high-speed, high-precision parcel positioning. Through the adjustment of the network structure and the incorporation of new feature extraction modules and ECIOU loss functions, the model's robustness and detection accuracy have been significantly improved. Experimental results demonstrate that this model exhibits outstanding performance on a customized logistics parcel dataset, notably enhancing the model's parameter efficiency and computational speed, thereby offering an effective solution for industrial applications in high-speed logistics supply.

3.
Vision Res ; 222: 108449, 2024 Jun 22.
Artigo em Inglês | MEDLINE | ID: mdl-38909478

RESUMO

Short-term monocular deprivation in normally sighted adult humans produces a transient shift of ocular dominance, boosting the deprived eye. This effect has been documented with both perceptual tests and through physiological recordings, but no previous study simultaneously measured physiological responses and the perceptual effects of deprivation. Here we propose an integrated experimental paradigm that combines binocular rivalry with pupillometry, to introduce an objective physiological index of ocular dominance plasticity, acquired concurrently with perceptual testing. Ten participants reported the perceptual dynamics of binocular rivalry, while we measured pupil diameter. Stimuli were a white and a black disk, each presented monocularly. Rivalry dynamics and pupil-size traces were compared before and after 2 h of monocular deprivation, achieved by applying a translucent patch over the dominant eye. Consistent with prior research, we observed that monocular deprivation boosts the deprived-eye signal and consequently increases ocular dominance. In line with previous studies, we also observed subtle but systematic modulations of pupil size that tracked alternations between exclusive dominance phases of the black or white disk. Following monocular deprivation, the amplitude of these pupil-size modulations increased, which is consistent with the post-deprivation boost of the deprived eye and the increase of ocular dominance. This provides evidence that deprivation impacts the effective strength of monocular visual stimuli, coherently affecting perceptual reports and the automatic and unconscious regulation of pupil diameter. Our results show that a combined paradigm of binocular rivalry and pupillometry gives new insights into the physiological mechanisms underlying deprivation effects.

4.
Sensors (Basel) ; 24(12)2024 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-38931606

RESUMO

Human pose estimation (HPE) is a technique used in computer vision and artificial intelligence to detect and track human body parts and poses using images or videos. Widely used in augmented reality, animation, fitness applications, and surveillance, HPE methods that employ monocular cameras are highly versatile and applicable to standard videos and CCTV footage. These methods have evolved from two-dimensional (2D) to three-dimensional (3D) pose estimation. However, in real-world environments, current 3D HPE methods trained on laboratory-based motion capture data encounter challenges, such as limited training data, depth ambiguity, left/right switching, and issues with occlusions. In this study, four 3D HPE methods were compared based on their strengths and weaknesses using real-world videos. Joint position correction techniques were proposed to eliminate and correct anomalies such as left/right inversion and false detections of joint positions in daily life motions. Joint angle trajectories were obtained for intuitive and informative human activity recognition using an optimization method based on a 3D humanoid simulator, with the joint position corrected by the proposed technique as the input. The efficacy of the proposed method was verified by applying it to three types of freehand gymnastic exercises and comparing the joint angle trajectories during motion.


Assuntos
Aprendizado Profundo , Articulações , Postura , Humanos , Postura/fisiologia , Articulações/fisiologia , Imageamento Tridimensional/métodos , Algoritmos , Movimento/fisiologia , Gravação em Vídeo/métodos
5.
Artigo em Inglês | MEDLINE | ID: mdl-38822980

RESUMO

PURPOSE: Surgical robots effectively improve the accuracy and safety of surgical procedures. Current optical-navigated oral surgical robots are typically developed based on binocular vision positioning systems, which are susceptible to factors including obscured visibility, limited workplace, and ambient light interference. Hence, the purpose of this study was to develop a lightweight robotic platform based on monocular vision for oral surgery that enhances the precision and efficiency of surgical procedures. METHODS: A monocular optical positioning system (MOPS) was applied to oral surgical robots, and a semi-autonomous robotic platform was developed utilizing monocular vision. A series of vitro experiments were designed to simulate dental implant procedures to evaluate the performance of optical positioning systems and assess the robotic system accuracy. The singular configuration detection and avoidance test, the collision detection and processing test, and the drilling test under slight movement were conducted to validate the safety of the robotic system. RESULTS: The position error and rotation error of MOPS were 0.0906 ± 0.0762 mm and 0.0158 ± 0.0069 degrees, respectively. The attitude angle of robotic arms calculated by the forward and inverse solutions was accurate. Additionally, the robot's surgical calibration point exhibited an average error of 0.42 mm, with a maximum error of 0.57 mm. Meanwhile, the robot system was capable of effectively avoiding singularities and demonstrating robust safety measures in the presence of minor patient movements and collisions during vitro experiment procedures. CONCLUSION: The results of this in vitro study demonstrate that the accuracy of MOPS meets clinical requirements, making it a promising alternative in the field of oral surgical robots. Further studies will be planned to make the monocular vision oral robot suitable for clinical application.

6.
J Anxiety Disord ; 104: 102871, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38723406

RESUMO

Individuals with social anxiety often exhibit atypical processing of facial expressions. Previous research in social anxiety has primarily emphasized cognitive bias associated with face processing and the corresponding abnormalities in cortico-limbic circuitry, yet whether social anxiety influences early perceptual processing of emotional faces remains largely unknown. We used a psychophysical method to investigate the monocular advantage for face perception (i.e., face stimuli are better recognized when presented to the same eye compared to different eyes), an effect that is indicative of early, subcortical processing of face stimuli. We compared the monocular advantage for different emotional expressions (neutral, angry and sad) in three groups (N = 24 per group): individuals clinically diagnosed with social anxiety disorder (SAD), individuals with high social anxiety in subclinical populations (SSA), and a healthy control (HC) group of individuals matched for age and gender. Compared to SSA and HC groups, we found that individuals with SAD exhibited a greater monocular advantage when processing neutral and sad faces. While the magnitudes of monocular advantages were similar across three groups when processing angry faces, individuals with SAD performed better in this condition when the faces were presented to different eye. The former findings suggest that social anxiety leads to an enhanced role of subcortical structures in processing nonthreatening expressions. The latter findings, on the other hand, likely reflect an enhanced cortical processing of threatening expressions in SAD group. These distinct patterns of monocular advantage indicate that social anxiety altered representation of emotional faces at various stages of information processing, starting at an early stage of the visual system.


Assuntos
Emoções , Expressão Facial , Reconhecimento Facial , Fobia Social , Humanos , Feminino , Masculino , Adulto , Emoções/fisiologia , Fobia Social/fisiopatologia , Fobia Social/psicologia , Reconhecimento Facial/fisiologia , Adulto Jovem
7.
Int J Comput Assist Radiol Surg ; 19(7): 1375-1383, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38771418

RESUMO

PURPOSE: Intraoperative reconstruction of endoscopic scenes is a key technology for surgical navigation systems. The accuracy and efficiency of 3D reconstruction directly determine the effectiveness of navigation systems in a variety of clinical applications. While current deformable SLAM algorithms can meet real-time requirements, their underlying reliance on regular templates still makes it challenging to efficiently capture abrupt geometric features within scenes, such as organ contours and surgical margins. METHODS: We propose a novel real-time monocular deformable SLAM algorithm with geometrically adapted template. To ensure real-time performance, the proposed algorithm consists of two threads: a deformation mapping thread updates the template at keyframe rate and a deformation tracking thread estimates the camera pose and the deformation at frame rate. To capture geometric features more efficiently, the algorithm first detects salient edge features using a pre-trained contour detection network and then constructs the template through a triangulation method with guidance of the salient features. RESULTS: We thoroughly evaluated this method on Mandala and Hamlyn datasets in terms of accuracy and performance. The results demonstrated that the proposed method achieves better accuracy with 0.75-7.95% improvement and achieves consistent effectiveness in data association compared with the closest method. CONCLUSION: This study verified an adaptive template does improve the performance of reconstruction of dynamic laparoscopic Scenes with abrupt geometric features. However, further exploration is needed for applications in laparoscopic surgery with incisal margins caused by surgical instruments. This research serves as a crucial step toward enhanced automatic computer-assisted navigation in laparoscopic surgery. Code is available at https://github.com/Tang257/SLAM-with-geometrically-adapted-template .


Assuntos
Algoritmos , Imageamento Tridimensional , Laparoscopia , Humanos , Laparoscopia/métodos , Imageamento Tridimensional/métodos , Cirurgia Assistida por Computador/métodos
8.
Int J Comput Assist Radiol Surg ; 19(7): 1259-1266, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38775904

RESUMO

PURPOSE: Monocular SLAM algorithms are the key enabling technology for image-based surgical navigation systems for endoscopic procedures. Due to the visual feature scarcity and unique lighting conditions encountered in endoscopy, classical SLAM approaches perform inconsistently. Many of the recent approaches to endoscopic SLAM rely on deep learning models. They show promising results when optimized on singular domains such as arthroscopy, sinus endoscopy, colonoscopy or laparoscopy, but are limited by an inability to generalize to different domains without retraining. METHODS: To address this generality issue, we propose OneSLAM a monocular SLAM algorithm for surgical endoscopy that works out of the box for several endoscopic domains, including sinus endoscopy, colonoscopy, arthroscopy and laparoscopy. Our pipeline builds upon robust tracking any point (TAP) foundation models to reliably track sparse correspondences across multiple frames and runs local bundle adjustment to jointly optimize camera poses and a sparse 3D reconstruction of the anatomy. RESULTS: We compare the performance of our method against three strong baselines previously proposed for monocular SLAM in endoscopy and general scenes. OneSLAM presents better or comparable performance over existing approaches targeted to that specific data in all four tested domains, generalizing across domains without the need for retraining. CONCLUSION: OneSLAM benefits from the convincing performance of TAP foundation models but generalizes to endoscopic sequences of different anatomies all while demonstrating better or comparable performance over domain-specific SLAM approaches. Future research on global loop closure will investigate how to reliably detect loops in endoscopic scenes to reduce accumulated drift and enhance long-term navigation capabilities.


Assuntos
Algoritmos , Endoscopia , Humanos , Endoscopia/métodos , Imageamento Tridimensional/métodos , Cirurgia Assistida por Computador/métodos , Processamento de Imagem Assistida por Computador/métodos
9.
Artigo em Inglês | MEDLINE | ID: mdl-38705922

RESUMO

PURPOSE: The utilization of image-guided surgery has demonstrated its ability to improve the precision and safety of minimally invasive surgery (MIS). Non-rigid scene reconstruction is a challenge in image-guided system duo to uniform texture, smoke, and instrument occlusion, etc. METHODS: In this paper, we introduced an algorithm for 3D reconstruction aimed at non-rigid surgery scenes. The proposed method comprises two main components: firstly, the front-end process involves the initial reconstruction of 3D information for deformable soft tissues using embedded deformation graph (EDG) on the basis of dual quaternions, enabling the reconstruction without the need for prior knowledge of the target. Secondly, the EDG is integrated with isometric nonrigid structure from motion (Iso-NRSFM) to facilitate centralized optimization of the observed map points and camera motion across different time instances in deformable scenes. RESULTS: For the quantitative evaluation of the proposed method, we conducted comparative experiments with both synthetic datasets and publicly available datasets against the state-of-the-art 3D reconstruction method, DefSLAM. The test results show that our proposed method achieved a maximum reduction of 1.6 mm in average reconstruction error compared to method DefSLAM across all datasets. Additionally, qualitative experiments were performed on video scene datasets involving surgical instrument occlusions. CONCLUSION: Our method proved to outperform DefSLAM on both synthetic datasets and public datasets through experiments, demonstrating its robustness and accuracy in the reconstruction of soft tissues in dynamic surgical scenes. This success highlights the potential clinical application of our method in delivering surgeons with critical shape and depth information for MIS.

10.
Sci Rep ; 14(1): 10657, 2024 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-38724514

RESUMO

As industries develop, the automation and intelligence level of power plants is constantly improving, and the application of patrol robots is also increasingly widespread. This research combines computer vision technology and particle swarm optimization algorithm to build an obstacle recognition model and obstacle avoidance model of an intelligent patrol robot in a power plant respectively. Firstly, the traditional convolutional recurrent neural network is optimized, and the obstacle recognition model of an intelligent patrol robot is built by combining the connection timing classification algorithm. Then, the artificial potential field method optimizes the traditional particle swarm optimization algorithm, and an obstacle avoidance model of an intelligent patrol robot is built. The performance of the two models was tested, and it was found that the highest precision, recall, and F1 values of the identification model were 0.978, 0.974, and 0.975. The highest precision, recall, and F1 values of the obstacle avoidance model were 0.97, 0.96, and 0.96 respectively. The two optimization models designed in this research have better performance. In conclusion, the two models in this study are superior to the traditional methods in recognition effect and obstacle avoidance efficiency, providing an effective technical scheme for intelligent patrol inspection of power plants.

11.
Semin Diagn Pathol ; 2024 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-38627186

RESUMO

Microscopes, more than any other instrument, reflect advances in clinical medicine over the past several hundred years. As the primary tool of the pathologist, they were, and continue to be, a key connector between the bedside and basic sciences. One specific example is the science of clinical dermatology, which relies on clinical-pathologic correlation to make a definitive diagnosis. The microscopes used by pathologists, however, are more than scientific artifacts. Many antique microscopes are hand-crafted works of art. Even while recognizing that light microscopes may soon be obsolete as scanned slides and computer joy-sticks replace optical instruments in patient care and teaching, their significance will not be diminished. The microscope will never be forgotten in the history, art, and science of medicine, for these instruments set the social and cultural stage for modern, scientific patient care.

12.
Ophthalmol Ther ; 13(6): 1417-1425, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38587773

RESUMO

Transient monocular visual loss (TMVL), also known as transient monocular blindness or amaurosis fugax ("fleeting blindness"), is a temporary loss of vision often due to ischemia to the retina. While acute TMVL should be considered an emergency that further requires exhaustive investigation, there are some cases in which TMVL arises secondary to benign causes. Age has a major impact in the diagnosis of ischemia and although the differential diagnosis of TMVL can be broad, timely and appropriate history, examination, diagnostic testing, and treatment can be vision- or life-saving. We review the causes of TMVL and the impact of age on the differential diagnoses and management.

13.
Front Neurosci ; 18: 1265966, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38686329

RESUMO

Human subjects were tested in perception of shapes of 3D objects. The subjects reconstructed 3D shapes by viewing orthographic and perspective images. Perception of natural shapes was very close to veridical and was clearly better than perception of random symmetrical polyhedra. Viewing perspective images led to only slightly better performance than viewing orthographic images. In order to account for subjects' performance, we elaborated the previous computational models of 3D shape reconstruction. The previous models used as constraints mirror-symmetry and 3D compactness. The critical additional constraint was the use of a secondary mirror-symmetry that exists in most natural shapes. It is known that two planes of mirror symmetry are sufficient for a unique and veridical shape reconstruction. We also generalized the model so that it applies to both orthographic and perspective images. The results of our experiment suggest that the human visual system uses two planes of symmetry in addition to two forms of 3D compactness. Performance of the new model was highly correlated with subjects' performance with both orthographic and perspective images, which supports the claim that the most important 3D shape constraints that are used by the human visual system have been identified.

14.
Front Immunol ; 15: 1243526, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38596669

RESUMO

Background: Circulating immune cells have gained interest as biomarkers of hepatic steatosis. Data on the relationships between immune cell subsets and early-stage steatosis in population-based cohorts are limited. Methods: This study included 1,944 asymptomatic participants of the Multi-Ethnic Study of Atherosclerosis (MESA) with immune cell phenotyping and computed tomography measures of liver fat. Participants with heavy alcohol use were excluded. A liver-to-spleen ratio Hounsfield units (HU) <1.0 and liver attenuation <40 HU were used to diagnose liver fat presence and >30% liver fat content, respectively. Logistic regression estimated cross-sectional associations of immune cell subsets with liver fat parameters adjusted for risk factors. We hypothesized that higher proportions of non-classical monocytes, Th1, Th17, and memory CD4+ T cells, and lower proportions of classical monocytes and naive CD4+ T cells, were associated with liver fat. Exploratory analyses evaluated additional immune cell phenotypes (n = 19). Results: None of the hypothesized cells were associated with presence of liver fat. Higher memory CD4+ T cells were associated with >30% liver fat content, but this was not significant after correction for multiple hypothesis testing (odds ratio (OR): 1.31, 95% confidence interval (CI): 1.03, 1.66). In exploratory analyses unadjusted for multiple testing, higher proportions of CD8+CD57+ T cells were associated with liver fat presence (OR: 1.21, 95% CI: 1.02, 1.44) and >30% liver fat content (OR: 1.34, 95% CI: 1.07, 1.69). Conclusions: Higher circulating memory CD4+ T cells may reflect liver fat severity. CD8+CD57+ cells were associated with liver fat presence and severity, but replication of findings is required.


Assuntos
Aterosclerose , Fígado Gorduroso , Humanos , Monócitos , Estudos Transversais , Fígado Gorduroso/diagnóstico , Subpopulações de Linfócitos T , Biomarcadores
15.
J Neurosci ; 44(19)2024 May 08.
Artigo em Inglês | MEDLINE | ID: mdl-38538145

RESUMO

A classic example of experience-dependent plasticity is ocular dominance (OD) shift, in which the responsiveness of neurons in the visual cortex is profoundly altered following monocular deprivation (MD). It has been postulated that OD shifts also modify global neural networks, but such effects have never been demonstrated. Here, we use wide-field fluorescence optical imaging (WFOI) to characterize calcium-based resting-state functional connectivity during acute (3 d) MD in female and male mice with genetically encoded calcium indicators (Thy1-GCaMP6f). We first establish the fundamental performance of WFOI by computing signal to noise properties throughout our data processing pipeline. Following MD, we found that Δ band (0.4-4 Hz) GCaMP6 activity in the deprived visual cortex decreased, suggesting that excitatory activity in this region was reduced by MD. In addition, interhemispheric visual homotopic functional connectivity decreased following MD, which was accompanied by a reduction in parietal and motor homotopic connectivity. Finally, we observed enhanced internetwork connectivity between the visual and parietal cortex that peaked 2 d after MD. Together, these findings support the hypothesis that early MD induces dynamic reorganization of disparate functional networks including the association cortices.


Assuntos
Camundongos Endogâmicos C57BL , Rede Nervosa , Privação Sensorial , Córtex Visual , Animais , Camundongos , Masculino , Feminino , Privação Sensorial/fisiologia , Córtex Visual/fisiologia , Rede Nervosa/fisiologia , Plasticidade Neuronal/fisiologia , Dominância Ocular/fisiologia , Período Crítico Psicológico , Vias Visuais/fisiologia
16.
Int Ophthalmol ; 44(1): 162, 2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38538983

RESUMO

PURPOSE: We aimed to identify ocular comorbidities and reasons of blindness in monocular patients and to compare visual outcomes of cataract surgery between monocular and binocular patients. METHODS: A single-center case-control study was conducted between November 2011 and May 2019 to compare consecutive series of patients needing cataract surgery in Strasbourg University Hospitals, France. Cases were patients with permanent monocular vision loss. Controls were binocularly sighted patients. All patients underwent cataract surgery using phacoemulsification technique. Chart analysis included demographic data, medical history, and surgical determinants data. Student's t tests and Fisher's exact tests were the main methods used for statistical analysis. RESULTS: Each group included 80 patients. The mean age at the time of surgery was significantly higher in monocular than binocular patients (77 vs. 71 years, p < 0.001). Thirty-two monocular patients (40%) had ocular comorbidities, compared to only 19 (23%) in the control group (p < 0.05). The leading cause of monocular status was amblyopia caused by strabismus (22 patients, 27.5%). Age-related macular degeneration, open-angle glaucoma, and diabetic retinopathy were the three main ocular comorbidities that were observed in the monocular group. Monocular patients had significantly lower visual acuity than the control group (p < 0.01) before and after cataract surgery. Conversely, improvement in visual acuity after surgery was not statistically different between groups (p = 0.054). There was no statistically significant difference in the rate of surgical complications between groups (p = 0.622). CONCLUSIONS: This study illustrates that cataract surgery in monocular patients is not more complicated than in binocular patients, but that it is significantly delayed.


Assuntos
Extração de Catarata , Catarata , Glaucoma de Ângulo Aberto , Facoemulsificação , Humanos , Estudos de Casos e Controles , Catarata/complicações , Glaucoma de Ângulo Aberto/complicações , Resultado do Tratamento , Cegueira , Visão Binocular
17.
Sensors (Basel) ; 24(6)2024 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-38544229

RESUMO

This study addresses the ongoing challenge for learning-based methods to achieve accurate object detection in foggy conditions. In response to the scarcity of foggy traffic image datasets, we propose a foggy weather simulation algorithm based on monocular depth estimation. The algorithm involves a multi-step process: a self-supervised monocular depth estimation network generates a relative depth map and then applies dense geometric constraints for scale recovery to derive an absolute depth map. Subsequently, the visibility of the simulated image is defined to generate a transmittance map. The dark channel map is then used to distinguish sky regions and estimate atmospheric light values. Finally, the atmospheric scattering model is used to generate fog simulation images under specified visibility conditions. Experimental results show that more than 90% of fog images have AuthESI values of less than 2, which indicates that their non-structural similarity (NSS) characteristics are very close to those of natural fog. The proposed fog simulation method is able to convert clear images in natural environments, providing a solution to the problem of lack of foggy image datasets and incomplete visibility data.

18.
Strabismus ; 32(2): 108-114, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38516834

RESUMO

INTRODUCTION: To study the rare and unusual causes of monocular elevation deficit. METHODS: Five patients presenting to us with diplopia and elevation deficit were thoroughly examined and were found to have monocular elevation deficit due to rare causes. OBSERVATIONS: All five were found to have different underlying etiologies - iatrogenic, sphenoid wing meningioma, cysticercosis, sarcoidosis and mid brain infarct, and were managed appropriately. DISCUSSION: Monocular Elevation Deficit can occur due to a variety of causes. Having a high index of suspicion for the more serious etiologies is of utmost importance. Thorough clinical examination and imaging help clinch the diagnosis.


Assuntos
Diplopia , Meningioma , Humanos , Feminino , Meningioma/complicações , Masculino , Pessoa de Meia-Idade , Diplopia/etiologia , Diplopia/fisiopatologia , Diplopia/diagnóstico , Adulto , Neoplasias Meníngeas/complicações , Sarcoidose/complicações , Sarcoidose/diagnóstico , Sarcoidose/fisiopatologia , Cisticercose/complicações , Cisticercose/diagnóstico , Cisticercose/fisiopatologia , Doença Iatrogênica , Infarto Encefálico/complicações , Infarto Encefálico/diagnóstico por imagem , Infarto Encefálico/fisiopatologia , Idoso , Músculos Oculomotores/fisiopatologia , Transtornos da Motilidade Ocular/fisiopatologia , Transtornos da Motilidade Ocular/etiologia , Transtornos da Motilidade Ocular/diagnóstico , Imageamento por Ressonância Magnética , Visão Monocular/fisiologia , Osso Esfenoide
19.
Sci Rep ; 14(1): 5868, 2024 Mar 11.
Artigo em Inglês | MEDLINE | ID: mdl-38467677

RESUMO

Monocular depth estimation has a wide range of applications in the field of autostereoscopic displays, while accuracy and robustness in complex scenes are still a challenge. In this paper, we propose a depth estimation network for autostereoscopic displays, which aims at improving the accuracy of monocular depth estimation by fusing Vision Transformer (ViT) and Convolutional Neural Network (CNN). Our approach feeds the input image as a sequence of visual features into the ViT module and utilizes its global perception capability to extract high-level semantic features of the image. The relationship between the losses is quantified by adding a weight correction module to improve robustness of the model. Experimental evaluation results on several public datasets show that AMENet exhibits higher accuracy and robustness than existing methods in different scenarios and complex conditions. In addition, a detailed experimental analysis was conducted to verify the effectiveness and stability of our method. The accuracy improvement on the KITTI dataset compared to the baseline method is 4.4%. In summary, AMENet is a promising depth estimation method with sufficient high robustness and accuracy for monocular depth estimation tasks.

20.
Sensors (Basel) ; 24(5)2024 Feb 29.
Artigo em Inglês | MEDLINE | ID: mdl-38475124

RESUMO

The lack of discernible vehicle contour features in low-light conditions poses a formidable challenge for nighttime vehicle detection under hardware cost constraints. Addressing this issue, an enhanced histogram of oriented gradients (HOGs) approach is introduced to extract relevant vehicle features. Initially, vehicle lights are extracted using a combination of background illumination removal and a saliency model. Subsequently, these lights are integrated with a template-based approach to delineate regions containing potential vehicles. In the next step, the fusion of superpixel and HOG (S-HOG) features within these regions is performed, and the support vector machine (SVM) is employed for classification. A non-maximum suppression (NMS) method is applied to eliminate overlapping areas, incorporating the fusion of vertical histograms of symmetrical features of oriented gradients (V-HOGs). Finally, the Kalman filter is utilized for tracking candidate vehicles over time. Experimental results demonstrate a significant improvement in the accuracy of vehicle recognition in nighttime scenarios with the proposed method.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...