18 Articles

Audio-Visual Action Recognition Using Transformer Fusion Network

Jun-Hwa Kim and Chee Sun Won

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 14 Num: 0 Par: 3 Year: 2024

Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open-Access Papers

Nils Hütten, Miguel Alves Gomes, Florian Hölken, Karlo Andricevic, Richard Meyes and Tobias Meisen

Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is ther...

Journal: Applied System Innovation Format: Electronic

Table of contents: Vol: 7 Num: 0 Par: 1 Year: 2024

LGViT: A Local and Global Vision Transformer with Dynamic Contextual Position Bias Using Overlapping Windows

Qian Zhou, Hua Zou and Huanhuan Wu

Vision Transformers (ViTs) have shown their superiority in various visual tasks for the capability of self-attention mechanisms to model long-range dependencies. Some recent works try to reduce the high cost of vision transformers by limiting the self-at...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 3 Year: 2023

Comparative Study for Patch-Level and Pixel-Level Segmentation of Deep Learning Methods on Transparent Images of Environmental Microorganisms: From Convolutional Neural Networks to Visual Transformers

Hechen Yang, Xin Zhao, Tao Jiang, Jinghua Zhang, Peng Zhao, Ao Chen, Marcin Grzegorzek, Shouliang Qi, Yueyang Teng and Chen Li

Currently, the field of transparent image analysis has gradually become a hot topic. However, traditional analysis methods are accompanied by large amounts of carbon emissions and consume significant manpower and material resources. The continuous deve...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 12 Num: 0 Par: 18 Year: 2022

DR-Transformer: A Multi-Features Fusion Framework for Tropical Cyclones Intensity Estimation

Yicheng Luo, Yajing Xu, Si Li, Qifeng Qian and Bo Xiao

Convolutional neural networks have achieved great success in analyzing potential features inside tropical cyclones (TCs) using satellite images for intensity estimation. However, due to the high similarity of visual features in TC images, it is still a c...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 11 Num: 0 Par: 13 Year: 2021

A Review of Transformer-Based Approaches for Image Captioning

Oscar Ondeng, Heywood Ouma and Peter Akuon

Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using visio...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 19 Year: 2023

A Moving Object Tracking Technique Using Few Frames with Feature Map Extraction and Feature Fusion

Abeer Abdulaziz AlArfaj and Hanan Ahmed Hosni Mahmoud

Moving object tracking techniques using machine and deep learning require large datasets for neural model training. New strategies need to be invented that utilize smaller data training sizes to realize the impact of large-sized datasets. However, curren...

Journal: ISPRS International Journal of Geo-Information Format: Electronic

Table of contents: Vol: 11 Num: 0 Par: 7 Year: 2022

Rethink Motion Information for Occluded Person Re-Identification

Hongye Liu and Xiai Chen

Person re-identification aims to identify the same pedestrians captured by various cameras from different viewpoints in multiple scenarios. Occlusion is the toughest problem for practical applications. In video-based ReID tasks, motion information can be...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 14 Num: 0 Par: 6 Year: 2024

A Deep-Learning-Based Multimodal Data Fusion Framework for Urban Region Function Recognition

Mingyang Yu, Haiqing Xu, Fangliang Zhou, Shuai Xu and Hongling Yin

Accurate and efficient classification maps of urban functional zones (UFZs) are crucial to urban planning, management, and decision making. Due to the complex socioeconomic UFZ properties, it is increasingly challenging to identify urban functional zones...

Journal: ISPRS International Journal of Geo-Information Format: Electronic

Table of contents: Vol: 12 Num: 0 Par: 12 Year: 2023

Swin–MRDB: Pan-Sharpening Model Based on the Swin Transformer and Multi-Scale CNN

Zifan Rong, Xuesong Jiang, Linfeng Huang and Hongping Zhou

Pan-sharpening aims to create high-resolution spectrum images by fusing low-resolution hyperspectral (HS) images with high-resolution panchromatic (PAN) images. Inspired by the Swin transformer used in image classification tasks, this research constructs...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 15 Year: 2023

Attention Mechanism Used in Monocular Depth Estimation: An Overview

Yundong Li, Xiaokun Wei and Hanlu Fan

Monocular depth estimation (MDE), as one of the fundamental tasks of computer vision, plays important roles in downstream applications such as virtual reality, 3D reconstruction, and robotic navigation. Convolutional neural networks (CNN)-based methods g...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 17 Year: 2023

Hybrid No-Reference Quality Assessment for Surveillance Images

Zhongchang Ye, Xin Ye and Zhonghua Zhao

Intelligent video surveillance (IVS) technology is widely used in various security systems. However, quality degradation in surveillance images (SIs) may affect its performance on vision-based tasks, leading to the difficulties in the IVS system extracti...

Journal: Information Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 12 Year: 2022

MCAW-YOLO: An Efficient Detection Model for Ceramic Tile Surface Defects

Xulong Yu, Qiancheng Yu, Qunyue Mu, Zhiyong Hu and Jincai Xie

Traditional manual visual detection methods are inefficient, subjective, and costly, making them prone to false and missed detections. Deep-learning-based defect detection identifies the types of defects and pinpoints their locations. By employing this a...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 21 Year: 2023

Detection of Bridge Damages by Image Processing Using the Deep Learning Transformer Model

Tomotaka Fukuoka and Makoto Fujiu

In Japan, bridges are inspected via close visual examinations every five years. However, these inspections are labor intensive, and a shortage of engineers and budget constraints will restrict such inspections in the future. In recent years, efforts have...

Journal: Buildings Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 3 Year: 2023

An Optimized Hybrid Transformer for Enhanced Ultra-Fine-Grained Thin Sections Categorization via Integrated Region-to-Region and Token-to-Token Approaches

Hongmei Zhang and Shuiqing Wang

The analysis of thin sections for lithology identification is a staple technique in geology. Although recent strides in deep learning have catalyzed the development of models for thin section recognition leveraging varied deep neural networks, there rema...

Journal: Applied Sciences Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 13 Year: 2023

Research on the Visual Perception of Ship Engine Rooms Based on Deep Learning

Yongkang Wang, Jundong Zhang, Jinting Zhu, Yuequn Ge and Guanyu Zhai

In the intelligent engine room, the visual perception of ship engine room equipment is the premise of defect identification and the replacement of manual operation. This paper improves YOLOv5 for the problems of mutual occlusion of cabin equipment, an un...

Journal: Journal of Marine Science and Engineering Format: Electronic

Table of contents: Vol: 11 Num: 0 Par: 7 Year: 2023

False Information Detection via Multimodal Feature Fusion and Multi-Classifier Hybrid Prediction

Yi Liang, Turdi Tohti and Askar Hamdulla

In the existing false information detection methods, the quality of the extracted single-modality features is low, the information between different modalities cannot be fully fused, and the original information will be lost when the information of diffe...

Journal: Algorithms Format: Electronic

Table of contents: Vol: 15 Num: 0 Par: 4 Year: 2022

Object Detection of Road Assets Using Transformer-Based YOLOX with Feature Pyramid Decoder on Thai Highway Panorama

Teerapong Panboonyuen, Sittinun Thongbai, Weerachai Wongweeranimit, Phisan Santitamnont, Kittiwan Suphan and Chaiyut Charoenphon

Due to the various sizes of each object, such as kilometer stones, detection is still a challenge, and it directly impacts the accuracy of these object counts. Transformers have demonstrated impressive results in various natural language processing (NLP)...

Journal: Information Format: Electronic

Table of contents: Vol: 13 Num: 0 Par: 1 Year: 2022
