FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis Mikel Williams-Lekuona, Georgina Cosma Regular Paper Open access 26 May 2025 Article: 20
Human behavior recognition based on DualBiNet model Lingling Kan, Ruixuan Liu, Wenfeng Wang Regular Paper 14 May 2025 Article: 19
MMDL: a multi-modal deep learning for video highlight detection in sports Qiaoyun Zhang, Chih-Yung Chang, Diptendu Sinha Roy Regular Paper 25 April 2025 Article: 18
Multimodal scene-graph matching for cheapfakes detection Minh-Tam Nguyen, Quynh T. Nguyen, Binh T. Nguyen Regular Paper 21 April 2025 Article: 17
A CNN-transformer hybrid model and a multi-modal multi-stage training strategy for visible-infrared person re-identification Xinxin Hao, Haishun Du, Jieru Li Regular Paper 21 April 2025 Article: 16
Deep multimodal learning for time series analysis in social computing: a survey Chao Yang, Yakun Chen, Zhongwen Guo Trends and Surveys 15 April 2025 Article: 15
Multi-view learning for camouflaged object detection with PVTv2 Pu Yan, Kang Ruan, Xu Wang Regular Paper 08 April 2025 Article: 14
Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance Manh-Duy Nguyen, Binh T. Nguyen, Cathal Gurrin Regular Paper 28 March 2025 Article: 13
DMFNet: geometric multi-scale pixel-level contrastive learning for video salient object detection Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku Regular Paper 10 March 2025 Article: 12
Cross-modal alignment with synthetic caption for text-based person search Weichen Zhao, Yuxing Lu, Ge Jiao Regular Paper 10 March 2025 Article: 11
VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds Qiang Zhang, Qin Shi, Jiong Chen Regular Paper 06 March 2025 Article: 10
MFAFD: a few-shot learning method for cascading models with parameter free attention and finite discrete space Lixia Xue, Jiang Dong, Juan Yang Regular Paper 06 March 2025 Article: 9
Image forgery classification and localization through vision transformers Digambar Pawar, Raghavendra Gowda, Krishna Chandra Regular Paper 05 March 2025 Article: 8
PAMoE-MSA: polarity-aware mixture of experts network for multimodal sentiment analysis Changqin Huang, Zhenheng Lin, Xiaodi Huang Regular Paper 01 March 2025 Article: 7
Multi-task classification network for few-shot learning Zhong Ji, Yuanheng Liu, YunLong Yu Regular Paper 17 February 2025 Article: 6
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation Hao Chen, Wu Huang, Tao Zhang Regular Paper 12 February 2025 Article: 5
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval Ziyong Lin, Xiaolong Jiang, Mingyong Li Regular Paper 11 February 2025 Article: 4
Improving skeleton-based action recognition with interactive object information Hao Wen, Ziqian Lu, Jialin Cui Regular Paper 07 January 2025 Article: 3
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features Fan Yang, Nor Azman Ismail, Alhuseen Omar Alsayed Regular Paper 24 December 2024 Article: 2
STCA: an action recognition network with spatio-temporal convolution and attention Qiuhong Tian, Weilun Miao, Lan Yao Regular Paper 04 December 2024 Article: 1
Recent trends in recommender systems: a survey Chintoo Kumar, C. Ravindranath Chowdary, Ashok Kumar Meena Trends and Surveys 10 October 2024 Article: 41
Advancements in machine learning techniques for threat item detection in X-ray images: a comprehensive survey Archana Singh, Dhiraj Trends and Surveys 04 October 2024 Article: 40
Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking Rui Wang, Jiawei Zhu, Xianxun Zhu Regular Paper 03 September 2024 Article: 39
DBTSF-VSOD: a decision-based two-stage framework for video salient object detection Sandeep Chand Kumain, Maheep Singh, Lalit Kumar Awasthi Regular Paper 03 September 2024 Article: 38
Multimodal music datasets? Challenges and future goals in music processing Anna-Maria Christodoulou, Olivier Lartillot, Alexander Refsum Jensenius Regular Paper Open access 28 August 2024 Article: 37
Enhancing deep learning image classification using data augmentation and genetic algorithm-based optimization Nouara Boudouh, Bilal Mokhtari, Sebti Foufou Regular Paper 22 August 2024 Article: 36
Stratified Graph Indexing for efficient search in deep descriptor databases M. M. Mahabubur Rahman, Jelena Tešić Regular Paper 07 August 2024 Article: 35
A order-based content-based information retrieval system proposal applied in 3D meshes Thiago Kobashigawa Amorim, Helton Hideraldo Biscaro Regular Paper 02 August 2024 Article: 34
3D skeleton-based human motion prediction using spatial–temporal graph convolutional network Jianying Huang, Hoon Kang Regular Paper 29 July 2024 Article: 33
Bridging language to visuals: towards natural language query-to-chart image retrieval Neelu Verma, Anik De, Anand Mishra Regular Paper 29 July 2024 Article: 32
Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss Yang Deng, Yonghong Li, Haiyang Qiu Regular Paper 22 July 2024 Article: 31
LSECA: local semantic enhancement and cross aggregation for video-text retrieval Zhiwen Wang, Donglin Zhang, Zhikai Hu Regular Paper 22 July 2024 Article: 30
Human action recognition using an optical flow-gated recurrent neural network Davar Giveki Regular Paper 16 July 2024 Article: 29
Similarity-based face image retrieval using sparsely embedded deep features and binary code learning Abdessamad Elboushaki, Rachida Hannane, Karim Afdel Regular Paper 08 July 2024 Article: 28
DSPformer: discovering semantic parts with token growth and clustering for zero-shot learning Peng Zhao, Qiangchang Wang, Yilong Yin Regular Paper 27 June 2024 Article: 27
Adversarial attacks and defenses for large language models (LLMs): methods, frameworks & challenges Pranjal Kumar Trends and Surveys 25 June 2024 Article: 26
A spatiotemporal bidirectional network for video salient object detection using multiscale transfer learning Gaurav Sharma, Maheep Singh Regular Paper 07 May 2024 Article: 25
RDAT: an efficient regularized decoupled adversarial training mechanism Yishan Li, Yanming Guo, Yirun Ruan Regular Paper 07 May 2024 Article: 24
Strengthening attention: knowledge distillation via cross-layer feature fusion for image classification Zhongyi Zhai, Jie Liang, Junyan Qian Regular Paper 02 May 2024 Article: 23
State of art and emerging trends on group recommender system: a comprehensive review Shilpa Singhal, Kunwar Pal Trends and Surveys 02 May 2024 Article: 22
Relevance equilibrium network for cross-domain few-shot learning Zhong Ji, Xiangyu Kong, Xiyao Liu Regular Paper 29 April 2024 Article: 21
Domain-specific image captioning: a comprehensive review Himanshu Sharma, Devanand Padha Trends and Surveys 18 April 2024 Article: 20
Multi-knowledge-driven enhanced module for visible-infrared cross-modal person Re-identification Shihao Shan, Peixin Sun, Song Wu Regular Paper 16 April 2024 Article: 19
DAF-Net: dense attention feature pyramid network for multiscale object detection Divine Njengwie Achinek, Ibrahim Shehi Shehu, Xianping Fu Regular Paper 08 April 2024 Article: 18
Progressive spatial–temporal transfer model for unsupervised person re-identification Shuren Zhou, Zhixiong Li, Jianming Zhang Regular Paper 03 April 2024 Article: 17
Unsupervised graph reasoning distillation hashing for multimodal hamming space search with vision-language model Lina Sun, Yumin Dong Regular Paper 30 March 2024 Article: 16
Interactive multimodal video search: an extended post-evaluation for the VBS 2022 competition Konstantin Schall, Werner Bailer, Claudio Vairo Regular Paper Open access 26 March 2024 Article: 15
Parameter-efficient tuning of cross-modal retrieval for a specific database via trainable textual and visual prompts Huaying Zhang, Rintaro Yanagi, Miki Haseyama Regular Paper 29 February 2024 Article: 14
A voting-based novel spatio-temporal fusion framework for video saliency using transfer learning mechanism Sandeep Chand Kumain, Maheep Singh, Lalit Kumar Awasthi Regular Paper 28 February 2024 Article: 13
DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation Qingsong Tang, Yingli Chen, Wuming Jiang Regular Paper 24 February 2024 Article: 12