Dense Model for Automatic Image Description Generation with Game Theoretic Optimization

Sreela S R and Sumam Mary Idicula

Information 2019, 10(11), 354; https://doi.org/10.3390/info10110354

15 November 2019

Due to the rapid growth of deep learning technologies, automatic image description generation is an interesting problem in computer vision and natural language generation. It helps to improve access to photo collections on social media and gives guid...

564 Results Found

Framework of Specific Description Generation for Aluminum Alloy Metallographic Image Based on Visual and Language Information Fusion

The Role of AI-Generated Clinical Image Descriptions in Enhancing Teledermatology Diagnosis: A Cross-Sectional Exploratory Study

Making Images Speak: Human-Inspired Image Description Generation

Visual Description Augmented Integration Network for Multimodal Entity and Relation Extraction

Recent Advances in Synthesis and Interaction of Speech, Text, and Vision

Description Generation for Remote Sensing Images Using Attribute Attention Mechanism

Generating Image Descriptions of Rice Diseases and Pests Based on DeiT Feature Encoder

Generating Scenery Images with Larger Variety According to User Descriptions

A New Generative Model for Textual Descriptions of Medical Images Using Transformers Enhanced with Convolutional Neural Networks

Video Description Generation Method Based on Contrastive Language–Image Pre-Training Combined Retrieval-Augmented and Multi-Scale Semantic Guidance

Distinguishing Human- and AI-Generated Image Descriptions Using CLIP Similarity and Transformer-Based Classification

Impact of Video Compression and Multimodal Embedding on Scene Description

Middle-Level Attribute-Based Language Retouching for Image Caption Generation

Enhanced Image Captioning with Color Recognition Using Deep Learning Methods

A Study on Generative Models for Visual Recognition of Unknown Scenes Using a Textual Description

Automatic Identification and Description of Jewelry Through Computer Vision and Neural Networks for Translators and Interpreters

Towards Generating and Evaluating Iconographic Image Captions of Artworks

Supervised Deep Learning Techniques for Image Description: A Systematic Review

Towards Mapping Images to Text Using Deep-Learning Architectures

ACapMed: Automatic Captioning for Medical Imaging

Distributed Artificial Intelligence for Organizational and Behavioral Recognition of Bees and Ants

A Unified Visual and Linguistic Semantics Method for Enhanced Image Captioning

Expanding Open-Vocabulary Understanding for UAV Aerial Imagery: A Vision–Language Framework to Semantic Segmentation

Sequential Dual Attention: Coarse-to-Fine-Grained Hierarchical Generation for Image Captioning

Visual Positioning in Indoor Environments Using RGB-D Images and Improved Vector of Local Aggregated Descriptors

Cross-Modal Data Fusion via Vision-Language Model for Crop Disease Recognition

Novel Paintings from the Latent Diffusion Model through Transfer Learning

Zero-Shot Image Classification with Rectified Embedding Vectors Using a Caption Generator

Caps Captioning: A Modern Image Captioning Approach Based on Improved Capsule Network

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning

Veg-DenseCap: Dense Captioning Model for Vegetable Leaf Disease Images

Dermatology “AI Babylon”: Cross-Language Evaluation of AI-Crafted Dermatology Descriptions

TextControlGAN: Text-to-Image Synthesis with Controllable Generative Adversarial Networks

Generalized Image Captioning for Multilingual Support

Zero-Shot Image Caption Inference System Based on Pretrained Models

Technetium Complexes and Radiopharmaceuticals with Scorpionate Ligands

Attention-Guided Image Captioning through Word Information

IAACLIP: Image Aesthetics Assessment via CLIP

Intelligent Detection and Description of Foreign Object Debris on Airport Pavements via Enhanced YOLOv7 and GPT-Based Prompt Engineering

Remote Sensing Image Semantic Segmentation Sample Generation Using a Decoupled Latent Diffusion Framework

Images of Generalized Multilinear Polynomials on Upper Triangular Matrix Algebras

Modeling of Hyperparameter Tuned Deep Learning Model for Automated Image Captioning

Exploration of Generative Neural Networks for Police Facial Sketches

Machine-to-Machine Visual Dialoguing with ChatGPT for Enriched Textual Image Description

RI-MFM: A Novel Infrared and Visible Image Registration with Rotation Invariance and Multilevel Feature Matching

Level of Agreement between Emotions Generated by Artificial Intelligence and Human Evaluation: A Methodological Proposal

A Study on Generating Maritime Image Captions Based on Transformer Dual Information Flow

aRTIC GAN: A Recursive Text-Image-Conditioned GAN

Image Generation from Text Using StackGAN with Improved Conditional Consistency Regularization