our task, as noted above, is to build an image captioning model that will analyse input images and generate a short desc


Post by answerhappygod »

Our task, as noted above, is to build an image captioning model
that analyses input images and generates a short description for
each one.
An image captioning model in its simplest form is typically
composed of two parts:
Dataset:
The dataset you will use is the Flickr8k dataset, available from
a variety of online sources. It is split into a training set and
a testing set.
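As a minimal sketch of loading the captions: some distributions of Flickr8k ship a caption file in which each line has the form image name, "#", caption index, a tab, then the caption. That layout is an assumption about your download (other copies use a CSV file instead), and the function name `load_captions` and the sample file names below are made up for illustration.

```python
from collections import defaultdict


def load_captions(lines):
    """Group tokenised captions by image file name.

    Assumes each non-empty line looks like '<image>.jpg#<k>\t<caption>'.
    """
    captions = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        image_key, caption = line.split("\t", 1)
        image_name = image_key.split("#", 1)[0]  # drop the '#<k>' suffix
        # Lowercase and split on whitespace as a very simple tokeniser.
        captions[image_name].append(caption.lower().split())
    return dict(captions)


# Hypothetical sample lines, not real Flickr8k entries.
sample_lines = [
    "img_001.jpg#0\tA dog runs on the grass .",
    "img_001.jpg#1\tA dog is running .",
    "",
    "img_002.jpg#0\tTwo children play .",
]
captions_by_image = load_captions(sample_lines)
```

In practice you would pass an open file object instead of `sample_lines`, then split the resulting dictionary into training and testing images according to the split lists that come with your copy of the dataset.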
Define a custom architecture that receives images as inputs and
produces text as outputs. The goal is to generate the most
relevant and accurate captions possible.
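The post does not say how text is produced from an image. One common arrangement, assumed here, is an image encoder whose features are passed to a text decoder that emits one word at a time. The sketch below shows only the generation loop (greedy decoding). `next_token_scores` is a hypothetical stand-in for a trained decoder, and `toy_decoder`, `<start>` and `<end>` are illustrative names, not part of the assignment.

```python
def greedy_caption(image_features, next_token_scores,
                   start="<start>", end="<end>", max_len=20):
    """Build a caption by repeatedly picking the highest-scoring next word."""
    tokens = [start]
    for _ in range(max_len):
        # The decoder returns a dict mapping each candidate word to a score.
        scores = next_token_scores(image_features, tokens)
        best = max(scores, key=scores.get)
        if best == end:
            break
        tokens.append(best)
    return tokens[1:]  # drop the start marker


def toy_decoder(image_features, tokens):
    """Stand-in for a trained model: always follows a fixed script."""
    script = ["a", "dog", "runs", "<end>"]
    target = script[len(tokens) - 1]  # tokens still includes '<start>'
    return {word: (1.0 if word == target else 0.0) for word in script}


caption = greedy_caption(image_features=None, next_token_scores=toy_decoder)
short_caption = greedy_caption(None, toy_decoder, max_len=2)
```

A real model would replace `toy_decoder` with a network that conditions on the image features. Beam search is a frequent alternative to greedy selection when caption quality matters more than speed.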
You must score your model with, at minimum, the BLEU-1, BLEU-2,
BLEU-3 and BLEU-4 metrics.
However, you may also evaluate your model in other ways, such as
visual inspection and analysis of the generated captions.
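For reference, the BLEU-1 to BLEU-4 scores required above can be sketched in plain Python as follows. This is a minimal corpus-level version with uniform n-gram weights, clipped n-gram counts and the usual brevity penalty, without smoothing. The function names and the toy captions are illustrative assumptions, and an established library implementation may be preferable for reported results.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(references, hypotheses, max_n):
    """BLEU-max_n over a corpus.

    references: one list of tokenised reference captions per image.
    hypotheses: one tokenised generated caption per image.
    """
    matched = [0] * max_n
    total = [0] * max_n
    hyp_len = 0
    ref_len = 0
    for refs, hyp in zip(references, hypotheses):
        hyp_len += len(hyp)
        # Use the reference length closest to the hypothesis (ties: shorter).
        ref_len += min((abs(len(r) - len(hyp)), len(r)) for r in refs)[1]
        for n in range(1, max_n + 1):
            hyp_counts = ngram_counts(hyp, n)
            max_ref = Counter()
            for r in refs:
                max_ref |= ngram_counts(r, n)  # element-wise maximum
            clipped = hyp_counts & max_ref     # element-wise minimum
            matched[n - 1] += sum(clipped.values())
            total[n - 1] += sum(hyp_counts.values())
    if hyp_len == 0 or min(matched) == 0:
        return 0.0  # no smoothing: any zero precision gives a zero score
    log_precision = sum(math.log(m / t) for m, t in zip(matched, total)) / max_n
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity * math.exp(log_precision)


# Toy example with made-up captions for a single image.
refs = [[["a", "dog", "runs", "on", "the", "grass"],
         ["a", "dog", "is", "running"]]]
hyps = [["a", "dog", "runs", "on", "grass"]]
bleu = {n: corpus_bleu(refs, hyps, n) for n in (1, 2, 3, 4)}
```

Because no smoothing is applied, a model whose captions share no 4-grams with any reference scores exactly zero on BLEU-4, which is worth keeping in mind when judging early training runs.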