Image captioning is a brand-new study area in the science of computer vision. The primary goal of picture
captioning is to create a natural language description for the input image. In recent years, research on natural language
processing and computer vision has become increasingly interested in the problem of automatically synthesising
descriptive phrases for photos. Image captioning is a crucial task that demands both the ability to create precise and
accurate description phrases as well as a semantic understanding of the images.