In recent years, researchers in computer vision and natural language processing have concentrated on automatically
generating natural language descriptions for videos. Video comprehension has a variety of applications, such as video
retrieval and indexing.