Deep Learning Based Video Captioning through Encoder-Decoder Based Long Short-Term Memory (LSTM)

International Journal of Advanced Computer Science and Applications (forthcoming)

Abstract

This work demonstrates the implementation and use of an encoder-decoder model that performs a many-to-many mapping from video data to text captions: an input temporal sequence of video frames is mapped to an output sequence of words forming a caption sentence. Data preprocessing, model construction, and model training are discussed. Caption correctness is evaluated using 2-gram BLEU scores across the different splits of the dataset. Specific output captions are presented to demonstrate model generality over the video temporal dimension; the predicted captions generalize over video action even in instances where the video scene changes dramatically. Model architecture changes to improve sentence grammar and correctness are also discussed.
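The many-to-many mapping described in the abstract can be sketched as an encoder LSTM that consumes frame feature vectors and a decoder LSTM that emits word ids from the encoder's final state. This is a minimal, untrained NumPy sketch, not the paper's implementation; the hidden size, feature dimension, vocabulary size, and greedy decoding are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def lstm_step(x, h, c, W, U, b):
    """One LSTM step: input, forget, output gates and candidate from x and (h, c)."""
    z = W @ x + U @ h + b                      # stacked gate pre-activations
    i, f, o, g = np.split(z, 4)
    i, f, o = 1 / (1 + np.exp(-i)), 1 / (1 + np.exp(-f)), 1 / (1 + np.exp(-o))
    c = f * c + i * np.tanh(g)                 # updated cell state
    h = o * np.tanh(c)                         # updated hidden state
    return h, c

def init_lstm(in_dim, hid):
    """Random (untrained) LSTM parameters for illustration."""
    return (rng.normal(0, 0.1, (4 * hid, in_dim)),
            rng.normal(0, 0.1, (4 * hid, hid)),
            np.zeros(4 * hid))

def caption(frames, vocab_size, hid=32, max_len=5):
    """Encoder-decoder many-to-many mapping: frame features in, word ids out."""
    enc = init_lstm(frames.shape[1], hid)
    dec = init_lstm(vocab_size, hid)
    W_out = rng.normal(0, 0.1, (vocab_size, hid))
    h, c = np.zeros(hid), np.zeros(hid)
    for x in frames:                           # encoder: consume the frame sequence
        h, c = lstm_step(x, h, c, *enc)
    words, prev = [], np.zeros(vocab_size)     # decoder starts from encoder state
    for _ in range(max_len):
        h, c = lstm_step(prev, h, c, *dec)
        w = int(np.argmax(W_out @ h))          # greedy choice of next word id
        words.append(w)
        prev = np.zeros(vocab_size)
        prev[w] = 1.0                          # feed chosen word back in (one-hot)
    return words

frames = rng.normal(size=(16, 64))             # 16 frames, 64-dim features (assumed)
ids = caption(frames, vocab_size=100)
```

With trained weights, the word ids would index a vocabulary to yield the caption sentence; here the output merely demonstrates the sequence-to-sequence shape of the mapping.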
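The 2-gram BLEU evaluation mentioned above can be sketched in pure Python as the geometric mean of modified unigram and bigram precisions with a brevity penalty; the example sentences are invented for illustration and are not from the paper's dataset.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """Counter of all n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu2(candidate, reference):
    """BLEU-2: geometric mean of clipped unigram and bigram precision
    times a brevity penalty that punishes overly short captions."""
    precisions = []
    for n in (1, 2):
        cand, ref = ngrams(candidate, n), ngrams(reference, n)
        overlap = sum(min(count, ref[g]) for g, count in cand.items())
        total = sum(cand.values())
        if total == 0 or overlap == 0:
            return 0.0
        precisions.append(overlap / total)
    c, r = len(candidate), len(reference)
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(math.log(p) for p in precisions) / 2)

cand = "a man is riding a bike".split()
ref = "a man is riding a bicycle".split()
score = bleu2(cand, ref)   # high score: only the final word differs
```

In practice a smoothed, multi-reference implementation (e.g. NLTK's `sentence_bleu`) would be used; this sketch only shows what the 2-gram metric measures.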

Links

PhilArchive


Similar books and articles

Short-term prediction of parking availability in an open parking lot.Vijay Paidi - 2022 - Journal of Intelligent Systems 31 (1):541-554.
A DEEP LEARNING APPROACH FOR LSTM BASED COVID-19 FORECASTING SYSTEM.K. Jothimani - 2022 - Journal of Science Technology and Research (JSTAR) 3 (1):28-38.
The short-term/long-term memory distinction: Back to the past?Giuseppe Vallar - 2003 - Behavioral and Brain Sciences 26 (6):757-758.

Analytics

Added to PP
2024-03-08


Author's Profile

Chelsea Grimsby
ESPAM FORMATION UNIVERSITY

Citations of this work

No citations found.


References found in this work

No references found.
