Boosting image captioning using ConvNeXt deep neural networks

This work proposes a ConvNeXt backbone-based model for image captioning. Specifically, the ConvNeXt convolutional model, a state-of-the-art computer vision architecture, is integrated with a long short-term memory network enclosing a visual attention module. Diverse experiments were performed to eva...

全面介紹

Saved in:
書目詳細資料
主要作者: Ramos Granda, Leo Thomas (author)
格式: bachelorThesis
語言:eng
出版: 2023
主題:
在線閱讀:http://repositorio.yachaytech.edu.ec/handle/123456789/627
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!