Boosting image captioning using ConvNeXt deep neural networks
This work proposes a ConvNeXt backbone-based model for image captioning. Specifically, the ConvNeXt convolutional model, a state-of-the-art computer vision architecture, is integrated with a long short-term memory network enclosing a visual attention module. Diverse experiments were performed to eva...
Saved in:
| 主要作者: | |
|---|---|
| 格式: | bachelorThesis |
| 语言: | eng |
| 出版: |
2023
|
| 主题: | |
| 在线阅读: | http://repositorio.yachaytech.edu.ec/handle/123456789/627 |
| 标签: |
添加标签
没有标签, 成为第一个标记此记录!
|