Boosting image captioning using ConvNeXt deep neural networks
This work proposes a ConvNeXt backbone-based model for image captioning. Specifically, the ConvNeXt convolutional model, a state-of-the-art computer vision architecture, is integrated with a long short-term memory network enclosing a visual attention module. Diverse experiments were performed to eva...
Saved in:
| 主要作者: | |
|---|---|
| 格式: | bachelorThesis |
| 語言: | eng |
| 出版: |
2023
|
| 主題: | |
| 在線閱讀: | http://repositorio.yachaytech.edu.ec/handle/123456789/627 |
| 標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|