Boosting image captioning using ConvNeXt deep neural networks
This work proposes a ConvNeXt backbone-based model for image captioning. Specifically, the ConvNeXt convolutional model, a state-of-the-art computer vision architecture, is integrated with a long short-term memory network enclosing a visual attention module. Diverse experiments were performed to eva...
Wedi'i Gadw mewn:
| Prif Awdur: | |
|---|---|
| Fformat: | bachelorThesis |
| Iaith: | eng |
| Cyhoeddwyd: |
2023
|
| Pynciau: | |
| Mynediad Ar-lein: | http://repositorio.yachaytech.edu.ec/handle/123456789/627 |
| Tagiau: |
Ychwanegu Tag
Dim Tagiau, Byddwch y cyntaf i dagio'r cofnod hwn!
|